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CTTGGTTTGA TTTTAGGCAA GGTAATGGTT AATAACCCAT TTTCAAAACT AGCAGTAATA 

TGTTGCTTAT CAACAGCTTC AAAATCAAAT TGACGCATTA ATGATTCGAA GTTACGCTCA 

5 

TCTAAAATGA GTTGTTCAGA TTTGTATTTT GCGCTTCTAG TAGCTTGAAT AGTGAGCGwA 

TTAttATTGA AATCgATACT AATAtCTccC TG 

(2) INFORMATION FOR SEQ ID NO: 231: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 
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CGTCTTCTCT 


TGGTTATTCT 


CTGTGTTTTG 


TCTTTGTTCA 


AATTCGATTT 


TTTGTTGTTC 


60 


GAATGAATGT 


AATACTTTAT 


TTTTTTTTGG 


CACATAATCC 


AAGTnATTTT 


TAGGAATTAA 


120 


TATACGGTCT 


TTAAATGCTT 


CTTGTATTTT 


GCTCACAATC 


AATTGGTATA 


GTTGCTCTTC 


180 


TTTTGATAAA 


CGCACTTCTA 


GTTTTGTTGG 


ATGAACATTT 


• 

ACGTCTACTA 


AGATTGGATC 


240 


CATTTCAATA 


TTAATATAAC 


AAATCGGGAA 


CCTACCTATT 


GTTAAGAGTG 


TATGATAGCC 


300 


TTCTAAAATC 


GCTTTATTTA 


GCATAAAGTT 


TTTAATGTAT 


CGTCCATTAA 


TAAAAATAGA 


360 


AATATAGTGC 


TTATTACTTC 


TAGAATGTTC 


AGGCTTTGCA 


ACAAAACCTT 


CAATGTGATA 


420 


ATCACTTGTA 


TCTCCAGATA 


TATGTACTAA 


ATCTGGTGCA 


ACTTTCATCC 


CATAAATCTC 


480 


TGCCATCACT 


TCATTAGTTC 


GTCCTGAACC 


ATTTGTACTT 


AACATTGTTT 


TGCCATCTGA 


540 


AATGAGTGCT 


ATTCGAATGT 


CCGGATGGCT 


CATTGCCATT 


CTGTTGACAA 


TATCTGTTAT 


600 


TTTACCTAGT 


TCAGTGTATA 


AACTTTTAAT 


ATATTTTAAA 


CGTGCTGGTG 


TATTATAAAA 


660 


TAATGATTCT 


ACAAGTATAT 


CTGTTCCTTT 


TTTCGCTTTT 


GCAGGCTTAT 


GATTTAATAT 


720 


TTCACCATTT 


TCTACATATA 


TTTCATTTCC 


ATTAGCATTA 


TCCGTGCAAG TCTTCAATGT 


780 


TACTTTAGCA 


ACTGATGAAA 


TACTGGCTAG 


TGCTTCACCA 


CGGAATCCTA 


ATGTCCTAAT 


840 


ATGAAATAAA 


TCTTCATCTT 


GATCTAATTT 


ACTAGTCGCA 


TGTCTATGAA 


ATACTAATCC 


900 


TAAGTCTTCC 


GCTTCAATTC 


CGCTTCCATT 


ATCGACTACG 


CGAATAGATT 


GGACGCCAQA 


960 


tTCCTCTACT 


TCAATGCTTA 


TTTCTGTAGC 


GCCTGCATCT 


ATAGCATTTT 


CCAACAGTTC 


1020 


TTTCACAACA 


GAACTCGGTC 


TTTCAACTAC 


TTCACCTGCT 


GCGATTTTAT 


TTGCTAATGA 


1080 


GGTTTGGAGT 


TCTTTAATTT 


TCCCCATTTT 


GCAACACCTC 


TATTTTAATT 


GATTTTGTAA 


1140 
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TTGTAGTTCA ATCTCGCTTT TTTGATCATT TTCAAACJU^ TcAAATGATG 
GTCTTTTTGA GATAAAGTAT CaGTTGTTTC TTCaACACrT aAGTTTAAAT 
AATTTCCAGG TTCATTTTCG ACCATTTTTA AATTTGATAT CGATGATtTT 
ACGCTTCAAA CTCGCTTAGA ATCACTTGTG CTCTGCTAAT AACTTTTTCA 
CTAATTTCGC AACTTGAATA CCATAAtATC GTCAACTGCA CCATCTrTGA 
GAATATAAGT TCACCTTTAT ATTCATTAGC AGCGACGTGA ACATTTTTTA 
TGCTTGATCT AATGTTGTCA ATTCATGATA ATGTGTTGAA AATAACGTTT 
TGTTTCAGCT ACATACTCTA TCATTGCCTG CGCTAAAGCT AAACCGTCAT 
ACCACGTCCA ATTTCATCGA AAATAATCAA ACTATCCTCT GTTGCATAAG 
TTGTGCTTCT AGCATTTCTA CCATAAACGT ACTCTTACCT GAAACCAAAT 
ACCTATTCTA GTGAATATTT GATCAAATAT AGGTAACACT GCCTCTTTAC 
AGCTCCCATT TGGGCCATTA TACTAATTAT GGCAACTTGT CTCATATATG 
AGACATATTC GGACCTGTAA TTAAATATAT AAATGTTTCA TTATCTAATC 
AGGCACATAG TCATTATAAT CCATTACTCT TTCCACTACT GGGTGCCTAG 
TTCTAATGTT TTATTTTCAC TAAATGAAGG CCTAGTGTAA TTATATTTTT 
TGCAAAGCTC TGTAAACAAT CTAGCTCTGA AATAATTTTA GCTTGTTGTT 
AGTATATTTT TTAACTTCTT CACGTAGCTG AACAAATAAT TGATATTCTA 
TTTGTCTTCC GCACCTAAAA TGATATCTTC TTTTTCTTTA AGTTCATCAG 
TTCAGCATTC GATAACGTTT GCTTCCTCAT ATAACCAAAT TCACTTCGTT 
CAAGTTGGCA CGTGTTATTT CTATAAAATA ACCAAACACT TTATTAAAGC 
TGATTTTATT CCTGTACGTT GTCTTTCTTT GGCTTGTAAT TCTGCTAACC 
GTTTTTTGAA GCTTCAAGAT ATTCATCTAA TTGCX3TATTA AAACCAACTT 
GCCATCTTTA ACTGAAATTG GTGGTTCTTC TACTAAACTC TGTTCTAATA 
ATCATCaAGG GGTTCTAGTT GATTAACTTG TACAAGAGTA TTCTGATTCA 
TAATGCTTTA ATATTCGGTA TTTCAGAAAT GGAATGTTTA AGTTGAATTA 
ATTAACATTT CCGTAACTAA CACGCCCAAC AAGACGTTCA ATATCATACA 
ATATGTTCTT AAGGTOTCTC TTTCTATGAA ATGAGCACTA AATTCATCAA 
TCGTGCTTCA ATTTGTTCTT TACTTATTAG TGGTCTATCT ATCCATTGTT 
TGCTCCCATT GGTGTTTTCG TTTCGTCCAT TAGCCAAAGT AGCGTTCCTT 



CyTGTTCAAA 
TTTCTTGATT 
TCACCAGCAG 
GGTAAATCAG 
CTTTATGCAA 
GACTTGGTAA 
TAGCATGTGA 
ACGTTGAAGT 
TTAATGCCTT 
CATCTGCCGC 
AAGGGACATA 
TCGATTTACC 
GACAATTATT 
ATTCCACTAA 
GAGCAATTTC 
GTAAACGTTC 
ATTCGATGGC 
TTATAAAACG 
CAAAATTTTG 
TTATTTTCAA 
ATGTTTTTCC 
TGAATAGTCC 
TATCAAGTAA 
TAGAATTTAG 
AATCTCTCGC 
CTTGATTAAG 
CGATATCTAA 
TTAAGCGGCG 
TTTTTGATTT 



1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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ATCTATAGCT GCATATTGAA CAACATCCTC GATATGCGAT AAATCACGTT TTTGTGTATG 3 060 

ATGAATATAA TCTAGCAATA ATTGTGTCGC TTGATACaTT AATTTATGTT CAGTTTGATT 3120 

CACACTATAG ATTtCTGATG ATAACGTTTC CCTGACTGT 3159 



(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



IS 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 
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TAAGCGAGAA 


GCAATTGGTG 


TTATGTATGC 


TAGTGATAAA 


CCAACAGGTG 


AAAGTACAAG 


60 


GTCATTTGCT 


GTTTATTTCT 


CTCCTGAAAT 


TAAGAAATTT 


ATTGCAGATA 


ATTTAGATAA 


120 


ATAAATCATC 


CATCCATACA 


TTGATAAATG 


ATTTTyAGAA 


ATTAACAACA 


AAATCAACAA 


180 


TTTTAAACAT 


CTCTGTGATT 


CTATTTATTC 


GAAATGATTT 


AAAAAATAAA 


ACTTCAAAAA 


240 


CCTAACCTTA 


TATTTATACG 


AATACTTAGA 


GGAGCACAAA 


AATGAATAAA 


AATATAATCA 


300 


TCAAAAGTAT 


TGCAGCATTG 


ACGATTTTAA 


CATCAGTGAC 


TGGCGTCGGC 


ACAACAGTGG 


360 


TTGAGGGTAT 


TCAACAAACG 


GCTAAAGCTG 


AACATAATGT 


GAAACTAATC 


AAAAATACTA 


420 


ATGTAGCACC 


ATACAATGGT 


GTCGTTTCGA 


TAGGATCTGG 


AACAGGTTTC 


ATTGTCGGTA 


480 


AAAATACAAT 


TGTTACCAAC 


AAGCATGTCG 


TTGCAGGTAT 


GGAAATTGGT 


GCACATATTA 


540 


TAGCGCATCC 


CAATGGTGAA 


TATAATAATG 


GCGGATTTTA 


TAAAGTTAAA 


AAAATTGTCC 


600 


GTTATTCAGG 


TCAAGAAGAT 


ATTGCCATTC 


TACATGTGGA 


AGATAAAGCT 


GTTCATCCAA 


660 


AAAACAGGAA 


TTTTAAAGAT 


TACACAGGCA 


TTTTAAAAAT 


AGCATCAGAA 


GCTAAAGAAA 


720 


ATGAACGCAT 


TTCAATTGTT 


GGCTATCCAG 


AACCATATAT 


AAATAAATTT 


CAAATGTATG 


780 


AGTCAACAGG 


AAAAGTGCTG 


TCAGTTAAAG 


GCAACATGAT 


TATTACTGAT 


GCTTTCGTAG 


840 


AACCAGGCAA 


CTCAGGTTCA 


GCTGTATTTA 


ACAGTAAATA 


CGAaGTCGTA 


GGTGTTCACT 


900 


TTGGTGGAAA 


CGGCCCTGGA 


AATAAAAGTA 


CAAAAGGATA 


TGGTGTTTAT 


TTCTCTCCTG 


960 


AAATTAAGAA 


ATTCATTGCA 


GATAACACAG 


ATAAATAAAT 


CCTTACATAG 


ATAAATGATT 


1020 


ttaaaaatta 


ACAACAAACT 


CAACaATTCA 


AATCATCTCT 


GTGATTCCAT 


TTATTCGAAA 


1080 


TGATTAAAAA 


AAATAAAACT 


TCAAAAAGCT 


AACATTATAA 


TTATACAAAT 


ACTTAGAGGA 


1140 


GCAGAAAAAT 


GaATAAAAAT 


ATAATCATCA 


AAAGTATTGC 


AGCATTGGAC 


GATTTTTAAC 


1200 
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C2) INFORMATION FOR SEQ ID NO: 233: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 
TGATAAGTCA TTTAAATTGT CACCTATTGA CATGACTTCT TTCATTTCAA TCCCTAATCT 
IS TTCGGCAATT GTTTCTAGCG CAATACCTTT TTGTGCATCT GAATGCGTTA TTTCTATAT-r 

TCCTCTCGAA GATGATGATA TAGCTAAATT CGGAGAJcTCA GCTAAAATTT TACTAGCITT 
GTCAATTTTT TCTAAATTTC CATCAAATGC TAATATTTTC ATAATTAATT CACCAGGTAT 

20 

GTTTTCAATA GCATCATAAT TATCAACAAC TyTCAACGTA CCATTATCTA TGCGTCTTTG 
AATACCATTT TTAATACGCT CAACGTTTGC ATGTTGACCT GCACGCTCAG CAATATCTAT 
GTAAATGTCT AAATCTCTTT GTGGATCTTC AGTATAAATC GCACGACTCG TGTATACTTG 
ATAATAAATA CCTGCATCTT TTAAAACATT TGTAATTTTG TGTACTAACG AITTATTAAG 
GTGTGAAGTG CTCATTACAT TGAAAGTTTC ATCACGTACT TCAGCACCAT TCAAACAAAT 
ATATGGTACT GTTAAATCTG TGTCAGCAAC TGGTGCTTGk GCTTCATAAA ATGCTCGACC 
TGTCGCGATA ACAACCGTTA TCCCTTGTTC TTGAGCGTAT TTAATCGCAT CAATATTAGG- 
TTGAGAAATT TCATGTGCTG CATTAAGTAG CGTGCCATCC ATATCAGTGG CTATTAGTTT 
35 TATCATTATG TnACCTCGTT TCGTAAATnT AAAATCTTGT TCTTAAATAA GrATATATAC 

TCAGCGCACA TACTTTtCTA TTAmCATTTA TATkGTCATT aATTTATCAT ATAATGTAAT 
TCTaACAAAT nTTAAtTAGT ATGTACTATC GTCTAATTGG TGGATTTCTT ATTGGCTCTT 
AAgTTTTtAA AAAATGTTGT TAATAATGTG CTACATGCTT CTTTAAGTAC ACCTTTATCA 
ACAATTGCAC GATGATTAAA ATTAGATTGT TGCAATAAAT TCATTAAACT GCcACTACAA 
CCACCTTTAG GATCATCTGC GCCATAGACG ACTCTTGGAA TGCGACTCAT TACAATTGTT 
CCTGCGCACA TGACACATGG TTCTAAGGTT ACATATAATG TGCAACCTTC TAAACGCCAA 
CTACCTAACA CTTTGGCTGC ACGTTCAATT GCAATATGTT CAGCATGCGC CGTTGGTTGT 



GCACCTATAG GTACTTCGCC TAGTTGAGCT GCTTTTTTAG CTTCTTCAAT CGCTAATX3TC 
ATAAAATATA TATCATTTGT CATTTATGTC CAGATACCTC ACTTATGGTA CAATACTCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 



1320 
1380 
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CTATTGGCGT AGGTAAATCT TCACTTGCAC ACAAATTAAG TCAAACTTTA GATTTTTATO 1500 

AAGAAAAAGA AATCATCACA GAAAATCCAT TTTTATCAGA CTTTTATGAA GATATCTCTA 1560 

AATGGAGTTT TCAAACTGAA ATGTTCTTTT TATGCAATAG ATATAAGCAA TTTCAAGATG 1620 

TAACACAACT AAATCAAGGT GTAGTTAGTG ATTATCATAT ACATAAAAAT AAGATATTTG 16 80 

CTAAAAATAC TTTGAGTTCT GTTGAATTTC AGAAATTCAG TAAAATTTAT GATATTTTAA 174 0 

CTGAAGATAT GATTATGCCG AATATGATTA TCTTTTTAGA TGCAGACCTT GATGTGTTAA 18 00 

AATCTAGAAT TGCTAAACGT AACCGTAGTT TTGAGCATCA AATAGAaGtG AtAcTGTaAg 1860 

TTAAAAAAAG ATTATCGTGA GTATTATGAG TCCTTACAAA GTAATGGTTC AAATGTAGTT 1920 

TTAATCGATA CllACTTCTAT TGATTTTCTT AAAAATGAAC AAGATTACGA AGATATATTA 1980 

CATATTATAT TACCTATGAT AGGAGATATT ACCAATGAAT AATTACGGTA TTCCACAAAA 2040 

TGCCATTATA ACCATTGCAG GTACAGTTGG TGTTGGAAAA TCAACACTAA sGCAAGCACT 2100 

TGCAGATAAA TTAAACTTTA AAACGTCTTT TGAAAATGTC GAACATAATC CATATTTAGA 2160 

TAAATTTTAC AGCGATTTTG AACGATGGAG TTTCCATTTG CAAATTTACT TCTTAGCTGA 2220 

ACGTTTTAAA GAACAAAAGC GTATGTTTGA ATATGGTGGT GGCTTTGTCC AAGATCGATC 2280 

AATTTATGAA GATGTTGATA TTTTTGCAAA AATGCATGAA GAAGAAGGCA CAATGAGTAA 2340 

AGAAGATTTC AAAACATATT CAGACTTATT TAATGCCATG GTCATGACAC CTTATTTTCC 24 00 

TAAACCTGAT GTAATGATTT ATTTAgAATG TAACTATGAT GAGGTCATTG ATCGTATTAT 24 60 

TGAACGTGGT CGCGAAATGG AAATTAATAC AGACCCTGAA TACTGGAAAA AGCTATTTAA 2 520 

ACGCTATGAC GATTGGATTA ATAGCTTTAA TGCATGTCCA GTTGTACGTA TCAATATTAA 2580 

TGAATATGAT ATCCATAAGG ACCCCGAATC TTTAAATCCT ATGATAAACA AAATTGCTCG 264 0 

AATTATTCAA ACATATCGAC AAGTAGATAC ACGATAAAAG ACTAAAGACA TAGCGTATAT 2700 

40 GTTTATATTC AATGTATATT CCATAGATAT TATCGATTAT TTTATCAATT CTATCGAATA 2760 

CATTAATTCA CATATACACT ATGTCTTTCT TTTTAATTTA AAGCTTCTAA AATATCTGCC 2 820 

GCACTATTTA AAATAATATC AGCTTCATGT AATTCTTCTT TTGTTGCAAT ACCTGTTAAT 2880 

ACACCTATTG CCATACCTAA ATTTGCATTA CTTGCTGTCT TCATATCATT AGCAGTGTCT 2 94 0 

CCTACTATAG CTACTTTCTG AGGATCTACA TTATATTGCT CAAATAAAGG CGATAATACT 3 0 00 

TTAGGATTTG GCTTCTCATA GGCATCCGCT TCGGTAGAAA TGATCAAATC GAACAACGAG 3060 

GTAGCATTGG TATGTGCTAA AAATTGTTCT ACACCTTTTT TAGTATCACT CXTTAACAATA 3120 

CCAAGTTGAT AGCCTTTTGC TTTCAAATCG ATAAGTGCTT CTTTAACACC TTCTACCCAA 3180 
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GTATCTTGTC CCGTCACATC ATTAAATGCC TGGATAATTT GTTGTAAAGA TCCTGAACCC 


3300 




ATCACTGATT TTGGATCAAT AGATTCTTTA ATGACACCGA GTTGTCTTAA AGCAGCTTCT 


3360 


5 


TTATTATGTA CTGGGAAAGT CTCAAGCAAT 


' GATTGTACAA ATCGTACCCC TATTTTTTCC 


3420 




CAACTTCTAT 


' CAAATTCAAT TAACGTACC^a 


XV.XXXAXL.AA 


ATAATATCCA 


. TTCCATTGaT 


3480 


10 


ATCAATACTC 


CTATTTATTT ATTTCGTATT 


>%XuCX\jA1 xC 


TATGATATTC 


GTTATCCCCT 


3540 




GAAAATGAAC 


TCGTAGTATT GTTCTATTTA 




I AAATATAAT 


AATAAGTGAA 


3600 




ATCCCCTTCA 


ATACTTAACA ATAAACATTV; 




^^^^ 71 ^P^^ 1^ ^^^^ ^ f9% 

TTATTAC CAT 


GCTTCGCTTC 


3660 


IS 


ATTGAAAGGG 


ATTTTAGTCA TflATTA A r'T' f ' 




TTTTCATGAT 


TATATTCAAT 


3720 




TTTTATTAAT 


ATTTTGGTAP AAfV?APT*r^r* 


A A ^^^^^^^^ 


TATCTTCTAA 


AGTACCATTT 


3780 




TGAATACCAG 


TATAGACGTP GTATAaTT'TT' 


1 \ X AA X i i 


CACCAGTCTC 


ATTATTATTA 


3840 


20 


ATAACGATTT 


CACGATCTTP GT ATPTH a H T 


O X V- (JAVJALt 


GTGAAATAAC 


TGCTGCAGTA 


3900 




CCACTACCAA 


*^ A X X v« X o X X X v..inw v» X 


i xAitJAlATG 


ATTCGAATAA 


TTCATCGATT 


3960 


25 


GAAACGCGGC 


w X v— X X VbWAV^ X X V-inXAX X 


AAGTTTTTAG 


CTAATTCGAT 


AATAGATTTA 


4020 


CGTGTAATAC 


CAGGTAAAAT ACTGCCUkTrr 


AA W X Cx VjVj 1 \j 


TAATTACTTT 


GCCATTTTCA 


4080 




ACGAAGAAAA 


TGTTCATGCT ACCAAPTTPT 


X^^VjAXAiAl X 


TCTGTTCAAC 


ACCATCAAGC 


4140 


30 


CATAATACTT 


GGTCATAACC TAATTTATTT 


wV_>iX XA^X X 1 


GTGCTAATAA 


ACTTGcCGCA 


4200 




TAGTTACCTG 


CAACTTTTGC AAAGCCTAf*A 


w v> V. V.-A^\jaA 


CAGCACGCAC 


ATATTCATCT 


4260 




TCTACATAGA 


TTTTAGTTGG TTTTAAAGTT 


TPAOfAr^PAT 


AATATGCACC 


TGAAGGAGAT 


4320 


35 


AAAATAATTA 


ATAATTTATA CTGATGTGAT 


Gr^APPAAPGr* 


CAAGTGCCCC 


TTCTGTTGCA 


4380 




AAAACAAATG 


GACGAATATA TAATGATTGA 


X X X X 


CAGGAATCCA ATCTCTTTCA 


4440 




ATATeAACTA 


ATTGfTTTTAG CCCCTCTAAC 


X X ^ X X X 


CGTCTACTTG 


AGGCATTTCT 


4500 


40 


AATCGTGCTA 


ACGAGTTATT AAGACGCTTA 


n*%n X X X X N« X X 


CAGGACGGAA 


AAGTGCAACT 


4560 




T C CCCATCTC 


TTTTATATGC TTTTAATCCT 


TCGAATACCG 


ATTGACCATA 


ATGAACACCT 


4620 


45 


TGTGCAGCAG 


GTGAAATTTC AATAGGACCA 


TAAGGTACTA 


TCTTCAAATC 


ATGCCATCCT 


4680 


TTATCTGCAT 


CATAATCATA ACTCAACATA 


TAATCAGTAA 


AATATTTACC 


AAAACCTAGT 


4740 




TGAGATGTAT 


TTGGTTTTTG TTTTAATGTT 


TCTCGTCGTT 


CAACTTTAAC 


TGCTTGTGAC 


4800 


SO 


ATGGTGATTG 


CCTCCTAATA ATATTGTATA 


AGAATTTGTT 


TAACTTAAAT 


TATAACAATC 


4860 




CaTATTTTGC 


TGTTCAACAA ATTTTCTAAA 


AATTCAAAAT 


TAATTAACAG 


ATTTCTAGAA 


4920 




AGACTATATC 


TTTTAOTATA AACGTATTAA 


TTTCACAGAG 


ACAAGTAATC 


TGTGTTTTAC 


4980 
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TAAGTATAAT GAATAATATT AGAATTCATG CACTA G TTTA TTAAAATAAA GAGTAATTTA 5100 

AAATATCATT CCGTGTATTA AAGTGAATGG AAATGATTAG TTATTATTTT TAACAGTATC 5160 

5 TTTTTGTTCA ATAGCTTCTA ACATTAATTT AGTCATGCTC GCTAAATCAT ATTTAGGATC 5220 

AAATCCCCAT TCGCCACGTG AACAGCTTGT ATCAATAGAA TCCGGCCAAC TATTAGCGAT 5280 

ACCTTGTCTA ATAGGATCAA CATCGTAATC TAATGTAAAA TTGGGATAGT ATTCTTGAAT 5340 

10 

TGCTTCTTTT ACCATCTCTG GATCAAAACT CATTGCGCTC AAATTATAAC CATTTCTAGT 5400 

TTCTAATTTA GCX3TCGTCTG CTTCCATAAG TTTAATAATT GCTTCAATTG CATCATCCAT 5460 

ATACATCATA TCCATATACG TGCCTTTATC TATGAAGCTT GTATAATGAC CCTCTCTTAC 5520 

75 

TGCTTTGAAG TATATTTCAA CAGCATAGTC TGTAGTACCG CCACCTGGCT CTTTAACATG 5580 

CGAGATTAAA CCTGGGAATC TAACACTTCT TGTATCTACA CCAAAACGTT TGAAATAGTA 564 0 

TTGACACAAT AATTCTCCAG CTACTTTATT TACACCATAC ATTGTCGTAG GTTGCTGAAT 570 0 

CGTTACTTGT GGCGTATTAA CTTTAGGAGT TGAGTCTCCA AATGCACCAA TTGAACTTGG 5760 

TGTGAAAAAG TGCAAATTAT AAGTTCTTGC AGCTTCTAAT GCATTCATTA ATCCACCCAT 5820 

25 ATTTAAATCC CAAGCTAGAA TTGGATTTTT CTCAGCAGTT GCTGATAATA ATGCTGCCAT 5880 

ATGCATTAGA CTATCCGCTT CAAAGTCCCT AACTAACTCA AACATACGGT CACGATCTGT 594 0 

TACGTCTAAG ATTTCAAATG GTCCATTTTG TACAGGTGAG TCTGCTTCAG GTTCCCTAAT 6000 

ATCTGTAGCA AGAACATTAT CTGTCCCATA AATTTCTCTG CACTTAACAA CTAATTCTGT 606 0 

ACCAATTTGT CCTAATGCAC CAGTAATCAT AATTTTTTTC ATAGAAATAT CTCCTTTGtC 6120 

TCTTCTATAT AGCTATAGTC CATCACAAGC GGaCATAATA TTCATTTTCA TAATAATTAT 6180 

35 

AATATAAAAG CG C TTT C TTG TATATATGAC ATGTACATGT TGCTGATATk TCTGTAAATG 6240 

GAACTTCTAG TTGTATTAAT TG A TTTT A GT AATTTATAGC GTTTATTATT GCTAATTACT 63 00 

GtCAAATTAA ATTTTTTATC CCTCAACTCT TAAACTCTGG ATATCTTTCA TTATATTAGC 63 60 

40 

TTTTTTATAA CCATGGATAT CATGTAAAGC CTTATAAGCn TTAAATAATG TTTCATACCT 64 20 

TTGTACTTnT TCCGCTTCTG GATT 644 4 
45 (2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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GCCCATGAGA CAATTTTACT TGCTTTTCCC ATTGGTTATG ACGTTCTTAT TACATAGATT 
TAAACCGAGA AATATTATTC AAACGCTATT TATTGTATCG TTGATTTCTT TAGGACTTAT 
GATAGTGATT CATTTCATCA CTGGAGATAA TTCACGTGTG TATTTTGGGA CAGATACACG 
ACTGCAAACT TTATTGCTTG GTTGTATATT AGCATTTATT TGGCCTCCGT TTGCTTTGAA 
AAAAGATATT TCTAAAAAGA TTGTCGTATC ATTAGATATT ATAGGGATAT CTGGTTTTGC 
GGTTCTAATG ACTTTGTTCT TTATAGTTGG AGACCAAGAT CAATGGATCT ATAATGGAGG 
ATTTTACATT ATATCATTTG CAACTTTATT CATTATTGCA ATTGCGGTAC ATCCTTCTAG 

TTTATTTGCT AAATTTTTAA GTATGAAACC TTTACTAATT ATAGGTAAAC GATCATATAG 480 

CTTATACTTA TGGCATTATC CTATCATTGT TTTTGTGAAC AGTTATTACG TACAAGGACA 540 

AATACCGGTA TACGTTTATA TTATAGAAAT TTTGTTAACA GCGTTAATGG CTGAAATTTC 600 

20 GTATCGCTTT ATTGAAACAC CTATACGTAA AAAAGGATTT AAAGCTTTTG CATTTTTACC 660 

TAAAAAGAAG GGGCAATTTG CTAGAACAGT GTTAGTTATC CTATTATTGG TTCCGTCTAT 720 

CGTTGTGCTC AGTGGACAGT TTGATGCACT TGGCAAACAA CATGAAGCCG AGAAGAAAGA 780 

GAAGAAGACG GAATTTAAAA CAACGAAGAA AAAAGTCGTT AAAAAAGATA AGCAAGAGGA 840 

TAAGCAGACA GCGAATAGCA AAGAGGATAT TAAAAAGTCA TCACCACTAT TAATTGGTGA 900 

CTCGGTCATG GTGGATATTG GTAATGTCTT TACTAAGAAA ATACCAAATG CACAAATTGA 960 

TGGTAAAGTT GGACGGCAAC TCGTTGATGC TACACCAATT GTGAAATCGC AATATAAAGA 1020 

CTATGCTAAA AAAGGTCAAA AAGTTGTAGT AGAGCTTGGT ACAAATGGGG CATTTACGAA 1080 

AGATCAATTA AATGAACTAT TGGATAGTTT TGGAAAAGCA GACATATATT TAGTTTCTAT 1140 

TAGAGTACCT AGAGATTATG AAGGTAGAAT AAATAAATTA ATTTATCAGG CAGCTGAAAA 1200 

GCGCTCTAAT GTACATCTAG TCGATTGGTA TAAAGCTTCT GCAGGTCATC CGGAATACTT 1260 

TGCATATGAC GGTATTCACT TAGAATATGC AGGTAGTAAA GCGCTGACTC AnTGATTGT 1320 

AAAAACGATG GAAACACATG CTACAAATAA GAAATAATTT GATGCACTAA ACTTTTGAAA 13 80 

TATTACATTA CTTCTGATAT TTATTATCAA AAATGATGTA TTTCATTAAA AGTTTAGTGC 1440 

45 TTTTTTATTT TCAAATCCCA TAGTAACGGT GCAGAAAAAG TGTTGTAAAC ATTCTAATTG 1500 
GTATATTACA TTCAATGAAG CTTTATTAGG AACAGATTAC ATTATQATAA CAAAGCCCGC 
AAGACACCTA ATCTCTGTTA TAGTTTGTTT TGTCGCAAAA CTATAAAAGT TATAATTGTT 

TGCATACTAA AAAAATAAAA AATATAAAAT TTAAAATAAT TGAGTCGCTA ATGACTATAT 1680 

GTATAGAGTG TTTTGATTAT TGGGAGGATA TTTAATTATG AAAAAAATCG TTACAGCTAC 1740 
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ACAAAATAAC AATGGATATA ATTCTAATGA CGCTCAATCA TACAGCTATA CXTTATACAAT ' 186 0 

TGATGCACAA GGTAATTATC ATTACACTTG GACAGGAAAT TGGAATCCAA GTCAATTAAC 192 0 

GCAAAACAAC ACATACTACT ACAACAACTA CAATACTTAT AGTTATAACA ATGCATCTTA 1980 

CAATAACTAC TATAATCATT CATATCAATA CAATAACTAT ACAAACAATA GTCAAACAGC 204 0 

AACAAATAAC TATTATACTG GTGGTTCAGG TGCAAGTTAT AGCACAACAA GTAATAATGT 2100 

TCATGTGACT ACAACTGCAG CGCCATCTTC AAATGGTCGT TCAATTTCTA ATGGTTATGC 2160 

ATCAGGAAGT AACTTATATA CTTCAGGACA ATGTACTTAT TATGTATTTG ATCGTGTTGG 22 2 0 

TGGGAAAATT GGTTCAACAT GGGGTAACGC AAGTAATTGG GctAACGCAG CTGCATCATC 22 8 0 

TGGCTATACA GTGAACAATA CACCAAAAGT TGGTGCTATC ATGCAAACAA CACAAGGCTA 234 0 

TTACGGTCAT GTTGCTTACG TTGAAGGCGT TAACAGCAAC GGTTCTGTTC GTGTTTCAGA 24 00 

AATGAACTAT GGACATGGTG CTGGTGTGGT TACGTCTCGT ACAATTTCAG CAAACCAAGC 24 60 

AGGTTCATAT AATTTCATTC ATTAATCAAA TGTAAATCAA ATGACGTCAA TATTCTCTAA 252 0 

CATGAGAGTA TTGGCX3TTTT TGTTTTATAT AAATATAAAT GAGAGCGGTT TATTCACTGA 2580 

2S TCTTTAGGGA ACTAAGTAAT AAAGTGATAA TTTATACTAT GTCAGTATGA TTGAGAGTGA 264 0 

TTCAATTTAG ATGAAAACCA TGAAAAAATA TATTAAAACA GCATTTTTTT GTAGTATGTA 2700 

TTGGTTAATT GTTCAACTAA ATATAGCAAA TTTAGGTACA AGAATTCCTG ATAAGTATTT 2760 

TCGTCAGAAG TACATAATAT TTAAATCATT TAACTTTGAG AAGCATGGAA AATTTTGGAA 282 0 

CAAATGGTTT TACGTAAGAA AATGGAAACA TAAGATTTTA GATGGTCATC AGCTTAATCA 28 8 0 

AAATATATAT GATCAGCGTC ATTTAATGAC AATCAATACT GATGAAATTG AAAAAATGAT 2 94 0 

TATAGAGACA AAGAGGGCAG AGTTGATTCA TTGGATATCG ATACTTCCAG TCATCATATT 3000 

CAATAAAGGC CCTCGTTTAG TAAAGTATAT AAATATTTTC TATGCAATCA TAGCTAATGT 306 0 

TCCAATCATT ATTGTGCAAC GCTATAATCG ACCGAGATTA ACGCAGTTAC TACGCATATT 3120 

AAAACGAAGA GGTGAACGTC ATGACTAAAC ATATCATCGT TATTGGTGGT GGCTTAGGTG- 318 0 

GGATTTCTGC AGCAATTCGA ATGGCACAAA GTGGCTATTC GGTCTCATTA TATGAACAAA 324 0 

4S ATAATCATAT AGGAGGCAAA GTGAATCGTC ATGAATCAGA TGGCTTTGGC TTTGATTTAG 3300 

GTCCATCTAT TTTAACGATG CCTTATATTT TTGAAAAATT ATTCGAATAT AGCAAGAAGC 3360 

AAATGTCAGA CTACGTTACA ATCAAGCGAT TGCCACATCA ATGGCGTAgC TTTTTTCCAG 3420 

ATGGAACGAC TATCGATTTG TATGAAGGTA TTAAAGAAAC AGGTCAGCAT AATGCX3ATAT 34 80 

TGTCGAAACA GGATATAGAG GAACTGCAAA ATTATTTGAA TTATACAAGA CGAATCGATC 3 54 0 
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TTCATGGGCC ATTAAATGCT CTTATTAATT ATGATTATGT ACATACTATG CAACAGGCCA 
TAGACAAGCG TATCTCGAAT CCATACTTGC GACAAATGTT AGGCTATTTT ATCAAATATG 
TAGGTTCTTC ATCATACGAT GCGCCAgCTG TATTATCTAT GTTATTCCAT ATGCAACAAG 
AGCAAGGCCT TTGGTATGTA GAAGGTGGAA TCCATCATTT AGCCAATGCC TTGGAAAAGc 
tAGCGCGTGA AGAAGGTGTC ACAATTCATA CAGGTGCACG TGTGGACAAT ATTAAAACAT 
ATCAAAGACG TGTGACGGGT GTCAGATTAG ATACAGGTGA GTTTGTAAAG GCAGATTATA 
TTATTTCAAA TATGGAAGTC ATACCTACTT ATAAATATTT AATTCACCTT GATACTCAAC 
GATTAAACAA ATTAGAGAGG GAATTTGAGC CGGCAAGCTC AGGATATGTG ATGCATTTAG 
GTGTTGCTTG CCAATACCCG CAATTAGCAC ATCATAATTT CTTTTTTACG GAAAATGCTT 
ATCTCAATTA TcAACAAGTT TTTCATGAAA AGGTATTGCC AGATGATCCG ACCATTTATC 
TAGTAAATAC GAATAAAACT GATCACACAC AAGCGCCAGT AGGTTATGAA AATATCAAAG 
TCTTACCACA TATTCCATAT ATTCAAGATC AGCCTTTTAC CACTGAAGAT TATGCGAAGT 
TTAGGGATAA AATTTTGGAT AAATTAGAAA AAATGGGACT TACTGATTTA AGAAAACACA 
TTATTTATGA AGATGTTTGG ACACCGGAGg ATATTGAAAA AAATTATGGT TCTAATCGTG 
GTGCAATATA TGGTGTTGTA GCAGATAAAA AGAAAAACAA AGGATTTAAA TTTCCTAAAG 
AAAGTCAGTA TTTTGAAAAC TTGTACTTTG TAGGTGGATC AGTAAATCCT GGTGGTGGCA 
TGCCAATGGT TACATTAAGT GGGCAACAAG TCGCAGcAAg ATAAACGCGC GAGAAGCGAA 
GAATAGGAAG TGATATCTAT GAAATGGTTA TCACGAATAT TAACAGTAAT AGTGACCATG 
TCTATGGCGT GTGGTGCaTT GATATTTaAT CgTAGACATC A 
(2) INFORMATION FOR SEQ ID NO: 235: 

7 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 
TATTCGTGCG CAATGGGCTA AATTAGGTCT AGGTTTAGAT TATAGTAGAG AACGTTTTAC 
TTTAGATGAA GGTTTAAGTA AAGCAGTTAA AAAAGTTTTT GTTGATTTAT ACAATAAAGG 
AATTATTTAT CGTGGCGAAC GTATTATAAA TtGGGATCCn AAAGCACGTA CAGCTTTATC 
TGaTATTGAA GTAATACATG AAGATGTTCA AGGTGCGTTT TATCATTTTA AATATCCTTA 
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TACAGCGATT GTTGTTAACC CTAATGACGA ACGATACAAA GATGTAATCG GTAAAACTGT " 3 60 

TATATTACCA ATCGTAGGAC GCGAACTGCC TATTTTAGCA GATGAGTATG TTGATATAGA 420 

CTTCGGTTCT GGTGCTATGA AAGTGACACC AGCACATGAC CCTAATGATT TTGAAATTGG 4 80 

TCAAAGACAT CAATTAGAAA ATATTATCGT TATGGATGAA AATGGTAAAA TGAACGACAA 540 

AGCGGGTAAA TATGAAGGTA TGGACCGTTT TGATTGTCGT AAACAGCTAG TTAAAGATTT 600 

AAAAGAACAA GATTTAGTTA TCAAGATTGA AGATCATGTT CATTCTGTAG GTCATTCAGA 6 60 

ACGATCTGGC GCTGTTGTTG AACCATATTT ATCAACACAA TGGTTTGTGC GCATGGAAGA 720 

CTTAGCGAAA CGTTCATTAG ATAACCAAAA AACAGATGAT CGTATTGATT TTTATCCGCA 780 

ACGTTTCGAA CATACATTTA ACCAATGGAT GGAAAATATT AGAGATTGGA CGATTTCAAG 84 0 

ACAATTATGG TGGGGTCATC AAATTCCGGC TTGGTATCAT AAAGAAACAG GCGAAATATA 900 

TGTTGGAGAA GAAGCGCCAA CTGATATTGA AAATTGGCAA CAAGATGAAG ATGTATTAGA 960 

TACGTGGTTC TCaAGTGCTT TATGGCCtTT CTCyACGTTA GGTTGGCCTG ATTTAGAAAG 1020 

TGAAGACTTT AAACGATACT ACCCAACAAA TGCCTTAGTT ACAGGTTACG ATATTATCTT 1080 

25 TTTCTGGGTA GCACGCATGA TATTCCAAGG CTTAGAATTT ACAGATCGTC GTCCATTTAA 114 0 

TGATGTATTA TTACACGGTT TAGTTCGTGC TGAAGACGGG CGTAAGATGA GTAAATCATT 1200 

AGGTAATGGT GTGGATCCAA TGGATGTTAT TGACGAATAC GGTGCTGATA GCTTGCGTTA 1260 

30 CTTCTTAGCA ACAGGTTCAT CTCCAGGACA TGATTTAAGA TACTCAACTG AAAAAGTTGA 1320 

GTCAGTGTGG AACTTTATCA ATAAAATCTG GAATGGGGCA C G TTTCAGTT TAATGAATAT 1380 

CGGTGAAGAC TTTAAAGTTG AAGATATCX3A TTTAAGTGGT AACTTATCAT TAGCAGATAA 144 0 

ATGGATTCTA ACACGTTTAA ATGAAACGAT TGCAACAGTT ACTGATTTAA GTGACAAATA 1500 

TGAATTCGGC GAAGTTGGAC GTGCATTATA TAATTTCATT TGGGATGATT TCTGTGATTG 1560 

GTACATTGAA ATGAGTAAAA TTCCAATGAA TAGTAATGAT GAAGAACAAA AACAAGTTAC 1620 

ACGTTCAGTA TTGAGTTATA CTTTAGACAA TATTATGAGA ATGCTACATC CATTCATGCC 1680 

ATTTGTAACA GAGAAAATAT GGCAAAGTTT ACCACATGAA GGTGACACAA TTGTTAAAGC 174 0 

45 TTCATGGCCA GAAGTGCGTG AATCATTGAT TTTTGAAGAA AGTAAACAAA CAATGCAACA 1800 

ACTTGTTGAA ATCATTAAAT CTGTAAGACA ATCACXSTGTA GAAGTAAATA CGCCATTGTC 1860 

TAAAGAAATA CCTATTTTAA TTCAAGCTAA AGATAAAGAA ATTGAAACAA CTTTATCACA 1920 

AAACAAAGAT TATTTAATCA AATTCTGTAA TCCTAGTACC TTAAATATTA gCtGACX;TGG 1980 

AAAWTCCTGA GAAAGCAATO ACaTCAGTTG TAATTGCAGG TAAAGTGGTA TTACCATTAG 204 0 
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AAAGCGAATT AGATAGAGTA GATAAAAAGC TCTCTAATGA 

CTGAAAAGGT TATAAATGAA GAAAAACGTA AAAAACAAGA 

GTGTGAAGGC AAGAATTGAA CPATTAAAAG CATAGGAGTT 

AGAGCTTGTA TTGGATACAC GAAAGAACTA AATTTGGCAT 

TGGAATGGAT GCTAGCACAA TTTAATAATC CTCAAAATAA 

GTGGCACAAA TGGTAAAGGC TCTACAGTTG CTTACCTTAG 

GTTATGAAGT AGGTACATTT ACGTCGCCGT TTATTGAAAC 

TAAATGGTGT GCCAATATCA AATGACGCTA TTGTAGAATT 

TAAGTGAAAT GATGGAACGT QAAACAGATT TAGGTGTTGC 

CAGCGATGAT GTTTTTATAT TTTGGTGAAA TACATCCTGT 

CTGGATTGGG TATAAAGAAC GATTCGACAA ATGTCTTTAC 

CTAGTATCGG TCTAGACCAT ACAGATATTT TAGGTGGTAC 

ATAAAGGCGC GATTATAAAG CCTAACGTTC CAGTGATATA 

CATTAAAATA TGTTCGTGAA CGCGCAATTG AACAACATGC 

GAGAAATTGT TGTTGTATCG CAAAATGATG AATTTACTTA 

TAGAAACAAT CATTTTAAGC ATGTTAGGTG AACATCAGAA 

TAACAGCTCT TATTGAATTA AATGAACAAG GATTAATTGA 

TAGACGGTAT TGAATCAGTT CGTTGGACTG GACGTATTGA 

TAATCATTTT GGATGGCGCA CATAATTCAG AGAGTATAGA 

AACAGTACCA TGATAAAGAA AAAGTAGATA TTTTGTTCTC 

TTAA(S3AGAT GGTCAAACAT TTAAGTTTAA TTGCGCATAC 

ATTTTCCGAA AGCGTTACGC AAAGAAGAAA TTGTAGGTAG 

AATTAGTAGA TGACTACGTT GAATTTATAA AAAATTATCA 

CCGGTAGTCT GTATTTCATA AGTGAAGTTA AATCAA 

(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 7481 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxxble 

(D) TOPOLOGY: linear 



AAACTTTGTA 
TTATCAAGAA 
AGTAACAATG 
CAAACCAGGT 
CATTAAGGGT 
AACAGCTTTA 
ATTTAATGAA 
AGTATCACGT 
AACTGAATTC 
TGATTTTGTC 
ACCGGTTTTA 
TTATCTAGAT 
TGCTGTTAAA 
AAAGCCAATT 
CCGTTATAAA 
ACAAAATGCT 
ATTAGATTTC 
GCAGGTGCAT 
TGCTCTAATT 
AGCAATAAAC 
GTTTTATGCA 
TATTGAAAAT 
AGGTGATACA 



AGTAAAGCAC 
AAATATGATG 
AATTACCTAG 
GTTAAACGTA 
ATTCATGTAG 
GTTGAAAATG 
CGAATTAGTC 
ATTAAACCAG 
GAAATAATCA 
ATTGTTGAGG 
TCAATCTTAA 
ATTGCTAGGG 
AATGAAGATG 
GAATTAGATA 
GATTATGAAT 
GCATTAGCCA 
AATAAGATGA 
GACAAACCTT 
GATACAATTA 
GGAAAACCGA 
ACTGAATTTG 
GATGAAATAC 
TTAGTAATTA 



2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3516 
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TGAGTGATAG AATCAAAAAA GCCATCTCAA AAATTAATCA AGCAAACAAC ATTCCAAACA 6 0 

ATGsTCGCAA ATCACCAATG TATCACTCTC CAATTACGTA ACTATGATTT AATTTAAGCA 120 
TAGTTATTGA GGTTTTGTGA TATATAGTAT AAAATTAATG AGAATTAAAT TTAATAATGT 18O 
AAAATTCATm TTCgGGGTCG GGTGTAATTC CCAACCGGCA GTAAATAAAG CCTGCGACCT 24 0 

GCTAGTATGT ATCATATTAG TGGCTGATCT AGTGAGATTC TAGAGCCGAC AGTATAGTCT 3 00 

GGATGGGAGA AGATGGAGGT TTTTTGTTGT GCAATAATCC TCCTATTCTT ACGAGATGAA 3 60 

TGGAAGGAGA AAATTGAATA TGCAACAAAA TAAACGTCTT ATCACAATAA GTATGTTGAG 4 20 

CGCGATTGCG TTTGTGTTAA CTTTTATCAA GTTTCCTATA CCATTTTTGC CACCATACTT 4 80 

AACTTTAGAT TTTAGTGATG TACCGTCACT ACTAGCTACA TTTACGTTTG GACCAGTTGC 54 0 

CGGTATCGTA GTTGCACTGG TTAAAAATTT ATTGAACTAC TTATTTAGTA TGGGCGATCC 600 
20 AGTTGGACCA TTTGCTAACT TTTTAGCAGG CGCAAGTTTC TTATTAACTG CTTACGCCAT 6 60 

CTATAAAAAT AAACGTTCAA CAAAATCTTT GATTACTGGA TTAATCATTG CAACAATCGT 720 
TATGACTATC GTGTTGAGTA TTTTGAACTA TTTCGTTCTA TTACCTTTGT ACGGTATGAT 780 
ATTTAACTTA GCTGATATCG CAAATAATCT TAAAGTAATC ATTGTTTCAG GAATTATACC 84 0 

ATTCAATATT ATTAAAGGTA TCGTTATTTC TATTGTATTT ATTTTACTAT ATAGAAGGCT 900 
TGCX3AATTTC TTGAAAAGAA TTTAATCAAA TTAAAGCAAA ATAATATACA CATAATAATA 960 

AAAAGCAGGT GACTATCAAT AAACGATAGC TTGCCTGCTT TTTCTATAGA ACATTTGTCT 102 0 

AAAAAATCAA TTATTCaAAT TTTAAAGCGT CCCCATCAAA TGATTCGTCT GCAATTTTAA 1080 

TAGAATCTGt AGGGCATCCa TCAATTGCAT CTTCCATATC TTCATATAAT TCCtCAGGTA 114 0 

CTTCTGCAGT ACCTTGGTTA TCGTCAAGGA TTACGAAAGC AATACCTTCX3 TCGTCGTAAT 1200 

CATMTATATC TGGCGCTGCT GCACCGcATG CACCACATGC AATACAAGTA TCCATATCAA 1260 

40 CGATTGTATA TTTTGCCAAT GTCITCGCCT CCTTTGATAA AAATGCTAAA ATAGTAATGT 1320 

GACTAAAATT TTAGACAGCA TCATTTTTAT TTTCAAATTA TCOGTTTTAC AGAGTGAGGG 13 80 

TTAAATTTGC AACACATTAT AAAAACAGCA TTACAACAAA CATTTAACTA TAAAACAAAT 14 40 

AAAAGTATTT ACAATATCTT AGTTGGTAAG AAATCTCACC AAACCTTTTT TGACGCTTGT 1500 

AGTCAACAAC AGTTGTCATT ATATCACAGT TTACCACTAT TAAAATATCC GTCTTTTGAG 1560 

CTATTTtTAG AAAAAATCAA TGAATTTAAT GCTGAAATGG AAATCATGTT GCATCCTAGA 1620 

SO 

TATACATTTG aAAGCATGGG GCAAACATTT CAAGCAATTC AACTATTAGT GCAAaCCATG 1680 

TCTAATACCA AACAACATGT TTTTCATTTT GTACCAATCT CTCAAAATAA TAAGATACAm 174 0 
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AATGAACTAC ATAACTrGTT TAAAGCAATA ACTTTAAAAG GGCCATGTTA CTTACATTAT 
TATTTGCAAG GCTATGATGA ACCAATGTAT ACGAGACAGC AAGTTAGTTT AATAGAAAAG 
CTATCTCAAC AGCAATTGTT TGAATACGAA ATGAATAATT TAGTGACAAT GATGTTTGAA 

TTAGAAAGTG GAGAATATAC TATTTTATCA AAAATAATAA TGAAACCTAC ATTATTAAAT 204 0 

CAAACTTATA TTACTTATAC AAAATTGCTT GAACAATTCA CGATGGAAGA TATAGCGGCT 2100 

CAACAACAAG TTAAAATCAA TACTATCGAA GATCATGTAC TTGAAATCTT AATCAAAGGT 2160 

TACATGTCTA ATTACGATGA TTATGTTGAA CTAGAAGATC AACTCCAGTT TTTGAATTTT 2220 

TATCAACAGC ATCGTGGCGA ACGATTAAAA TTTTACAAAG AACAATTTGA CACGTTATCA 2280 

TATTTTCAAT TAAAAGTATT AATCGTTGGA TTTGAAAGAG GTGATCTGAA TGTTGCATGA 234 0 

TATTTTACGA AACAAATTTG GATTCGAGAG TTTTAAACCX3 GGACAACAGG AAATTATAGA 2400 
AAGTATAATG TCTCAACAAC ACACTCTAGG TATACTTCCA ACTGGAAGTG GAAAGAGTTT 
GTGTTATCAA ATACCTACGT ATTTATCAGG TAAGCCGACA TTAATTATCT CACCGTTAAT 
ATCTTTAATG GATGACCAAG TTATGCAGTT GAAAATAAAT GGAGAAAAAC GTGTAACATG 
TATTCACTCT GGTATGGATG AAATTGAGAA AAAGCATAAT ATTAAATGTT TACGACATAG 

CCGCTTCATC TTTCTAAGTC CAGAATTTCT CCTGCAACCG TCAAATTTTA AATTAATATC 2700 

TATGATAGAC TTTGGCATGA TTGTTCTAGA TGAAGCACAT TGCCTATCTG AATGGGGATA 276 0 

TGATTTCAGA CCACATTATG CTCTAATAGG AAAAGTAACA AAGCATTTTA AAGAAGCGGT 2820 

TGTCTTAGCA TTGACAGCAA CTGCACCACC GCATTTACAA GATGATTTGA CGGAAATGTT 288 0 

AGCGATTCAA TTCAATGTTA TTAAAACTAC AATGAATCGC CCAAATATAA GCTTTAAGCA 294 0 

TCTTAATTTT CATGATGATG AAGATAAAAT TGAATGGTTG CTGCCGTTTC TACAACAGTC 3000 

GGGAGCAACG ATTATTTATG TCTCATCGAA AAAGATGTGT CTGAATTTAG CGCAACTTAT 3060 

^0 TTATCATTCA GGTTTTCTTA CAGGTATTTA TCATGGTGAT ATGAATTATC AAGAGCGACA 3120 

CACAGTTCAA CAACAATTTT TAAATAATGA TATTCCGATT ATAGTCGCAA CGAGTGCTTT 3180 

TGGTATGGGA ATTAATAAAA AAGATATTCG CACAATCATT CACTTTCATC TTTCAACAAG 324 0 

TCCTTCTAAC TACATTCAAG AAATTGGCCG TGCGGGTCGC GATGGTGAAC TAAGTCAGGC 33 00 

AATTAGTTTA TTCCAACXX3G ACGATAAATA TATTTTAGAA ACGTTATTAT TTGCAGATAT 33 60 

GATAACAGAA GAAGATGTAC AAAATTTCGA AATAGGAGAA TTTTTAGCTC CCGATAAACA 34 20 

AGCCGTTTTG ACAACGTTGc AATCATTCTA TAGTATCGGC GCCTTGaAAC AGATATTTAA 34 80 

GCAATCATTT AAACGAAAGC AATTAGGATT CTTTCGCATG ATTGGCTATT GCAAATTGGA 354 0 
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ATGTTGTGAC AATGATTCTA ATATAACTGA TATCXKIAATT TTAAATAAGA AGAAGGTAAT 366 0 

TAGAAGTATT GGATTTGATG AAAAGTTGCA AAATTTATTT CTCAGATAGT ATTACTTTAC '3720 

TAAAAGAAAA TTGACAAGCT ATAATTAGTG TATACACAAT TGAAAAATGA TTGAAATAAT 3 780 

TTTGAAAAAT ATACATAAAC ATATGTCATG TGGGTATATT TTATGTAAAA TCATTGTAAT 3 84 0 

AGAATAGAAA GGAAGATGGC TATGTCTAAT AATTTTAAAG ATGACTTTGA AAAAAATCGT 3 900 

CAATCGATAG ACACAAATTC ACATCAAGAC CATACGGAAG ATGTTGAAAA AGACCAATCA 3 960 

GAATTAGAAC ATCAGGATAC AATAGAGAAT ACGGAGCAAC AGTTTCCGCC AAGAAATGCC 4 020 

CAAAGAAGAA AAAGACX3CCG TGATTTAGCA ACGAATCATA ATAAACAAGT TCACAATGAA 4 080 

TCACAAACAT CTGAAGACAA TGTTCAAAAT GAGGCTGGCA CAATAGATGA TCGTCAAGTC 4140 

GAATCATCAC ACAGTACTGA AaGTCAAGAA CCTAGCCATC AAGACAGTAC ACCTCAACAT 4200 

GAAGAGGAAT ATTATAATAA GA_ATGCTTTT GCAATGGATA AATCACATCC AGAACCAATC 4260 

GAAGACAATG ATAAACACGA TACTATTAAA AATGCAGAAA ATAACACTGA GCATTCAACA 4320 

GTTTCTGATA AGAGTGAAGC TGAACAATCT CAGCAACCTA AACCATATTT TACAACAGGT 4 3 80 

GCTAACCAAT CAGAAACATC AAAAAATGAA CATGATAATG ATTCTGTAAA ACAAGATCAA 4440 

GATGAACCTA AAGAACATCA TAATGGTAAA AAAGCAGCAG CTATTGGTGC TGGAACAGCA 4 500 

GGTGTTGCAG GTGCAGCTGG TGCAATGGCT GCTTCTAAAG CTAAGAAACA TTCAAATGAC 4 560 
GCTCAAAACA AAAGTAATTC TGGCAAGGCG AATAACTCGA CTGAGGATAA AGCGTCTCAA . 4 620 

GATAAGTCTA AAGATCATCA TAATGGCAAA AAAGGTGCAG CGATCGGTGC TGGAACAGCA 4680 

GGTTTGGCTG GAGGCGCAgC AAGTAAAAGT GCTTCTGCCG CTTCAAAACC ACATGCCTCT 4740 

AATAATGCAA GCCAAAACCA TGATGAACAT GACAATCATG ACAGAGATAA AGAACGTAAA 4 800 

AAAGGTGGCA TGGCCAAAGT ATTGTTACCA TTAATTGGAG CTGTACTAAT TATCGGTGCA 4 860 

40 TTAGCGATAT TTGGAGGCAT GGCATTAAAC AATCATAATA ATGGTACAAA AGAAAATAAA 4 920 

ATCGCGAATA CAAATAAAAA TAATGCTGAT GAAAGTAAAG ACAAAGACAC ATCTAAAGAC 4 980 

GCTTCTAAAG ATAAATCAAA ATCTACAGAC AGTGATAAAT CAAAAGAGGA TCAAGACAAA 504 0 

GCGACTAAAG ATGAATCTGA TAATGATCAA AACAACGCTA ATCAAGCGAA CAATCAAGCA 5100 

CAAAATAATC AAAATCAACA ACAAGCTAAT CAAAATCAAC AACAGCAACA ACAACGTCAA 5160 

GGTGGTGGCC AAAGACATAC AGTGAATGGT CAAGAAAACT TATACCGTAT CGCAATTCAA 52 2 0 

TACTACGGTT CAGGTTCACC GGAAAATGTT GAAAAAATTA GACGTGCCAA TGGTTTAAGT 5280 

GGTAACAATA TTAGAAACGG TCAACAAATC GTTATTCCAT AATATAACTA TATAAATTGT 534 0 
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TAAATTGCGC TTATAAGTAT GTAGCGGTTT TTTCATTTTT CAAAGTTTGT TATTTAACAA 
GGTCTTGTCT CGAATATTGG CATATCAATT TAACTTTTTA AATAGTCATC AHMJiGATAA 
AACACCACAA TCAACAAATT TAACGAGGAA GAATAAAAAA TAAATCAACA TATTAAATTG 
TAGTGTTATT CAACTCCGTA GCTAACAATT CTCTATTCAC ATTAAACAAA TTGTCAAAAA 
TATATCATAA ATCTTCAAGC ACAGACTTAG CGCATCAATC ACTGAACTGT TATAATAGTT 
TGGGATTAAA GGAGGCCGAA ACAATGCAAA AAGTTGAAAG TATCATAATT GGTGGAGGGC 
CATGCGGATT AAGTGCGGCT ATTGAACAAA AAAGAAAAGG TATTGATACC TTAATTATTG 
AAAAGGGTAA TGTCGTTGAA TCAATCTACA ATTATCCTAC TCACCAAACA TTTTTCTCAT 



CAAGTGATAA ATTAAGTATT GGGcGAgTAC CGTTTATCGT TGAAGAAAGT AAACCAAGAC 
GTAATCAAGC GCTAGTTTAT TACCGAGAAG TTGTAAAACA TCATCAATTA AAAGTAAATG 

20 CATTTGAAGA AGTATTAACT GTTAAAAAAA TGAATAATAA ATTTACTATT ACTACGACGA 

AAGATGTTTA TGAATGTCGA TTTTTAACAA TCGCGACAGG CTATTATGGT CAGCATAATA 
CATTAGAAGT TGAAGGTGCG GATTTACCTA AAGTGTTCCA TTATTTTAAA GAGGCACATC 

^5 CGTATTTTGA TCAAGATGTT GTAATTATCG GTGGTAAGAA TTCGGCTATC GATGCTGCTT 

TGGAGTTGGA AAAAGCTGGT GCTAACGTGA CGGTTCTATA TCGTGGTGGA GATTATTCGC 
CTTCAATTAA ACCGTGGATA CTTCCAAATT TCACAGCATT AGTAAATCAT GAAAAAATTG 

30 

ACATGGAATT TAATGCTAAT GTTACCCAAA TAACTGAAGA TACTGTGACT TATGAAGTAA 
ATGGTGAAAG TAAAACGATA CACAATGATT ATGTATTTGC GATGATTGGT TATCATCCCG 
ATTATGAATT TTTAAAATCT GTAGGCATTC AAATTAATAC AAATGAATTT GGAACAGCGC 6 540 

CTATGTATAA TAAAGAAACA TACGAAACAA ATATCGAAAA TTGCTATATT GCAGGTGTAA 6600 
rrGCTGCAGG GAACGATGCG AATACCATTT TTATTGAAAA TGGTAAATTC CACGGGGGCA 6660 
TTATTGCTCA AAGCATGCTA GCTAAGAAAC AAACGCCCTT AGAATCATAA AAATAAAGGT 6720 
CTATGTACTA AATAACTTAG TTTTACAACG ACTGACATTC ATGATATGTC AAATGAGGTT 6780 
GATGACTATT GATTGTACAT AGACCTTTTT ATGTTACGTA TTCATTATAA TTCAAAATAT 6840 
45 GATTTGATTT CAGCTTTATC TAAATTGTTG CTTAACGCGA CTAATAATTT TAATCTTGCT 6900 
TTTGGACCAT TCAAGCCGTT AGAAAAAATA AAACCTTGTT GTGCGAGTTG GTAACCACCA 6 960 

CCATCGTATG CGTAAGTTGG ACTCACAATA CCATTAAAGG AACGTGAAAC TAGCACAATA 7020 
GGTATATTTA AAGATACTAA TTGTTGAATG CCTTCTAATG CGCTTGGAGG TATGTTGCCT 
TGTCCTAACG CTTCAATAAC CATACCATCG ATACCTTCTC GACTATAAAA ACTAAAAATG 



5940 
6000 
6060 
6120 
6180- 
6240 
6300 
6360 
6420 
6480 



7080 
7140 



55 
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TCCAATGCTT GTTGGCGATA TGGCATATGA TGGAATTGCA CACGATCCTT TGTCAATACA 7260 

CCTAGCGGAC CATGATTTGG ACTTTGAAAT GTGTTTGTAT TAGACGTATG T G TTTTGGTA 7320 

ACATTACGCG CCGTATGAAT TTCATCATTA AaTACAACCA TCACGCCTTT ATGACGGGCC 7380 

TTTTCATCAG AGGCAACGCG AATAGCGGAA ATATAATTAT ATAATCCGTC AGAACCAATT 7440 

TCATTAGACG AGCGCATTGG CCAAGAAnTA ACAACAGGTT G 74 81 



10 

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 



35 



40 



45 



ATGAATTATA 


CCTTCCACTA 


GAAGTGTCGG 


TATTTATGTG 


CCTGGTGGTA 


AAGCAAGTTA 


60 


TCCTTCAACG 


GTATTAATGA 


CAGCGACGTT 


AGCACAAGTA 


GCAGGTGTGG 


AAAATATTGT 


120 


CGTTGTGACA 


CCACCTCAAC 


CTAATGGTGT 


GTCTCAAGAA 


GTATTAGCAG 


CATGCTATAT 


180 


TACGCAAGTT 


GATCAAGTAT 


TTCAAGTTGG 


TGGTGCTCAA 


AGTATTGCTG 


CGTTAACTTA 


240 


TGGCACAGAA 


ACGATACCTA 


AAGTTGATAA 


GATTGTAGGT 


CCAGGCAATC 


AATTTGTTGC 


300 


ATATGCCAAG 


AAATATTTAT 


TTGGACAAGT 


AGGTATTGAC 


CAAATAGCTG 


GACCAACAGA 


360 


AATAGCACTT 


ATTATTGATG 


AAACAGCAGA 


TTTAGATGCC 


ATCGTATATG 


ATGTTTTTGC 


420 


ACAAGCAGAA 


CATGATGAAT 


TAGCACGTAC 


ATATGCCATT 


AGTGAAGATG 


CGCAAGTCCT 


480 


TAAAGATTTA 


GAATCACGTA 


TTGCTAAAGC 


ATTGCCTAAT 


GTGGACAGAT 


ACGACATTGT 


540 


TTCTAAAAGT 


ATCGCTAATC 


AACACTACCT 


TATCCATGCT 


AGTAATTTTG 


ATGAAGCATG 


600 


CCATGTCATG 


AATACAATCG 


CGCCTGAACA 


TGCGTCGATT 


CAAACAGTAA 


ATCCTCAACC 


660 


ATATATTGAA 


AAAGTGAAAT 


ATGTGGGTGC 


ATTGTTTATT 


GGACATTATT 


CGCCAGAGGT 


720 


CATAGGAGAT 


TACGTTGCAG 


GTCCAAGTCA 


TGTATTACCT 


ACAAATAGAA 


CAGCTAGATT 


780 


TACCAATGGG 


TTATCGGTCA 


ATGATTTCTT 


AACACGGAAC 


ACGGTCATCC 


ATTTATCAAA 


840 


AGATACGTTT 


GAACAAATTG 


CTGATTCAGC 


ACAACATATT 


GCTCATGTTG 


AAGCATTATA 


900 


CAATCACCAG 


CAGTCTATTT 


TAATACGTCA 


GTCTTAGGGG 


AGTGTAATTG 


AAATGATTTA 


960 


TATTGATAAA 


AATGAAAGTC 


CAGTTACGCC 


GTTGGATGAA 


AAAACAATGA 


CGTCTATTAT 


1020 


TAGTGCAACg 


CnATATAATT 


TATATCCTGA 


TGCAGCATAT 


GAACAATTCA 


AGGAAGCTTA 


1080 



55 



1017 



10 



25 



EP0 786 519 A2 

G^TTCJKPAJKG TTAATGCTGA TCATGCCAGA AC3GTCCGGCA TTAACGCTAA ATCCTGATTT ' i20( 

TTTTATGTAT CAAGCATATG CGGCACAAGT AAATCGTGAA ATTGCATTTG TAGATGCAGG i26( 

ATCAGATTTA ACGTTTGATT TGGAAACCAT TTTAACGAAA ATCGATGAAG TACAACCATC 132( 

ATTTTTTATT ATGAGTAATC CACATAACCC TTCAGGCAAG CAATTTGATA CGGCATTTTT 13 8( 

AACAGCTATT GCAGATAAGA TGAAAGCATT AAACGGATAC TTTGTCATTG ATGAAGCATA 144c 

TTTAGATTAT GGTACGGCAT ATGACGTGGA ACTGGCACCA CACATCTTAA GAATGCGTAC 15OC 

ATTATCAAAG GCGTTTGGAA TTGCCGGCTT AAGATTAGGT GTCTTAATTA GTACTGCTGG 156C 

AACGATAAAG CATATTCAAA AAATAGAACA TCCATATCCA TTAAATGTAT TTACGCTAAA 162 C 

TATTGCGACT TATATTTTTA GACATAGAGA AGAGACAAGA CAATTTTTAA CGATGCAACG 16 8 C 

ACAGTTAGCT GAGCAGTTAA AACAAATATT TGATACACAT GTTGCAGATA AAATGTCAGT 174 0 

20 GTTCCCATCA AATGCTAATT TTGTACTTAC TAAAGGCTCA GCAGCGCAAC AATTAGGACA 1800 

ATACGTATAT GAACAAGGAT TTAAACCTCG CTTTTATGAT GAGCCGGTGA TGAAGGGCTA i860 

TGTAAGATAC TCAATTGCAA CAGCATCACA GTTAAAGCAA TTAGAAGAAA TTGTTAAAGA 1920 

ATGGAGTGCA AAATATGATT TATCAAAAAC AACGAAACAC AGCTGAAACG CAACTAAATA 1980 

TTTCAATATC TGATGATCAG TCACCATCGC ATATTAATAC AGGTGTGGGC TTTTTAAATC 204 0 

ATATGTTAAC CTTGTTTACA TTTCATAGCG GTCTGTCATT AAACATTGAG GCACAAGGTG 2100 

ATATTGACGT AGATGATCAC CACGTAACTG AAGATATCGG CATTGTCATT GGCCAATTGT 2160 

TACTTGAAAT GATTAAAGAT AAAAAGCATT TCGTTCGTTA TGGAACGATG TACATTCCAA 2220 

TGGATGAAAC ATTAGCACGT GTCGTTGTGG ATATAAGTGG GCGCCCATAC CTATCATTCA 2280 

ATGCATCATT AAGTAAAGAA AAAGTTGGTA CGTTTGATAC GGAGTTAGTA GAAGAATTTT 234 0 

TTAGAGCGGT CGTAATCAAT GCAAGATTAA CAACGCATAT TGATTTAATT CGTGGAGGTA 2400 

ATACACACCA TGAAATTGAA GCTATATTCA AAGCX3TTTTC CCGTGCATTA GGCATAGCGC 2460 

TAACTGCAAC TGATGATCAG CGTGTGCCGT CATCGAAAGG TGTGATTGAA TGATTGTCAT 252 0 

CGTTGATTAT GGATTAGGGA ATATTAGTAA TGTAAAACGC GCTATTGAAC ATTTAGGGTA 2580 

45 TGAGGTGGTT GTCTCAAATA CCTCAAAAAT AATCGATCAA GCAGAAACAA TCATATTGCC 264 0 

CGGTGTCGGC CATTTTAAAG ATGCGATGTC AGAGATAAAA CGATTAAATC TCAATGCAAT 2700 

ATTGGCTAAG AATACTGATA AGAAGATGAT TGGTATTTGT TTAGGCATGC AATTAATGTA 2760 

TGAGCATAGT GATGAAGGCG ATGCATCTGG ATTAGGGTTT ATCCCAGGAA ATATTTCGCG 2820 
TATCCAAACA GAATACCCAG TGCCACACTT AGGCTGGAAT AATTTAGTGA GTAAGCACCC 



35 



40 



50 



2880 



55 
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AATTGCATAT GCGCAGTATG GGGCTGATAT TCCGGCAATT GTTCAATTTA ACAATTATAT 3000 

TGGTATTCAA TTCCATCCTG AAAAAAGCX3G TACATATGGG TTACAAATTT TGCX3TCAGGC 3060 

AATACAAGGG GGATTTATAA ATGATTGAAT TATGGCCAGC GATTGATTTG ATTGGGTCAA 3120 

CAAGTGTGAG GTTAACAGAG GGTAAATATG ATAGTGAAGA AAAAATGTCA CGCTCGGCTG 3180 

AAGAAAGTAT TGCTTACTAT AGTCAATTTG AATGTGTGAA TCGTATTCAT ATCGTCGACT 324 0 

TGATAGGTGC TAAGGCACAG CATGCCCGAG AGTTTGATTA TATTAAGTCA TTGAGGAGAT 3 3 00 

TAACAACCAA AGATATTGAA GTAGGTGGTG GCATTCGTAC GAAGTCACAA ATCATGGACT 3 3 60 

ACTTTGCCGC AGGGATTAAT TATTGCATAG TTGGAACGAA AGGTATTCAA GATACTGATT 3420 

GGCTTAAAGA GATGGCACAT ACATTTCCAG GTCGCATTTA TTTATCTGTT GATGCCTATG 34 80 

GAGAAGATAT TAAAGTGAAC GGATGGGAAG AGGACACAGA GTTAAATTTA TTTAGTTTTG 354 0 

20 TGAGACGGTT ATCGGATATA CCTCTTGGCG GCATTATATA TACTGATATT GCTAAAGATG 3 60 0 

GCAAAATGTC CGGACCTAAC TTTGAATTAA CTGGTCAATT AGTAAAGGCA ACGACGATTC 3660 

CTGTCATTGC TTCCGGTGGT ATTAGACATC AGCAAGATAT TCAACGATTA GCGTCGCTAA 3720 

25 ATGTTCACGC TGCTATTATA GGAAAGGCTG CACATCAAGC ATCTTTTTGG GAGGGGCTAA 3 78 0 

AATGATTAAA AAACGTATCA TTCCATGTTT AGATGTCAAA GATGGTCGTG TCGTTAAAGG 3 840 

GATTCAATTT AAAGGATTAA GGGATATTGG GAATCCTGTT GATTTAGCAA TGTATTACAA 3900 

TGAAGCGGGT GCTGATGAAT TAGTATTTTT AGACATCTCT AAGACGGAAG AGGGTCATAG 3 96 0 

CTTAATGCTA GAAGTGATTG AACAGACAGC GTCACGCTTG TTTATCCCTC TTACTGTAGG 4 02 0 

GGGTGGGATT CAAAGTCTCG ATGATATTAC CCAATTGCTA AATCATGGTG CAGATAAAGT 4 080 

ATCATTAAAT TCAAGTGCTT TAAAAAATCC ACAGCTCATT AAACAAGCGA GTGATAAATT 4140 

CGGT^GACAA TGCATCTGCA TAGCAATTGA TAGCTATTAT GATCCTGAAA GAAAAGCACA 4200 

TTATTGTTGT ACGACTGGTG GTAAAAAAAT GACAAATATT AAAGTATATG ACTGGGTACA 4 260 

GCAAGTAGAA CAGTTAGGTG CAGGTGAGCT CCTCGTTACA AGTATGGGAC ATGATGGTAT 4320 

GAAACAAGGC TTTGATATTG AACACCTAGC AAATATTAAG TCTCTTGTAA ATATTCCAAT 4 380 

45 CATTGCTTCT GGTGGTGGTG GCAATGCACA ACACTTTGTA GAATTATTTG ATCAGACGGA 444 0 

TGTTTCTGCA GGTTTAGCTG CAAGTATATT ACATGATCGA GAAACGACGG TTCAATCTAT 4500 

TAAAGAAGTG ATACGGCAAG GGGGTATAGC AGTAAGATGA CCAAATATAA AATTGATTTT 4 560 

^0 AGCAAAGGTT TAGTGCCAGC AATTTTACAA GATAATCAAA CAAAACAAGT ATTGATGTTG 4 620 

GGTTATATGA ACCAAGAAGC TTTTGATAAA ACGATAGAAG ATGGTGTGGT ATGTTTCTAT 46 80 
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4800 
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4920 
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5040 
5100 
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AAAGATATTC ATGTAGATTG CGACAATGAC ACTATTTTAA TTGATCTCAT ACCAAATGGA 

CCAACATGTC ATACAGGCAG TCAAAGTTGT TTCAACACAG AAGTTCCATT TTCAGTGCAA 

ACATTAGCGC AGACAGTTCA AGATAGTGCC CAATCCAATA ATGAAAAGTC ATATACAAAA 

TATTTATTAA CAGAAGGTAT AGAAAAGATT ACAAAAAAAT ACGGTGAAGA AGCTTTTGAA 

GTCGTAATTG AAGCAATTAA AGGTGACAAA AAAGCATTTG TAAGTGAAGT AGCAGATGAA 

CTTTATCATT TATTTGTCTT GATGCATGCG CTTGGCGTCG ATTTTTCAGA AATTGAGGCG 

GAA1TAGCGC GTAGACATCA TAAGCGCAAT AACTTTAAAG GTGAACGACA AAATATCGAA 

CAGTGGTAAA GCAAGTATGG ACTAAGATAT AAGGAAAAGG ATCATGGCTT ATACACTTAC 5220 

AAATATTGTG GAAAACGTGA CATTTTCAAG TTTAAAATAC GACACCAACA TATTITAACT 5280 

ATGAATGCTG TGATGGTACT AAAGTTGCGA ACTCX3TTATA GATAAGTAGT GGATAATCAC 534 0 

AATACGAAAT CAAAAATAAT TATAAAAAGT AAATTGAGCA ACTCAGGAAT AGATGTCACT 

GTTAAAGATG TCGAAAAGTA TATGAATCGA TATAATGAAG TTATGAAGGG AAAAAATGGC 

GAAPAAGCTA AAGAGTTATG TTTGTCGTTA CTACCTATTA ATATCATAGT TGTCTTTACA 

25 TTCTTTGTAT TTATACTATA AATACAAATA TATCTAGCCT GAAATAGAAA TGTCATAGCC 

TATTTAAAAG ACAATCTCCA TTAGAACTAA GATATGCATC CCGAAAGTTA GACTAAAAAA 

CTAACTTTAT GGGATGTATT TTTATGCTAA TCATCATAAA TTCGAGATTA AGTTAAAGGT 

AGTTCAAGAG TAATTAAACA ATAAATwAAA AATAGTAGGA TACTTACTTT GAGGGAAGAA 5760 

AATTAACTGT ATATATTTAG TTTAGGAACA AGTATTACGG TTTATCCTGA TACAATTATT 5820 

GTGGATGGGA TGATATTTTT AGGTTTAAAA TACGACACCA GCAAACATAA TAACTGTAAT 5880 

AGCTCATAAA TCTCCCCATA TAGCTAATCT AAAAAAATAA TACATCATTG GAATTAAGCC 5940 

CCAAQCATGT AAATATTAAA AATCAAAAmA GATATmTGTA AAAaAGTTAC AATTtGCATA 6000 

ATTAAATTGT GTCTAATTAT TGACTAATTA AATTTTGCCA AATATAATAT TAATTAATAA 6060 

TTTGaAATGA TTAGCGTATA CACTTTAAAT TCTCTTTGGA GAATATATTT TTTAAATACA 6120 

AATGTAAACG CTTTCTCGTC AAATTAAACA ATAGAAAGGA TGGTCATTAT GAGTGCTTGG 6180 

TTAAGTAAAT TATTTGAGTT TATTCCTCGA ATAATTATCA ATTTGTTTAT CTAAAATAAA 624 0 

AAAATAGAGG TGCTGACAAT GATGAAAAGT CAAAATAAGT ATAGTATTCG TAAATTTAGT 63 00 

GTAGGTGCAT CTTCCAnTT AATAGCTACA TTACTATTTT TAAGTG 634 6 

50 (2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3775 base pairs 

55 



5400 
5460 
5520 
5580 
5640 
5700 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

TGATAATGAT TCTCATTGTC ATACATCACG AAGGAGGCTA ATTAGTCAAT GAATAAAGTA 60 

ATTAAAATGC TTGTTGTTAC GCTTGCTTTC CTACTTGTTT TAGCAGGATG TAGTGGGAAT 120 

TCAAATAAAC AATCATCTGA TAACAAAGAT AAGGAAACAA CTTCAATTAA ACATGCAATG 180 

GGtACAACTG AAATTAAAGG GAAACCAAAG CGTGTTGTTA CGCTATATCA AGGTGCCACT 24 0 

15 GACGTCGCTG TATCTTTAGG TGTTAAACCT GTAGGTGCTG TAGAATCATG GACACAAAAA 3 00 

CCGAAATTCG AATACATAAA AAATGATTTA AAAGATACTA AGATTGTAGG TCAAGAACCT 360 

GCACCTAACT TAGAGGAAAT CTCTAAATTA AAACCGGACT TAATTGTCGC GTCAAAAGTT 420 

" AGAAATGAAA AAGTTTACGA TCAATTATCT AAAATC13CAC CAACAGTTTC TACTGATACA 4 BO 

GTTTTCAAAT TCAAAGATAC AACTAAGTTA ATGGGGAAAG CTTTAGGGAA AGAAAAAGAA 54 0 

GCTGAAGATT TACTTAAAAA GTACGATGAT AAAGTAGCTG CATTCCAAAA AGATGCAAAA 600 

GCAAAGTATA AAGATGCATG GCCATTGAAA GCTTCAGTTG TTAACTTCCG TGCTGATCAT 660 

ACAAGAATTT ATGCTGGTGG ATATGCTGGT GAAATCTTAA ATGATTTAGG ATTCAAACGT 720 - 

AATAAAGACT TACAAAAACA AGTTGATAAT GGTAAAGATA TTATCCAACT TACATCTAAA 780 

GAAAGCATTC CATTAATGAA CGCTGATCAT ATTTTTGTAG TAAAATCAGA TCCAAATGCG 840. 

AAAGATGCTG CATTAGTTAA AAAGACTGAA AGCGAATGGA CTTCAAGTAA AGAGTGGAAA 900 

AATTTAGACG CaGTTAAAAA CAACCAAGTA TCTGATGATT TAGATGAAAT CACTTGGAAC 960- 

TTAGCTGGCG GATATAAATC TTCATTAAAA CTTATTGACG ATTTATATGA AAAGTTAAAT 1020 

ATT(»AAAAC AATCAAAATA ATTAAGGAGT TTTACGATGC TACTTAAACC AAAATACCAA 1080 

40 ATCGTTATTG CTGGTTTATG TCTTGCAATA GTAGCTATCT TAAGTTTAAT GATTGGAAAT 1140 

ACGCTTGTGT CACCAGGTAC GGTGATACAG GCGTTATTCA ACTTTGATAG TGAAAACGAT 1200 

TTACATGATG TTGTCACTGG TGCACGGGCG TCGAGAACAA TCATTGCGTT ATTGACTGGT 1260 

GCTGCCCTTG CTGTCTCAGG TTTGTTGATG CAAGCACTTA CACGAAACCC AATAGCCTCA 13 20 

CCAGGGCTTT TCGGTGTCAA TGCAGGCGCA GTATTTTTTG TCATTTTTAG TATTACATTT 13 80 

ATCCAAATTC AATCTTTTAA AATGATTGTA GTTATTGCAT TTTTGGGGGC TATTGTTGTT 14 4 0 

SO 

ACTGTATTAG TT G TTGCACT AGGTATGTTT AGACAAACAC TATTCTCACC TCACCGTGTC 1500 

A TTTTGGCAG GTGCTGCGAT TGCGATGCTA TTTACAGCCT TTACTCAAGG CATACTTATT 1560 
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AATATTTGGG ATATCCCATG GATTATTCCG CTTGTATTGA TACTTATTrr AATTGCATTT 
AGCATGGCTG CACACATCAA CATCTTGATG ACAAGTGACG ACATTGCAAC CGGCCTCGGT 
CAAAACATAA AATTAATCAA ATGGATGATT ATTATGCTCA TCAGTATGTT AGCCGGTAI-T 
TCGGTAGCCG TAGCTGGATC AATCGTCTTT GTGGGTCTTA TCGTACCGAA TATTAGCAAA 
CGAT-TATTAC CACCAAACTA TAAGTATTTA ATTCCTTTTA CTGCATTAGC TGGAGCAATC 
CTAATGATCA TTTCAGACAT TGTTGCTCGT ATAATAATTA AGCCACTAGA GTTGCCTATC 
GGTGTCGTTA CCGCTGTCAT TGGCGCTATT GTCTTAATCT ATATTATGAA GAAAGGACGT 
IS CAACGCTTAT GACCGAAAAG ATTAATAAAA AAGACAATTA CCATCTCATC TTCGCGTTAA 

TCTTTTTAGC CATCGTTTCA GTGGTAAGTA TGATGATTGG TTCAAGCTTT ATACCATTAC 
AACGCGTACT GATGTACTTT ATAAATCCAA ATGACAGTAT GGATCAATTC ACTTTAGAAG 
TATTACGCTT ACCTCGCATT ACACTTGCGA TTTTAGCAGG TGCCGCACTA GGAATGAGTG 
GTTTAATGTT GCAAAATGTA TTAAAAAATC CAATTGCCTC ACCTGATATT ATCGGTATCA 
CAGGTGGTGC TAGCTTAAGT GCTGTTGTCT TTATTGCATT TTTCAGCCAT TTAACAATAC 
25 ATTTACTTCC ACTATTTGCA GTATTAGGTG GCGCAGTTGC AATGATGATA CTATTAGTGT 

TTCAAACGAA AGGACAAATA CGCCCGACAA CACTCATAAT CATCGGTATT TCGATGCAAA 
CGTTGTTTAT TGCGCTTGTC CAAGGATTAC TCATTACAAC GAAGCAATTA TCTGCTGCCA 
AAGCTTATAC ATGGCTAGTC GGAAGTCTTT ACGGTGCTAC GTTTAAAGAT ACAATCATTT 
TGGGTATGGT TATTTTAGCT GTTGTGCCGT TGTTAITTCT TGTTATACCA AAAATGAAAA 
TATCTATACT TGATGACCCT GTAGCGATTG GCTTAGGCTT ACATGTACAA CGTATGAAAC 
TAATCCAATT AATCACTTCT ACTATACTCG TATCTATGGC AATCAGTTTA GTAGGTAACA 
TTGGGTTTGT CGGTTTAATC GCACCACATA TCGCGAAAAC AATCGTTCGC GGAAGTTATG 
CTAAAAAGTT ACTAATGTCA GCAATGATTG GTGCCATATC AATTGTTATT GCAGACTTAA 
TTGGGCGTAC CTTATTCTTG CCTAAAGAAG TGCCAGCAGG TGTATTTATT GCTGCTTTTG 
GTGCCCCTTT CTTCATATAC TTATTATTAA CCGTGAAAAA GTTATAACGA TATTATTAAA 



30 



35 



40 



1680 
1740 
1800 
I860 
1920 
1980 
2040 



3180 
3240 



TTTTTATACA TATAGTTGTT GTTATTGTTA ACAAGCGTCG ACTTTCTTAA TTACATATTA 
ATACTTTATA TACAAATAAC ACCGACTCAT ATTCTATAAT ATCAATCAAT ATTCTTCGAT 
so TTTTCAAATA TCGATAACTA TTTCTTATTT AAATATAGTG TTTGATAATG TCATTTATTC 3300 
AAAAACACAA ATTTTAATAA AAATATCATA TTATTTTTAA TTGTAAATTA TGGATTATTT 3360 
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ATATAATAAC CATGACAAAA TTAGAGATTT TATAATCATT GAAGCATATA TGtnTCGTTT 34 80 

TAAGAAAAAA GTCAAGCCTG AAGTCGATAT GACTATAAAA GAATTTATAT TACTGACTTA 3540 

^ TTTATTTCAT CAGCAAGAAA ACACACTTCC ATTTAAGAAG ATTGTTTCAG ATTTATGTTA 3600 

TAAACAATCG GATTTAGTAC AGCATATAAA AGTACTTGTG AAACATTCAT ATATTAGTAA 3660 

AGTTCGAAGT AAAATTGATG AGCGTAATAC TTACATTTCA ATATCTGAAG AACAACnAGA 3720 

70 

nAAAATTGCA GAACnTGTTA CATTGTTTGA TCAAATCATT AAACAATTTA ACCTT 3775 

(2) INFORMATION FOR SEQ ID NO: 2 39: 

(i) SEQUENCE CHARACTERISTICS: 
'5 <A) LENGTH: 1361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 







GCCTACCTAT 


GCAAGTTGTT 


ATTGCATTAG 


TGTTAGGTGT 


TGTCGTAGGA 


60 


2S 


CTTTTATTAT 


ATGGACAAGA 


AAACGTAGCA 


AATTACATTA 


AACCATTTGG 


TGACGTATTT 


120 




TTAAATTTAA 


TTAAAATGAT 


CGTTATACCA 


GTTGTATTTT 


GCTCACTAGC 


GCTTTCTATT 


180 




TCGAACGTTG 


GGGAATCGAA 


AACTGTAGGG 


CGTTATGGCT 


GGaAAACAAT 


TTTATACTTT 


240 


30 


GAAATTATTA 


CAACAATCGC 


AATAGGTTTA 


GGGATTATCT 


TCGGTAACCT 


ATTTAAACCA 


300 




GGTGCTGGAT 


TAGACCCAAC 


AAAATTACCT 


AAAGGTGATA 


TTTCTAAATA 


TCAATCAACT 


360 




GCACATGCAG 


CAGAACAATC 


TACATATGGa 


AATCATTTTA 


TTGATACCAT 


TGTACATATT 


420 


3S 


ATTCCGACAA 


Acrrrrri'GA 


AGCTTTAAAT 


AAGGGTGAAT 


TATTACCTAT 


TATCTTCTTC 


480 




GCAGTATTCT 


TTGGATTAGG 


ATTAGCTGCT 


GTAGGTAAAA 


AAGCAGAACC 


AGTTAAAGAA 


540 


40 


TTTTTAAGCG 


GATOGCTTGA 


AGCTGTGTTC 


TGGATGATTA 
ACAATTATTA 


ATAAAATTTT 
CATTTGGTGC 


AAAATTAGCA 
ATCCGCATTA- 


600 
660 




CCACTTGGAG 


TGTTTGCATT 


CATTTGTACT 




TTACCACTAT 


TAAAATTAGT 


ATTAGTTGTT 


GTCTTTGGAA 


TGGTGTTCTT 


TGTATTCGCT 


720 


45 


ATACTAGGAC 


TAGTTGCATG 


GATGTGTGGT 


ATTAATATCA 


TGAATATTAT 


TAGAATCTTG 


780 




AAAAGTGAAT 


TGCTTTTAGC 


ATTTTCTACA 


TCAAGTTCGG 


AAGCTGTACT 


TCCTGTAATG 


840 




ATGAAGAAAA 


TGGAAAACTT 


CGGTTCTCCA 


AAAGAAATTA 


CTTCTTTTGT 


TATACCAATT 


900 


SO 


GGTTATACGT 


TTAACTTAGA 


TGGATCAGCA 


CTTTATCAAT 


CTATTGCAGC 


ATTATTCGTT 


960 




GCACAGATGT 


ATGGAATGCA 


CTTAACATTA 


TCAGAGCAAA 


TTGTGTTGAT 


GTTAACATTA 


1020 



55 



1023 
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10 



20 



30 



35 



40 



45 



ACAirrAGGTO CCATGGGCTT ACCC3GCACAA GGTTTAGCAT TAAT-TAOTCG TGTTGACCGT 
ATCTTAGATA TGGTACGTAC ATGTGTAAAC GTTATTGGTA ATGCATTATC AACAATCGTT 
ATAGCTAAAT GGGAAAACGT ATATGACAAA GCAAAAGGTC AAGAATATTT AAAATCAATT 
TAAAAAATAC TATCTGACAT TTAArGnCCC TTACAACCTT TGGTTgTnAG GGCTnTTTTA 1320 
TGTCATGCGT CTTAAAGCCA GGCCGTATAn CGGTAAGCGT A 13^1 
(2) INFORMATION FOR SEQ ID NO: 24 0: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1140 
1200 
1260 



60 
120 
180 



360 
420 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 
TCAATATGTA AATCGGCGAT TGTCCCTATC CTCATACCCT TACCTTCTTT ACTCAAATTA 
AAATGTTATT TATTATGCCA TaGCTTATCT AATATATATA GTTAACTGCT TCATTTAGAT 
25 GATTATTTTA TATTTTTGCA TAAAAACTTA TATCTTTTCA AAAAAATCGA TAAGTTTTAG 

TTATCATACC CTTACCTATC AAATGTTTTT TCTTATATTT APAMJ^ATIiA TTGCTTTATT 24 0 

AAATGGATTT CTTTAGTATT TATAATTAAG AAAACGCTTA CACACAACTT TTTTATTTGC 300 
TTTATCCTGA GGAGGAAAAT TATGGCAAGA AAATTGCATA GAGAGTTGAA TAACAGACAC 
ATCCAATTAA TAGCAATTGG GGGCGCAATT GGAACTGGGT* TATTCCTAGG ATCAGGTCAA 
ACAATATCTT TAACTGGTCC ATCACTGTTA TTCACATACA TGATTATTGG GGTTGTACTA 480 
TTCGCTTTTA TGCGCGCATT AGGCGAATTG TTGTTGAGCA ATACAAGATT TAATTCATTT 54 0 

GTTGATATTG CAAATGAATA TTTAGGCCCT TTTGGTGGCT TTGTCATTGG CTGGACTTAC 
TGGTTATGTT GGATTGTATC AAGTATGTCA GACCTAACTG CGATGGGACA ATACTTTGCA 
TTTTGGTATC CACAAGTCCC AAATTGGATT ACCGTGCTAT TTATTCSTTTT AATCTTGATT 720 
AGCTTCAACT TATTAGGTGC CAGATTATTT GGTGAACTGG AGTTTTGGTT CTCGATTATT 780 
AAAGTTGTCA CAATTATTGC GATGGTTATC GTTGGTCTTG TATTAATCTT TTTCTCATTT 84 0 

AAAACACATT ATGGACATGC ATCATTCACA AACTTAATCA GTCACGGTGG CATGTTCCCT 900 
GGTGGAaCAT TTGGTTTCTT AATGTCATTC CAAATTGCTG TATATTCATT CATTXSGTATT 960 
5^ GAACTTATAG GTGTAaCTGC TGGTGAAaCG AAAGATCCTG AAAAAACCTT ACCGAAAGCA 1020 

attaataatg tacctatccg tattttatta ttctatatcg gtggtctatt agtaattatg 



600 
660 



1080 



55 
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TTAATCGGCG 


TACCATTTGC 


AGCAGGTGTC 


GTTAACTTTG 


TCGTGCTAAC 


TGCCGCGGCC 






TCTGCTACAA 


ATAGTGGTAT 


CTATTCGAAT AGTCGTATCT 


TATTCGGACT 


GTCACAACAA 


•A. ^ O U 


5 


GGGTTAGGTC 


CTAAAGTTTT 


AAATAAAACG 


AATAGTCATG 


GCGTGCCTTA 


TTTATCAATG 


^ w « V/ 




TTAGTTTCAT 


CAATTGCATT 


ACTTATAGCA 


GCCTTGTTAA 


ACTACATTTT 


CCCTAATGCA 






ATTCAACTAT 


TCATATACGT 


TACAACGTTA 


tCAACTGTGT 


TGTTTTTAGT 


TGTtTGGGCA 


1440 


10 


ATGATnATTG 


TCGCTTATCn 


AATGTATTTG 


GAAAAAGCAT 


CCTGAGGCA 




1489 




(2) INFORMATION FOR SEQ ID NO: 241: 








IS 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








20 


(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 








TTTTCCATCA 


TrATcACCAT 


TTAAGACAAT 


AGCTGTATGA 


CCATATCCTC 


CACCAAATGT ' 


60 


25 


TCCACTAAAA 


ACCACTAAGT 


CCCCAGGTTC 


AGCTTTGAAA 


GTTGGTGTGT 


TGTGATAAAT 


120 




TTTAGCTTCA 


CTATTAAAAT 


TATTTGCATA 


TGGTATATCT 


TTAGCTCCAT 


ATCCTTTtAA 


180 




TCCAkGACCA 


TAAAGAkGAT 


TCCAATATAC 


ATTAACTAAA 


TCGAAACATT 


GCCATCCATA 


240 


30 


ACTACCGTCG 


AAATCCCATC 


CTCTATTCTC 


TAATGTTTTC 


AAATAATTTA 


ATGTACTGCT 


300 




ATTACTATTA 


CTTTTATTAT 


TTGAAGACAC 


TGTTTTTGGT 


TTTGGTTCTA 


CTAATGGTGT 


360 




CATTGGCACT 


TTTAATTTTT 


GACCAATAAA 


TATTAAATTA 


GGATTTGCTA 


TATTATTTGT 


420 


35 


ATTTTGAATA 


TTTGAAACTG 


TAGTTTTGTA 


TTTTAATGCT 


ATAGCACTAA 


GTGTGTCTCC 


480 




rrrr^TACA 


GTATAGATTT 


GTGTTTTTGG 


AGCTTCTCTA 


AAACTGTAGT 


AACCAAAGTA 


540 


40 


ATTATTAGTA 
TTTAATTTCT 


ACTTTATTCG 
TTAGTAAGTT 


TTTTCTTCTG 


ATTAGAATTT 


TGAGCTTCCA 


AGTTTGCAAT 


600 




CATTGTTATT 


AATAACTAGA 


TTGTTACCTT 


GGCTTGAGTT 


660 




TTTCGGAGTA 


TTTGAAATCT 


TTATATCTTG 


ATTAATTTCA 


TTTCCGTTTG 


AAATTGCTGA 


720 


45 


TTTGTTGTCT 


AACTTTAAAC 


TTGTGTCCGA 


TGTTTTAACA 


GCACCTTCAT 


I'l'i'i'l'ATTTT 


780 




GTCTTTTGTC 


GTATTTTi'AT 


TAGCATTTAA 


CTCTGATTTC 


GCGAATACAT 


TTTGCTCATA 


840 




CCCTCTTGTA 


AAATCTTTAG 


ATTTATCAAT 


TTCATCTGCA 


TATGCTTTGT 


TCGACATACC 


900 


SO 


CAATGCCAAA 


AACATACCTA 


TTGAAATTGA 


CAAAATTCCA 


ATACTAACTT 


TTCTAATTGA 


960 




ATAGCGTACT 


TTACTTTGTT 


GTTTATTCAT 


GATGAAACAC 


TCCTTACAAT 


AATATACAAA 


1020 



55 



1025 
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10 



30 



3S 
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GTAGGAGGTA AAATAATTAA CTTGTCrTTC CAAAATATGA AAAGTGTACT AAAaTTCATC 
GCACGACAAA TAGCCCATTT CCGATACTTT TATAAAGTAT GGAaTGGGCT ATAGCCATTT 
ATATCATCTT TTAACTTTAT TTATTAACAG TTAATAATGA TTCATAAATA CCTGCTTCTT 
TAGCAGCTTC AATTAATGTT GAACCAATTT CTGAAGGTGT TGCCGCTGTT TTCACACCAC 
AACTATTTAA TGTTTTAATT TTCTCTTCAG CAGTACCTTT ACCACCTGAA ATGATTGCAC 
CAGCATGTCC CATACGTTTT CCAGGAGGTG CTCTTTGTCC ACCGATAAAG CCTACAACTG 
GTTTTGTCAT ATTCGCTTTA ATCCATTCAG CTGCTTCTTC TTCAGCCGTA CCACCGATTT 
CACCAATCAT AACAACTGCT TTCGTTTCGT CATCTTCATT GAATGCTTTT AAAACATCAA 
TAAAGTTTGT TCCGTTGACT GGGTCTCCAC CAATACCAAC AGCTGTAGTT TGACCAATAC 
CTTCTTCAGT CAATTGGTGC ACTGCTTCAT ATGTTAATGT ACCTGAACGA GATACTACAC 
CAACATGACC TTTTTTGTGA ATATAGCCAG GCATAATACC AATTTTACAT TCATCTGCTG 



IBOO 
1860 . 
1920 
1980 
2040 
2100 
2160 
2220 



TAATCACACC TGGACAGTTC GGACCAACTA AACGTGTTTT TCTACCTTGT AAGTAGCGTT 
TAACTTTAAC CATGTCTAAT ACAGGAATAT GTTCAGTGAT ACAAATAACC ATATCTAAGT 
25 CTGCATCAGC TGCTTCTAAA ATTGAGTCTG CAGCAAATGG TGCTGGAACG TAAATGACTG 

AAACCGTTGC CCCAGTTTCA TTTTTAGCTT CTTCAACAGT GTTGAAAACA GGAACGCCTT 
CAACAACTTG ACCACCTTTA CCAGGCGTCA CACCTGCTAC TATTTTCGTA CCATAATCAA 
GCATTTGTTT TGTATGGAAA AGGGCAGTAG ACCCTGTAAT ACCTTGTACC ATTACTTTAG 
TATTCTTATC TATAAATACA CTCATCTTAG TGCTCCCATC CTTTCCTTAT GCTTCnTCA 
CTAGTTTAAC AATnTTTGT GCACCTTCAG CCATTGTTGC TGCTGGTTCA ATTGCTAATC 
CTCAGTCTTT TAAGATTTTT TTACCTAACT CAACATTTGT ACCTTCTAGG CGTACAACTA 2280 
GTGCTAAAGT TAAATCTACT TCTTTTACAG CTTCAACGAT ACCTTCTGCX; ATAACATCAC 2340 
ATTTCATAAT GCCACCGAAA ATGTTTACAA AAATACCTTT AACATTTTCA TCACCTAAAA 24 00 

40 

TGATTTTAAA TGCTTCAGTT ACTTTTTCTC TAGTAGCGCT TCCGCCTGCA TCTAAGAAAT 2460 
TGGCTGGGTT TCCACCGAAA TGATTAATCG TATCCATTGT TGCCATGGCT AAACCTGCAC 2520 
CATTAACCAT ACATCCGATG TCACCATCTA ATGCAATGTA TGATAAATCA TCTTTAGACG 2580 
CTTCAATCTC TTTCGGATCT TCTTCTTCTA AATCACGTAA TTCTACAACA TCTTTATGTC 2640 
TGAATAATGC ATTATCATCA AAATTAATTT TAGCATCTAA TGCCAATACA TCACCATCAG 
so CTGTTGTAAC TAATGGGTTG ATTTCTACGA TTGAACAATC TTTTTCAATC AATACATTAT 

AAAGTGCTAA TAAGAATTTA GCAGCTTTGT TAACAGATTC TTTAGGAATA TTAATATTAA 



2700 
2760 
2820 



55 
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AGATCTTTTC AGGAGTCTTC GCAGCAACTT 
CCATCAATGT TACTTGGTCA GTCGCACGAT 
5 GAATAGCACA ACCTTCTTCG ATATATAAAC 

GATGTGTCAC CAAAGTTTTC CCTAATAATT 

ATTTAGCAAT TTTTACTCCG CCTGCTTTAC 

10 

CAACATAAAC ATCAGAATTT AATTCTTTTG 

CTACTCGTCC TTCTGGAACT GCAACGCCCA 

CGTGGATATT CATCTTCCAT CCTCCTGTTA 

75 

AATGTAAGCG CTATTGTAAA CTTAAATGCT 
TTTACAGTAA CGATTTTATA GGTTCAAAGC 
CATCGATTGC TAGTAAATGT TGTTTGGTAC 

20 

CAGGATAATC TTTAGATAAC TGTGTCATAT 
TTGCAGCTGC AATGGACACA CTTCTTGCAT 

25 CATTATCAAG CGTCATCGCG TCTATCAATA 

TAGCTCGCTG CATGGCGATT TGAGTAGCTT 
GtGTCGCGAT CCCATATGCA AAAGCAGTAA 

30 GACGTTTCGT AACAGGTACT TTTTTCGAGT 

TTAAAATTGT TGCGCATGCA ACGACTGGAC 
TCCCACAAAT AATAGCATTA GGATGCTCTT 

^ CAACATACTT TTCTTTTAAA GCTTGTTCTT 

CATTTTGAAC ACCTTTTCGC TCATCTAAAA 

TATTAACCGC ATTAATCAAC TGCGTAACTT 

40 

TCAGTCATAT CTTTAAAAAT ATCAAAACAA 

TAAATAATCA GTTCAATGAC TGCTTCGTAA 

ACGTTTTTTC CCTATCGCAT CAAACCACGC 

45 

ATTATAATGT GACTTTAATC GCGCTAAATC 
GGCAACTTCA TCTAAGTGCA CAATACTATC 
ACCGACTTCT TCATCTTCAA ATTTAGGCCA 

SO 

TGCATTACCA ACTTTAATCC ATTGTTGTTG 
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CTTCAATCTC AGTGCCCCCT TCTTCAGACG 2940 

CAATAACGAA TCCAACGTAA TATTCTTTTT 3 000 

GCTTAATTTC TTTACCTTCT GGACCAGTTT 3060 

CTTTTGCATA TGTTTCTACC TCAGATAAAG 3120 

CTCTACCTCC AGCATGAATT TGTGCTTTTA 31 BO 

CTTTCTCCAC CGCTTCTTCA GCAGTAAATG 3240 

TTGAACGAAA TATTTCTTTTA CCTTGATACT 3300 

CTTAGGTTAA GTTCCCTTAC AATTATAAAA 3360 

ACTTTTTTAT CATTTAATTG AATTTTACX3A 3420 

TTTTTCTATG CTCTTTCATA ATGCCAATAT 34 80 

CGTAACCCGC GTTTTTTTCA AAACCATATT 3 540 

AATCATCACG AAAAACCTTT GCCATGATAC 3 600 

CACCCTTGAT TAAAGATACT TGAGGCAGTG 3660 

AATGCGTTGG TTGTACTGAT AATCCATCAA 3 720 

TATAAATATT AAATTCATCT ATTTCTTnCA 3 780 

CTTCATTTTT TAGTGCTTCA TTTAATTCTA 3 84 0 

CATCAAGGCC CAAATAATTG TGATTTGAAT 3 900 

CTGCTAAAGG TCCTCTTCCA ACTTCATCAA 3960 

TTAATATTTC ATTTTCAAAG TAAGTCATTT 4 020 

TTTCTAACGC TTTTCTGCGC CTAGCTATGG 4080 

AGCATTCATG ATTTTCTAAT TCTTCTATTG 414 0 

CTTTAATTGT TAGCGTCATT TGCTAATTCC 4200 

TAATTTCCTA TTTTAGCATT TCGAATATCA- 4 260 

TCAATTTCAT TACCACGTCG AATTAAGCCc 4 320 

TATGATTTCT GCAtCTTCAG GAACTTCAAT 4380 

ATTTTGAATT AAAAAGTTTA ATCCATAGAT 444 0 

TTTTATCGCA CCAGTTAAAC TCAACTTCTT 4500 

AAGTATCCCT GGTGTGTCTA ATAGTTGTAA 4 560 

TTTGGTCACA CCTGGTTTAT TACCAGTCTG 4620 



55 
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AACGATCATT GCTCTTATCG 


CTCTAGGTTT 


AAGTCCTTTC 


GCTTTTTCGC 


GTTCAAATTT 


4740 




TTCAGCAGTC GCCTTAATTG 


CTGCAGCTTC 


CACTTTCTTT 


AAATTTTTAC 


CGTGCTTAGC 


4800 


5 


ATCCACTGAT ACAGGATAgT AtCCTTTATC AATAAAAAAT 


tGTTCCCATT 


TTGACATCTC 


4860 




ATTTAAATTA GACATATCTT 


TTTTATTTAA 


TATAACAACA 


CGTGGTTTTT 


GGTTAATAAC 


4920 


10 


TTCATCTATC ATAGGGTTTC 


TTGAACTATA 


TGGAATTCTT 


GCATCTACTA 


GTTCAAACAC 


4980 


TACATCTACT rrrriTAATT 










5000 




(2) INFORMATION FOR SEQ ID NO: 242: 








IS 
20 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 








AACCCGnAGA CGAAACTnCA 


TAGTTGCCTG 


TTATGTTGtAT 


TAGTGCGTTT 


GTAGCTATTT 


60 


25 


niW\ATCAAAC ATTATTAAAT 


ACAGCGTTaC 


CTAGTATAAT 


GAGAGAATTA 


AATATCAATG 


120 




AAAGTACATC GCAATGGCTA 


GTTACTGGGT 


TTATGCTTGT 


TAATGGCGTC 


ATGATACCTC 


180 




TGACGGCATA TCTAATGGAT 


AGAATTAAAA 


CTAGACCTTT 


ATACTTAGCG 


GCGATGGGGA 


240 


30 


CATTTTTATT AGGTTCTATT 


GTTGCAGCCT 


TAGCTCCGAA 


TTTTGGAGTT 


TTAATGTTAG 


300 




CTCGTGTAAT TCAAGCGATG 


GGTGCAGGCG 


TACTTATGCC 


CTTAATGCAA 


TTTACGTTAT 


360 


3S 


TTACATTGTT CAGTAAAGAA 


CATCGAGGTy 


TtGCAATGGG 


ACTAGCAGGT 


TTAGTAATTC 


420 


AATTTGCACC AGCAATAGGA 


CCTACAGTTA 


CAGGATTAAT 


TATTGATCAA 


GCGAGTTGGC 


480 




* GAGTT CCATT TATTATAATT 


GTAGGAATTG 


CTATACTTGC 


CTTTGTTTTC 


GGTTTGGTTT 


540 


40 


CAATCTCGAG TTACAATGAA 


GTGAAATATA 


CGAAATTAGA 


TAAGCGTTCA 


GTAATGTATT 


600 




CAACTATTGG GTTCGGGTTA 


ATGCTATACG 


CATTTAGTAG 


TGCAGGAGAT 


TTAGGATTTA 


660 




CAAGTCCRRT AGTAATAGGT 


GCGTTGATAT 


TAAGTATGGT 


TATTATCTAT 


TTATTTATAC 


720 


45 


GTAGACAATT TAATATTACT 


AATGCACTTT 


TAAATTTAAG 


GGTTTTTAAA AATAGAACAT 


780 




TTGCATTATG TACGATTAGT 


TCAATGATTA 


TAATGATGTC 


AATGGTTGGA 


CCTGCGCTGC 


640 




TTATACCGCT ATATGTTCAA 


AACAGTTTAT 


CTTTATCTGC 


CTTGTTATCA 


GGACTTGTTA 


900 


SO 


TCATGCCTGG TGCAATAATA 


AATGGTATTA 


TGTCAGTTTT 


TACAGGTAAA 


TTTTATGATA 


960 




AGTATGGTCC TAGACCATTG 


ATTTATACTG 


GTTTTACAAT 


TTTAACAATT 


ACTACAATTA 


1020 
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GAATGTTTTC AGTTTCTTTA CTCATGATGC CGATAAATAC TACAGGAATT AATTCTTTGA 
GAAATGAAGA AATCTCACAT GGCACGGCTA TTATGAACTT TGGTCGTGTA ATGGCTGGTT 
CACTAGGCAC AGCTTTAATG GTTACATTAA TGAGTTTTGG TGCAAAAATA TTTTTATCTA 
CATCGCCATC GCATTTAACT GCAACTGAAA TTAAACAGCA ATCCATTGCT ATAGGGGTGG 
ATATCTCATT TGCTTTTGTA GCTGTGCTTG TTATGGCAGC TTATGTGATA GCACTTTTTA 
TAAGAGAACC TAAAGAAATA GAAAGTAATA GAAGGAAATT TTAAAATAAT TATAGTAGTT 
GGTCTATTTA AAATAATAGG CTAACTGCTT TTTTTATTTA ATAAAAAGTT TTATACTTTT 
AGTGATAGAC TAAGCAAAAA TTGTTATTTG CTATGATGTA GATGTCTTAA AATGATTAAG 
GGGGATTTGC TTTGTTAACG GTAGATCAAG TGAAAGAATT GGTAGGAGAA ATTAAAGATC 
CTATTATAGA TGTGCCTTTA AAAGAAACAG AAGGTATTGT TGAnGTTTCT ATTAAGGGAG 
AAnAAGAACA TGTGAGTGTT 

(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 
TGCATCAACT TTCAAACAAT AAATCATCAC AATAACCACA CCTAATTCAA CACTTTTCAA 
ACATAAGTAT TGACACATTG AGCAAAATGA TTTTTAATTG TAACTAATAC AGTTACAATT 
ATGAGGTGAG AAACATTGAA TTTAGAATTT AACATTGCCG TGCATGTATT AGCTTTTTTA 
ACTAAGCATC aTTCAGAAAA ATTCAATAGT AGTTCATTAG CAGAaTTAaC TTGTTTAAAT 
CCTGTTCAAT TACGACGCGT GACGACTCAA CTTGTCGATT TAAAAATGAT TGACACAATA 
CGAGGTAAAG ATGGCGGTTA TTTAGCAAAT GATCAAAGTG CTGATGTCTC TCTAGCAACA 
TTATaTAAAC ATTTTGTCTT AGAGAAAGAA CACCACACAC GTCTATTTAC TGGCGACGAA 
GGCAGTCACT GTCAAATTGC TCGTAATATT GCAACTACCA TGTCACATTA TCAGCAAGAC 
GAACAGAATA TCATTATTAA TTTTTATAAT GGAAAAACAA TCAAAGATGT CATTGAAGAC 
ATTCAAAAGG AGGATTTATG TCATGAAAAC ATATGATTTA ATTGTAATAG GATTTGGGAA 
AGCTGGTAAA ACTTTAGCGA AATACGCTGC ATCAACAGGT CAACACGTCG CAGTTATCGA 
ACAATCTCCG AAAATGTATG GAGGCACTTG TATAAACATA GGATGTATAC CTTCGAAGAC 



1029 
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10 



IS 



20 



25 



30 



35 



40 



45 



SO 



TGTTGTCAAT 
TGTACTGGAT 
TGACGATATC 
TGTCATTCCT 
ATTAAACATT 
AGAATTTGCT 
ATCATTTATG 
TAAAGGCATT 
TACAACAGTC 
GCGCAAACCG 
CGAAATTAAA 
TGTTAAAGGC 
AGCGTTATAT 
ATTTATAGAT 
TTATGATTAC 
TGATCCAAGA 
TACATTATAT 
AAACATTCCA 
ATTTAATGAT 
ATGCTGCCAT 
TATCFTGTTC 
CTTCAGGATC 
TAAGCGCagc 
AATAAGATTG 
GTAACGTTCT 
TGCTACCCAC 
CTATCTATTT 
AATGGAGCAT 
CCCATGATAA 



GCGCTAAACA 
TTTAAAGCGC 
GTCGATAGTA 
AACATTAAAG 
AGCTATCAAC 
TCAATGTTTG 
CCACGCGAAG 
GCATTGCATA 
CATACCAACG 
AATACGGATT 
GTCAATGCTC 
GGACTTCAAT 
GGTAATCAGT 
CCACCATTAT 
ACTGAACATC 
GGTTTATTTA 
GGTAAGCAAT 
TATACCGTAT 
TTATTTAATT 
CAGGCTTTAT 
TTCTTTACGA 
ACGTGGTTTC 
CAGTGGGCGG 
ACAACCTATC 
TATATTGACT 
CTTTCTTTAT 
TACTCTTTTT 
TTATCTTCTA 
ATATTAAATT 



ATAAAAATTA 
AGTTTAAATC 
TTACTGCACC 
GCCTTGATCA 
CTAAGCACCT 
CGAATTTAGG 
ATCAAGATGT 
CAAATGTTGA 
TTGGTAACTT 
TAGCTTTAGA 
ATCTTCAAAC 
TTACGTATAT 
CACGTACGAC 
CACGTGTTGG 
AACTTTTAGT 
AAGTAGTCAT 
CTGAAGAATT 
TACGAGATAA 
TCTAGACAAA 
TGTTGTGCTT 
CGTTTACGTT 
TTTAATTGTT 
TTATTAACAA 
ACGATTTCAG 
GCCTQACATT 
CATATCTATA 
AATATTTTTT 
ATTAAATTAG 
TTAATGGTTT 



CCACTTATTA 
TAATACTGAA 
TCATATCATT 
AGCTAAACAC 
CGTTATTGTA 
TAGTAAGGTA 
CGTTGCATAT 
AACGACTGAA 
TGAGGCTGAT 
AAATACTGAT 
AACTGTGCCG 
ATCTTTAGAT 
TGACAATAGA 
ATTAACTAGT 
AAGTGCTATA 
TAATAATGAA 
AATTAATATA 
TATTTATACG 
ACATAAAAAC 
TTCGCTTTTC 
CTTCATGTCG 
GAGAAACTTT 
AAGTGAATGT 
CATCGGGATC 
CATCACAAAT 
TCGTCGATTT 
TCAAGATACG 
ATGaTTGCTC 
AATACCATGT 



GCAGACGATA 
GTGAATTTAT 
ATTAATACCG 
GTCTTCGATT 
GGTGGCGGTT 
ACAGTATTAG 
GGTATTACTG 
TTGTCATCTG 
GCAGTACTTT 
ATCGAATTAG 
CATATTTATG 
GATTATCGAA 
GGCT^GCGTGC 
AAAGAAGCTG 
CCTCGTCATA 
AATAATATGA 
ATTAAACTTG 
CATCCTACGA 
CTGGTGGCAC 
TAATTTTTCT 
TTTTCTTAAA 
TTCGATTAAT 
TTTTCGGCGT 
TAATTTTTCC 
AAGGAATGTA 
CATTAATTTT 
TAACACGGCT 
CCCTATCAAA 
TTTGTCCATT 



ACAACATTGA 
TAGATCAACA 
GTGCTACCTC 
CGACAGGTTT 
ATATCGCCTT 
AACGTGGCGA 
ACTTAGAAAA 
ACAATCATCA 
TGGCTATCGG 
GCGACAGAGG 
CTGCAGGTGA 
TTATCAAATC 
CTTATACAGT 
CCGCTCAACA 
AAATTAACAA 
TTTTAGGTGC 
CGATTGATCA 
TGGCCGAATC 
GCATTGAATG 
TTAAGCrTTTC 
CGCTCTTCTT 
TCTTCTTCAG 
CCAGGTCCAC 
AACTTCTTTT 
TTTTTCATAT 
TTCGTTAACT 
GaCAATAAAA 
TCATTTATTG 
CATCATAAAT 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 - 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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10 



IS 



20 



TGATGTTTTA GATGCGCCGT GATACTTTTC AGCAATATCA CACAAATATT TTAGCTTTTC 264 0 

AGTTTCTATA TCAACTGTAG CTTCTTTATC CATACGTTGA ATAATTGTAC GATTCTGACG 2700 

CACCATCTTT TGCACACCTT TAATGTTATT TGTTTTAAAA GCATX3AATAA GTTTTTCAAC 2760 

ACAACGATGT GAATCTTCTA AGAAGTCACC GTAAAATGAA GGATCTGATT TCAAACGTTT 2820 

CACTTCGCTA ACAAAGTGTG GTGATGACGC CGGTGAGCCA GTCCAACCGA TAAGTACTTC 28 80 

CATATTTTCA GGTGCTTGTA ATGGTTCGAT GTGCAATCCA GGCCAGTTTT TGATTAAAAC 2940 

TTCTTCAACC GTAGTATCTT CAATTTGATG CTTAACCCAT TCATGATCAA AAGTACTATA 3000 

CGCTAGCCAT CCACTATATA CACTCACAGC AATATCTCCG CATGAACTTA AACTTTGTAA 3060 

CTTCATATTT GCAATCACTG CTAGTTTATA AATGTATAAA TTAGATAACT TCATATCATA 3120 

AAATTCATTT AATACTTTTA TAACTGACAC AAGTACTGCT GCACTTGAAC CTAATCCATA 3180 

TTTATGACCA TTTGAATCAT CTAAATTACT ATCAATAGTC AGATGAAAAT GCTTCATCGC 324 0 

TATATCGCAA CTTTTCGCGT ATTGTTCAAA TATTTCAATA GCTGTGaCCA CATAATTTAA 3 3 00 

TT G TTTTGCT GcATGTGGAT CTGAAATGAC AATACTATCT TCATCTCTAC TAAATGTAAC 33 6 0 

25 TGGGTTATGA TGTAATGCTT TTGAATGAAT GGTACCTTTA TATTGGTCTG CTTCTTCAAT 3420 

AGTAGCAGTT ACAAAACGAT CTAACGCAAT AAGTACAGAT TTATATCCTG GTTCTGTTAC 34 80 

AGCATATTCT CCAGCAATAT AAAGTTTTCC GGGTGCTTTG ACCTGAATCA TTTTATCTCT 354 0 

30 TCCTTACTCA ATTATTTCAA TTCCTGTGGC AATAATGTCA CTATCAATAA TTTGGTTATT 3600 

ATCAAACTGT GTTAATAATT TATCTATAAT CTGTTGCTTG TTTTTCTTTT CTACAAGTAT 3660 

TTTCACATTA GGTCCCGCAT CCATTGTAAA ATAACACGGA TACCCCGCTT CTCGGCATTC 3720 

GTGAACAAGC GCCATGACAT CATAACTTTC TTGCACAAGA TATGTGAACG GCGGTGTTGA 3780 

TCCT^GATTC GTGGCATGCA TACGCAAACC ATTTTCTTCA ATTACTTCAC CAAGGCGTTT 3840 

AAAATCTTTG TCTTGAATCG CTGCTTTTGC TTCAGCTAAA TCTTCATCAA TATGATCTAA 3900 

CCAATATTGA TAAAACCTTG ATGTGTTTOG TGTCAATGAC ATACCATATC GACTAGGTAC 3960 

CTTTTTAGAA TGTTGATTAA TCACAACAAA TATCATGGCA AGGTCATCTT CAAAATGATT 4020 

CGATTCAAGT GGAACGGCAT ATGACGTCTC ATCACTATAC CCTTTTTCCC ATTCTGCAAA 4030 

TCCACCATAA ATACTACGCG ACGCAGAACC CXSAACCAATT CGCGCCAATC TCGATAAATC 414 0 

CTTATCTGAC AGCTGCATGT CTAGCGCTTG ATTACAAGCT GCTGCTAAAG CTGCATATGC 4200 

so GCTTGCCGAT GAAGCCAACC CTGCTGCTGT TGGTACAAAA TTGTCGCTTT CAATTTCTGC 4260 

ATACCAATCG ATGCCAGCTC TATTTCTGAC AATATCCATA TATTTTGAAA TTTTCTCTAA 4 320 
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AAAAC3TGACT TTCGTTTCAG TGTAAAATTT TTCTAATGTA ACAGATATCC TATTATTCAT 
TGGAATGATT AGTGCTTCAT CTTTTTTACC CCAATATTTT ATAAGTGCAA TATTCGTATG 
TGCACGTGCT TTGCCACTTT TAATCAACGC ATTAACCTCC TAAATTCTCA ATCCAAGTAT 
GTGCTGCACC AGCTTTTTCT ACAGCTTTTA CAATATTTTT CGCTGTTGGT AAATCTTTGG 



4440 
4500 
4S60 
4620 

CAAGCAATAA CATACTTCCA CCACGACCAG CGCCAGTAAG TTTTCCAGCA ATCGCACCAT 4 680 

TTTCTTTACC AATTTTCATT AATTGTTCTA TTTTATCATG ACTAACTGTC AACGCCTITA 4740 

AATCCGCATG ACATTCATTA AAAATATCCG CTAAGGCTTC AAAGTTATCA TGTTCAATCA 4800 

,^ CATCACTCGC ACGTAAAACT AACTTACCGA TATGTITTAC ATCTGACATG TACTGAGGGT 4 860 

CCTCACAAAG TTTATGAACA TCTTCTACTG CTTGTCTTGT TGAACCTTTC ACACCAGTAT 4920 

CTATAACAAC CATATAGCCG TCTAAACTTA ACGTTTTCAA CGTTTCAGCA TGACCTTTIT 4 980 

GGAACCAAAC TGGTTTGCCT GATACAATCG TTTGCXTTATC AATACCACTT GGTTTACCAT 504 0 

GTGCAATTTG CTCTGCCCAA TTAGCCTTTT CAATGAGTTC TTCTTTCGTT AATGATTTCC SIOO 

CTAAAAAATC ATAACTTGCA CGAACAAAAG CAACCGCGAC AGCTGCACTC GATCCTAATC 5160" 

CACGTGATGG TGGTAAATTC GTTTGGATCG TTACTGCTAG CGGCTCTGTA ATATTATTTA 5220 

ATTCTACAAA ACGGTTCACC AAAGACTTAA GATGGTCAGG CGCATCATAT AACATACCAT 5280 

CGTAAACATC GCTTTTAATA GACGAATAGT TCCCGCTCTC TAAGGCTTCT ATTAAAACTr 534 0 

TGATTTTACC TGCGTTAAAC GGTACTGCAA TAGCAGGCTC TCCAAATGTA ACAGCATGTT 5400 

CTCCTATTAA AATAATCTTA CCTGTCGATT CCCCATATCC TTTTCTrGTC ATGTCAATAT 5460 

CACCTTTTAT ATTTATCCTA TACTTGATTC ATTATTTTTA TITATTAGTA AAAGACATCA 5520 

TATTCTAAGT TGCAtngCAT TCGCGTTAAA TTTCATTGCA GTCTTTATCT CACATTATTC 5580 

ATATTATGTA TAATCTTTAT TTTGAATTTA TATTTGACTT AACTTGATTA GTATAAAACT 5640 

^ AACTTTCG-rr TACTTCAAAG TTTAAATCTT ATCGAGTCAT ATTTCAGATT CTTTATCrrT 5700 

TTATAAAATA GCCCTACAAT TTATAATTTT CCACCCTAAC TATAATACTA CAAATAATAA 5760 
TTGGAATATA TAGATTTACT ACTAAAGTAT TAGAACATTT CAATAGAAGG TCGTTTCTTT 
4S CATAGTCATA CGCATTATAT ATACCCTATT CTCAATCTAT TTAATACGTA AAACATGAAA 
TTTTCTrATT AAATTTATTA TTTCCATCAT ATCATTACTT TTAATTTAAT 6ATGTTCAAT 
TTAAATATTA GGTCAATAAC ATATTTATGC TTTTTATGGA TACTTTCAAA AATAACAGCC 
CCAAACGATA ACTTGAAAGG GGCTGTTAAA TATTTAACTA TTGCATTTGA TCTATCATTT 
TCTTGTTTCT TTCAATCATT TTATCAAAAT ACGTATCGTA TCTTTGCCAT TCTTCTTGAG 



5820 
5880 
5940 
6000 
6060 
6120 
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TTACATCXTG AACAGTAATC GTTTTGTTAA GCAATGTCTC TAATGAGGCC ATACAAGATG 624 0 

GTTCAATTTC AGGATATTTA AATTTAGTCA CTTCACCTTT TAAAGCATGT TCATAAAATG 63 00 

TTTGCATCAT CAATGCACGT TCTGAACCAG AGCCTTCAAC ACAAAGATAA ATTTGTACAG 636 0 

CAATACCGCC TCTAACTCTT CGTTGCGATA TGCCTGCAAA TTTCTTACCA TCGATACTTA 642 0 

AGTCAAATTT TCCTGGGCAA TAAGAATGTT CAATTTCCAT CGTATCAATA TCAACATTCT 64 80 

CATTTTCGAA CATTTTGCTA ATTAAGAGGT ACATCACAGT AAACGCTTCA TCAATCGTTG 6 54 0 

TTTCTGTTTG TCCTTTGAAC ATCAGCGATA TATTTAATAC ACCTTGATCT AGAACGACAC 66 00 

CTAAGCCACC AGAATTTCTA ACAATGGCAT TATAACCAAT CTCATTCGTT AAATAATCAA 6660 

TGCCATCTTT TAAAAACGGC AATCTTGAAT CATGAATACC AAGAATAACA GTATGTTGAT 6720 

GAATCCAAGT ACGCACAACA TTATCTGATA TATCTTTGCC CACACTTTCG CAAAATGTAT 6780 

CATCGAATGC GAAAGATTGC ATAGGTTCTA ATCCAGAAGA ATGATCGATA TATCGCCAGT 684 0 

TGACGCCATT AAAATATTTA CTCGCTAAAT CCATCGTTAT TGTAAGGCTT GCGCTGCTGT 6900 

AATAATTGAA AGATTGTATA CATCTTCAAT TGAGCAGCCA CGTGATAAGT CATTTACTGG 6960 

AGAATTTAAA CCTTGTAATA CTGGACCAAC TGCATCATAT CCACCTAAAC GTTGTGCAAT 7020 

TTTGTAACCA ATATTACCAG CTTCTAAACT TGGGAATACA AAGACATTTG CATCACCTTG 7080 

TAATTTAGCA CCTGGCGCTT TTTTCTCAGC AACACCTGGT ACAATCGCAG CATCAAATTG 7140 

GAATTCGCCA TCAATGATTG CTTCTAATTT TtCTTCTTCA GCTTTTtGTT GTGCTAATTT 7200 

GACAGCTTCT TGAACTTTTG TCACGTCGTC TGATTTAGCA GACCCTTTTG TTGAAAAGCT 7260' 

TAACATTGCA ACTTTTGGAT CCATGCCAAA GCTTAATGCT GATTTTGCAC TTTCTACTGC 7320 

AATTTCTGCA AGTCCTTGTG AATCAAGTTC TGGATTGATT GCACAATCAC CAAAGATGTA 7380 

TTCrrCATCA CCTTTAATCA TAAAGAAGAT ACCTGATGTT CTTGATACAC CTGGTTTCGT 7440 

TTTGATGATT TGTAAAGCTG GACGCACAGT GTCGCCTGTT GAATGTGCTG CACCACTAAC 7500 

TAAACCATCT GCTTTACCAG CATAAACAAG CATTGTACCG AAGTAGTTCA CATTGTTTAA 7560 

TAATTCTTGT GCTTGTTCTT CAGTCGCTTT ACCTTTACGT CGTTCAACAA ATGATTGAAC 7620 

TAATTCAGCT TTCAATTCAC TTGTCGCAGG ATTAATTAAT TCAATATTAG AAATATCAAG 7680 

ATCAAGTTTT TGCGCTAAAG ATTGAACCTT AGTCTCATCA CCTAACACGA TTGGTGTAAC 7740 

ATAATCTGTT GCTTGTAATT GTGTTGCAGC TGTTAGAACA CGTTCGTCCT CTCCTTCAGG 7800 

TAATACGATT TTAACGTTTT TACCAGAAAG TTTGTCTTTT AATACATTTA ATAAATCAGC 7860 

CATAATGTCC TCCTGTAATA TAAATCTTAT TAATCATTCA CGGTATAATT ATACXKTCATT 7920 
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TATGATAAAA TTTATAAAGA ACTGATGATT TTTGAAAAGG AGCGATAAAC ATGAGTCAAG ' 804i 

CAGCCGAAAC ATTAGATGGT TGGTATAGTC TACATTTATT TTATGCAGTT GATTGGGCAT eiOl 

CATTACGTAT AGTTCCAAAG GACGAACGCG ATGCACTTCT CACTGAATTT CAATCATTTT 816( 

TAGAAAATAC AGCAACTGTA AGATCATCAA AATCTGGTCA TCAAGCTATT TATAATATAA 8221 

CTGGTCAAAA AGCAGATTTG TTATTATGGT TCTTACGTCC TGAAATGAAG TCTTTAAATC 82 8 C 

ATATTGAAAA TGAATTTAAC AAATTGCGCA TTCCTGACTT CCTAATCCCT ACATATTCAT 834 0 

ATGTATCAGT CATTGAATTG AGCAATTATT TAGCTGGTAA ATCTCATGAA GATCCTTATG 840C 

AGAACCCTCA TATCAAAGCA AGATTATACC CAGAATTACC ACATTCTGAT TATATTTGTT 84 6 C 

TCTATCCAAT GAACAAACGT CGTAATGAAA CTTATAACTG GTACATGTTA ACTATGGAAG 852C 

AACGCCAAAA ATTAATGTAT GACCATGGTA TGATTGGTAG AAAATATGCT GGCAAAATCA 8580 

20 AACAATTTAT TACTGGTTCT GTAGGGTTTG ATGATTTCGA ATGGGGCGTA ACATTGTTCT 8640 

CAGATGACGT ATTACAATTC AAAAAAATTG TATACGAAAT GCGCTTTGAT GAAACAACAG 8700 

CACGATACGG TGAATTCGGT AGTTTCTTTG TAGGACATAT TATTAACACA AACGAATTCG 8760 

25 ATCMTTCTT TGCGATTTCT TAATACATTG GTACGTTTAT AAATTAATAA AAAAATTCCA 8820 

AGCTTATCGG TTTAAGCTTG GAATTTTTCG TTTATCTTCA GTATATTCCC GTATACATAA 88 80 

GACGTGATTT GGTAAATAGT TGAAATCTGT ATGTTTAAAC TTATATATAT GTGCTAATGT 894 0 

ATTATCAATA ACAAAGtACA CTTTGCTCAT AGCAAgTsaC CCGAgTAGTC TTCCTTGGGA 9000 

GAACTTTAAC TACTATCACT ACATATAAAC GTTAACCTCA ATAGAAATTA TACAGTCGCT 9060 

ACTCTATACA ATTTTTGTAA TGGTTAACTA ATATTATTTT AACCTATTTG AAATATTTGA " 9120 

AACATATTTT TGTCGAATTT TTTTCAATAA TTTTTCCTTT TTATACTTCA AGAGAATTTT 9180 

AACTACTAAA AATTCC6ATG ATTATTATTA CAATAGTATC AAATATTAGT TTTTTAAAAT 924 0 

^ CAATAACAAC TTATCAAAAA GCTCATGTGG TTATTTTATA GTGTATAAAC TATAATGAGT 93 00 

ATTAAATTCT TATAAACAAT GGTGATGAAA TGGACATAAA TTCAGAAGAA TACAAACAAG 9360 

AGGTACTTAT CAAAGACGTT GTCATGCTTG CTGCTCGCAT ACTATTAGAA TCTCGTCCAG 94 20 

AAGGTACGCG TGTAGAAGAT ACCATGACAC GTATTGCAAA AAAACTTGGT TACAGTGAAA 94 80 

GTAACAGCTT TGTTACAAAC ACTGTCATCC AGTTTACGTT ACATTCGGAA TCGTTTCCTA 954 0 

GAATATTTAG AATTACCTCT CGAGATACAA ACTTAATAAA AATTTCTCAA GCTAATAAAA 9600 

SO TTTCGCGTCA AATTACAAAC AATGAAATTT CTTTAGCCGA AGCAAAAACG CAACTTGAAA 9660 

AAATATATGT TGCTAAGCGT GACAGCAGTC TTCCCTTTAA AGGTTTTGCT GCAGCAATGA 9720 
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TAGCAGGTAG TCTAGGATAC CTAGTCACTG AGATTTTAGA TCGTAAGTwA CACGCACAGT 
TTATCCCAGA ATTCaTTGGT TCaTTAGTTA tTGGGATTAT CGCCGTTATT GGACATACAC 
TTATTCCAAC AGGTGACTTG GCAACTATTA TCATTGCGGC AGTCATGCCT ATTGTTCCTG 
GTGTATTAAT AACAAACGCA ATACAAGATT TATTTGGTGG ACACATGTTG ATGTTCACAA 
CGAAATCATT AGAAGCATTG GTTTnGCGTT TGGCATCGGT GCTGGCGTTG GTAGCGTATT 
AATTTTAGTA TAGGAGTATC AGACTATGTT TTGGATCTTA AACTTTATCT TTAGCTTTTT 
AGCTTC 

(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 
ATTTAATTGG TTGGTGGCGT ATTCTCnTTT CCAAGATTTG CCACGATTGG ATGTTCAGTC 
GACAAATATG TAACCAGTCC CGATAAATTA CGTTCCTCAA TCATTGATAG TATGTCTGGT 
GTTTGTAACG ATTTATCGAG TAAGGCATCA AGCAATTGAT AATGTCCCAA TACAACAAAA 
TGCACGTTGT CTCTCAACTG CTGTTGAATA AACTGAATAA AGAGCTTTAA GCTCTGTTGC 
ACATTCGCTA ATGATGGTGC ATAGTTTTCC AAACCAACTT GTACAGCCGC TTCATTATTC 
CGAATGATTA AACCTGTGTA TGCCACTTTT GTTGCTGCAG TTGGATACAT TGAGTAATAA 
CGCAATAATT GATCTGTAAA ATCATTTCGA AGTGCATAAA TTTGATGCTC ATGTTGCCAA 
AAATttCGCT CACCCATCTG CTgCAAATCC TCATGGTTCA ATTGTTTCCA GTCCAACTTT 
TCAACCACAC TAAAATCAAC TAACTCATAA TCCGCTTTAT TAAAATATTT TAAAAATGCT 
GTTTCCGATT CTTTTAACGC AATTAATTGT TCTGnATTAT TCACTCGACC ACCCTTTACT 
TTCAATACTG TATTTAAAAT CACTTGGTAT TTTCGTTGTT TGCTTTACTT CTCTACCACG 
CTAAAGTGTA ATATGATTAA TAACTTATCA TTTTTAGCAA TACATTACAA CCTTTTTCAG 
AAAATTCGGT GTATTGATTT TAAAATTTTT TAAAATAnAA AAGGCAAGAC ATTTGTGCCT 
ATAAAAATGC TTAACCAAGA TTTTTATATT GaAGTTGTAC TTCTTGCACA TATTGTCCTT 
GCCTTATTAT GTAAAGTTAT TTTCTTTCTA TCTTTTTATT AAATTTAACT ATTCTTCATA 
ATCCCGATTC CCTTTAAAGT AACGTCTATC TTGTTTACTA TATACATTTT CAGGATTAAA 
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TTGGTAAACG TrCGTTGCTG ATATATCTGT AAAATTCTTT GGACCGACAC CTCCAATAAA 
CTTAAACTCT GCTTCATCTA CCAAATAATC ATACGCTTGT GTATCTCTAT CCTGTGCGCC 
ATGTGGAAAT ACAAACATAT CTGTTTTACC TACAATTGGT TCAACTTCAT CTTTCCATCT 
TTTAGTATCA CGTTTAATAC CTTCTAAAGA TGTTTT7TCA AAATTAATGT GACCATATGA 



1080 
1140 
1200 
1260 

ATGACTCGCA AATGACCATC CATCCCGTTT CATTGCGCGA ACAACTTCCT CAGCTGCCTT 



ro w,*i«^.«^v« «(-A«(_in_i.-r CAGCTTGCCTT 1320 

TTTATTCTrr GTATAATCTT TACTCGTTAA TTCATTCGTC CGATAACCTA ATACGCCCTC 
ATAACCGGTT AAAGCAACAA CACCTTTTTC ACCATTTAAA GAAAAATCTG GATGCTCTTr 
,5 TACAAATITA TTTAAAATTG GCACGATATC ATTGTCATCA GAATAAGTAG CATGGCCTTT 

TTTGTCTGTA GTTTCAGAAA CAACATGTTT ATTTTTATCG AGTACTAAAC GGTCAGCATA 
ACCATGGTGT CTCATGTAAC TATAGTAATT CATATCATCA ATTGAGATGA TTAGTGGCTT 
20 TTTACCTTTC GGCAATTTTA TTTTTTTGGC TTTTACATGA TGAGATGATA AGTCGTATAC 

ATCATGTXKSA TTAACGATGA TGTAATTATT TTTATATAAT TCGTTCAATG ATTTTTTAAA 
TTCACTTACA GTAATCATCC AATCATTGTT GCCCTTAGCT TGGTGTCTAT CTCCTGTAAA 
25 CGCAACTTTT GGGTCTGTAA TTAATGGGTG ATAAAACACA TGATAAACTT GGCCGTCATA 

TGTTTCCCAA TGTTCATCCA TTTTcGATTT aTGCTTTGCA TACTCATTra GATTAACAGA 
TTTATTkTGA GCTTTCTCAT TTTGCTTGGA ACAGCTATAT mACAATGCAA CTGATAATAA 

30 

QKQPJ^l^PAPCV AGCAATAAAT ATTTTTTATG CATTAAACAT TC 

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 
^5 <A) LENGTH: 134 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 
ATAGAGTGAT AAAC3GATGGT TGTCATATGA TAAATGCAGT AGTAATAGCA GTAArTTTAA 

^5 TGATTGTGCT ATGTTTATGT CGATTAAACG TnTTATTAAG CrrATTTATC AGTGCGCTAG 

TTGGTGGCTT AATTTCAGGC ATGAGCATTG AAAAAGTTAT AAATGTATTT GGGAAAAATA 
TAGTCGATGG TGCTGAGGTA GCATTAAGCT ATGCTTTATT AGGTGGATTT GCAGCATTAA 

SO TTTCATACAG TGGTATCACA GACTATTTAG TAGGAAAAAT TATAAATX3CA ATTCACGCTG 

AAAATAGTCG ATGGTCAAGA GTTAAAGTCA AAGTGACAAT AATCAITCCA TTAITAGCTA 



1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2022 



60 
120 
180 
240 
300 
360 
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CACCATTGTT AAGTCTGTTT AATGACTTAA AAATAGATAG ACGTTTAATC GGTTTGATTA 4 80 

TCGGTTTTGG TTTATGTTTC CCGTATGTGT TATTACCATA TGGATTCGGT CAAATTTTCC 540 

AGCAAATTAT TCAAAGTGGC TTTGCAAAGG CAAATCACCC AATTGAGTTT AATATGATTT SOO 

GGAAAGCAAT GCTTATTCCT TCAATGGGGT ATATTGTTGG CTTACTTATC GGTTTATATG 660 

TATATCGTAA ACCACGTGAA TATGAAACAC GTAAAATTTC AGATAGTGAC AATGTTACAG 72 0 

AGTTAAAACC ATATATCTTA ATAGTAACAA TTGTAGCAAT ACTAGCTACA TTTTTAGTAC 780 

AAACATTTAC AGATTCAATG ATTTTTGGTG CACTGGCAGG GGTACTCGTA TTCTTTATTT 84 0 

CACGTGCATA TAATTGGTAT GAATTAGATG CTAAGTTTGT TGAAGGTATT AAAATTATGG 900 

CTTATATTGG TGTAGTTATT TTAACAGCAA ATGGATTTGC TGGTGTAATG AATGCTACTG 960 

GTGATATAGA TGAATTAGTT AAAACTTTAA CAAGTATTAC TGGTGATAAT AAATTATTTA 1020 

GCATTATCAT GATGTATGTG ATAGGTTTAA TTGTCACTTT AGGTATTGGA TCATCATTTG 1080 

20 

CAACAATTCC TATTATCGCA TCATTATTCA TTC CI T T TGG AGCGTCAATT GGACTAGATA 114 0 

CAATGGCATT AATCGCATTG ATTGGAACAG CGAGTGCATT AGGTGACTCA GGTTCX5CCTG 1200 

2s CAAGTGATTC AACATTAGGA CCAACTGCGG GATTAAATGT TGATGGCCAm CATGATCATA 1260 

TACGTGATAC ATGTGTACCA AACTTCTTGT TTTATAATAT TCCTTTAAAT GATTTTCGGT 1320 

ACTATTGCTG CTATGGTACT 134 0 

30 (2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
^ (D) TOPOLOGY: linear 
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ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

CAAAATCTAA GAGAATAAnA TTTGTTAATT TnAAATAGCA AGGAATTCAA AGTTATATGT 60 

GTAATAGATA AAATAGATAT CCCTATAGTG ATGCGTTACT AGCTAAACAT AATAACACAT 120 

TAGAAGATAA TGAAGTTAAG GAGTTACTGG ATTGTTTCGA CTATGTAATT AAGTATAAAA 180 

ATATCCAACG ACAAAACGTA ATTATAAAAT GGTAAAAGCT ATGGTACAGT TTCAAATTGC 24 0 

TAATGACATG CGTATCGGTG AGCTACTTGC AATAAAGAGA GTAAATATAA ACTATGAAGA 300 

TAAAACGCTA GATATCGACG GTAAAGTTAA TTGGATAACT GAAAAAAGAC GGGAGCATTC 3 60 

GGAGTAAAGG AGACAACTGA AAGAAGTAAT AGCTATAAGG CCACAGGGCT CACTACCCAA 420 
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25 



35 



TATTCACAAA TACGTCTGGT AGCCCTATCG ACTCGAACAA AATTAGCCAC ATTATTAAAG 
GGGGGCGCTG ATATTAGTTC TATTAAGAAA CCTATAACGA CGCATACATT ACATCATTCG 
CATATATCTA CACTTGCTCA ATTAGGAATT AACTTAAAAG CAATGCAAGA GCATGTAGGT 
CATTCAGATT ATAAAArAAA TCTAGAGATA TACACACATG TTACTAATCA GATGGCGAAA 



540 
€00 
660 
720 

GATATGATGA ATAAATTTGA ACGATTGGGG AGTTAAAATT GGAAAAAGAT GaTACACTAG 780 
CAGAAATTAA GCCTATGCTC AATTTTGATG AGCAAATAGC AAAATTAAAA CAGATGAATA 
TATTTTTTAA TATTATTGAC ACCGAAAAAG CAAATGAAAT TCTTAGAAAA AATAATTACT 



TCTTCAAACT wGcTTATTTC CGaAAAAATT TCGrAAAAAA GaATGGCGGC TATTTCATAG 960 



840 
900 



AATTTGCTTA TTTATCAGAT TTAGCZAACTA TAGATATGaA ATTAAGATAC ACAATGTTGC 
ATTTAACTTT AGATATTGAA CATAGTTTAA AGTATCTAGT CTTAAAACTA ATAACAGAAA 
20 ATAACCAAGA AGATGGTTAT AAAATAATAG ATGAGTTCTT ATGTATTGaT AAATCATATA 
GCAATTCAAA TTTTGACACA AATTCAAGAA CACCAGAAGA AGTTATGGAA ACCAAAATCA 
AAAATAAAAA CGAAATATTC AAGCATATGA ATAAACGAGG ACAACTACCC GAGAAGTTGa 
ATAAATACTA TCmAAATCCA CCCGCnnGGk TTTGCaTTGr ATTCATGCAA CTAGGTCAAT 
TCGTTTCGTT TCTCAACTTC TATTACAAGA AGTACAATGA CGAAGAAITG AGAGTTGCTA 
ATATTTTAAT GCCTTTAGTT AAAAATATAA GAAaCAAATC AGCTCATAAC CAACCCATCA 
TAGCAAATCT AAATTATGAC AGTAGATTAC CTCAATATTT ATTTGAAAAA GGGAATAATA 
TAGGCATATC TAGAAACATG TTCGGAATAA AAAATTTCAT AGATACTkTC ksTACGCTAG 
AATTACATAA TCAAGTTTGT AGTAATGCAA TTATCCAAGC AAGATATCAC GATTTGGACC 
AACTTCAAAA GCGATATAAA AGrAACGrAA GCTATTATAA TAATGCATTA GCTATCAAAA 
GATTTTTTAT AGCTTTAGAT AAAATTATTG ACTTCAACAG ACCAAAAGTA TAAACTATCT 
AGTGAGGAAA GAGACTTATA GGTCTCGCGA GTTATTTTAA TTCGTATGCA AGAAAAAGAA 1800 
GAGCTATGCA TTTTATTTAA AATGCGTAGT TCTTtTTTTA TGCATCTAAA TTCATATTAT 1860 
TTTTGCAATA TAAACATATC TTTGTGCAAA TTCCGAACAC AAAACATTCA CATCATCCIT 1920 
^5 TTTtGCCCTT TTTCTATACC CCAAAACACA AAAAGCCCCG TAAGCCTATG CcTACGGGgT 1980 
TTGACAATAA ATTATATATT ATTGTTCTTC TTTAACATAT GGTAATAATG CCATATGACG 2040 
AGAACGTTTG ATAGCTGTAG TCAaCATACG TTGATATTTA GCTGAAGTAC CAGTTACACG 2100 
SO ACGTGGTAAA ATTTTACCGC GTTCTGAGAT AAAACGTTTT AATAATTCAG TGTCTTTGTA 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



2160 



GTCGATATGT GTAATACCAT TTGCTGTGAA ATAGCATACT TTTTTACGAC GACGTCCGCC 2220 
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CGTTAATTTT TATTAGAATG GTAAGTCATC ATCACTTATA TCAATCGGTC CGTTTGCATT 234 0 

TGCAAATGGA TTATCAGATT GTTTCGTGTT TGATGAATTA TTGTACGAAT TGTTTTGTCC 2400 

TGATTGTTGA CCACCGAATC CTTGACCGTA ATCTTGGAAT TCATTTTGTT GACGTTGGCC 2460 

ACCATTTTGT TGCGCATTTT TAGGTTCAAG GAATTGAACG CTATCACACA CAACTTCAGT 2520 

AACAAACACA CGACGACCTT CTTGATTTTC ATAATTACGG GATTGTAAGC GACCATCTAC 2 58 0 

ACCAGCTAAA CTACCTTTAG ATAAATAGTT ATTTACATTA TCTGCTTGTC TTCTAAAAAC 264 0 

AACACAGTTA ATAAAATctG CtTCGCGCTC CCCTTGAGCA TTCGTGAACG TACGATTTAC 2700 

TGCAAGAGTG AATGtCGCTA CACTCACACC TGAGGGAGTG GTTCTGTATT CCGGATCTTT 2760 

CGTTAAACGA CCTACTAATA CAACTCTATT TAGCATTTAA ACGCCCCCTC TAATTATTAC 282 0 

TTGTCTTCGT CTTCACGAAT AACCATGTAA CGAATGATAT CGTCACTGAT TTTAGCTAGA 2880 

CGTTGGAATT CGTCAGTAGC TTTGTTGTTA TCAGATTTAA CACGTACGAT GTTGTAGAAG 2 94 0 

CCATCTTTGA AATCATTGAT TTCATAAGCT AGGCGACGTT TACCCCAGTC TTTTGCTTCT 3 000 

AAAACTTCTG CACCTTCAGT AGCTAAGATA CCGTTGAAAC GTTCAACTAA CGCTTTTTTA 306 0 

25 GCATCTTCCT CAATGTTTGG GCGTACGATG TACATAACTT CATATGTTCr CATTTTATAT 3120 

TTGCACCTCC TTGTGGTCTA TACGGCTTAT CAATCTTAAA ACAGATAAGC AAGGaATAAT 3180 

TTTCATTACT CACAATAAAG AATTaTATCA TGCGCCATTA CTTTTTACAA TaATAATTcA 324 0 

AACTACTCTT CATATCATTT TTGATATtAA TTCATTTGaA ACTTTCnATG ATATTTTnAA 3 300 

AAATACACTT CACAAAAGCG AACATATGTn CTATAAnAGT TGTGAGGTGG TAAGGAATGA 3 360 

ATTTA 3365 
<2) INFORMATION FOR SEQ ID NO: 24 7: 



20 



35 



4i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRAKDEDNESS : double 

(D) TOPOLOGY: linear 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

GTTAAAAGTA ATTGGTGGTA TTGATGATGA TTTTACAGCC AATGTTATGC ATCCAAATCA 60 

ATATCGAATT CGATATTCGT CTCAGAAACA GGACCTTAAT GAAGATATGA CAGTTTTTGA 120 

so TGCAGTATTA AGTTCTGATA CAACAACTTT ACGCATCATC AAGCAATATG AGCAGGCAGT 180 

ACAAGCTTAT GCGGATGACC AAAGTGATAA ATTGTTCAAG CGAATGATGG ATGCGCAAGA 24 0 
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30 



ACTAGGTATA CATGATACTA CTAAATACAT TAAAGAATTA TCCGGCGGAC AACAAAAACG 
TGTTGTACTT GCTAAAACAT TAATAGAACA ACCAGATTTA TTCOTATTAG ATCAACCTAC 
GAACCATTTA GACTTCGAAT CAATCAGCTG GTrGATCAAT TATGTCAAGC AATATCCTCA 
TACTGTTTTA TTCGTAACCC ATGATCGATA TTTTTTAAAT GAAGTTTCCA CTAGAATTAT 
TGAACTAAAC AGAGGTAAGT TAGCGTCATA TCCTGGTAAC TATGAATCTT ATATTCAAAT 
GCGCGCTGAA AGAGAAGTAA CACTTCAAAA GCAACAACAA AAGCAACGAG CmTATATAA 
GGAAGAACTT GCTTGGATGA GGGCTGGgAG CTAaGGCTCG TACTACAAAG cAACAAGCTA 



75 w*„xa™.«rtxv, MMoi-i-aACiJA CiCAATATAAA GACGATAAAG 780 

GTGAATTGAA TCTTGCTTAT TCaAGATTAG GTAAGCAAGT GTTCGAATTA GAAGACTTAT 
CAAAGGCTAT TAATGATAAA GTATTATTTG AACATCTGAC GGAAATTATT CAAAAmGGTG 

20 AGCGTATTGG TGTTGTTGGG CCAAATGGAG CTGGTAAAAC AACACTCTTA AATATTTTGA 
GTGGAGAAGA CCAACAATTC GAAGGTAAAT TGAAGACTGG GCAGACGGTT AAAGTAGCTT 
ATTTTAAGCA AA 

25 (2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

35 

TGTGATTAAC GAAGCTTATT TTCGTACACC TTCAACAACT GATTACAACG GCGirTTATCA 
AGGT^TTAT ATTGATTTTG AAGCAAAGGA AACTAAAAAC AAGACGTCCT TTCCTTTAAA 
TAATATTCAT GACCATCAAG TCGAACATAT GAAAAATGCA TATCAACAAA AAGGTATTGT 
GTTTTTAATG ATTCGTTTTA AAACGCTAGA TGAAGTTTAT CTTTTACCCT ATTCAAAATT 
CGAAGTATTT TGGAAGAGAT ATAAAGATAA TATTAAAAAG TCTATAACAG TTGATGAAAT 
ACGAAAAAAT GGTTACCATA TTCCTTATCA GTATCAACCA AGATTAGACT ATCTAAAAGC 
AGTTGATAAG TTGATATTAG ATGAAAGTGA GGACCGCGTA TGACGGAAAA CAAAGGATCT 
TCTCAGCCTA AGAAAAACGG TAATAATGGT GGGAAATCCA ACTCAAAAAA GAATAGAAAT 
50 GTGAAGAGAA CGATTATTAA GATTATTGGC TTCATGATTA TTGCATTrTT CGITCITCIT 
TTACTAGGTA TCTTATTGTT TGCTTATTAT GCTTGGAAAG CACCTGCTrT TACCGAAGCT 



840 
900 
960 
1020 
1032 
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TTAGATAATG GCCAAAGACA TGAGCATGTA AATTTAAAAG ACGTGCCGAA ATCAATGAAA 
C3ACGCAGTAC TTGCAACTGA AGACAATCGT TTCTACGAAC ATGGCGCACT TGATTATAAA 
CGTTTATTCG GTGCAATTGG TAAGAACTTG ACTGGTGGAT TTGGkTCcGA AGGtGCCTCA 
ACATTAACAC AA 

(2) INFORMATION FOR SEQ ID NO: 24 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 24 9: 
CACTTTTTTC ATTAAAAATC TCATATTTAT ACACTGAACG TAATCTCGAA TATTTTTCAA 
CCCAAGTTTT AACTTTAACT TTTTCTGGAT AAAAAATAGA CTTTTTATAA TTGACATTGA 
GGTCAGTCAC AGGTGAAATG ATTCCTTGTT TTTCCATATC AGCATAACTA AAACCTAACT 
TCGATATATA ATCCAACCGC GCAACTTCAA ACCAAGTTGC ATAATTCCCG TGATAAATTA 
CACCCATCTT ATCAGTTTCA GCATAACGCG CTTCTATTTC TGTAATACTA TATATCATTT 
TAAGCCTTCT TTCA G TT T AA CTTTATATCT CATTCTAACA TAAAATACAA GAAGAGGCCG 
GCCAAGAACA CAAAGGkTTT GAACCGACCT ATTATATCAT AAaGTTTATA GAAGTATTTT 
TGAGCACTAT CAAAGTGCCT CAAATACCGA TTAAAATTTT ACTGTGATAT CTATTTTTTA 
TTGCGCTAAT TTATTTCTTA AAACCATTTG TAAAATTCCA CCGTGACGAT AGTAATCCAT 
TTCAACAAGT GAGTCAAAAC GAACCATAGC GTCAAATTCT ACCAAATCAC CATCTTGCTT 
CTTAffCAGTA ACTTTGACGT AGtCATGTGG TTGAACATTT TCATCAATAT TAACAGrAAT 
TTCTTCTGTA CCATCTAGAC CAAGAGAATC AGCTGATTCA CCTTTTTTAA ACTCTAATGG 
TAATACACCC ATCATAACTA AATTTGAACG ATGGATACGT TCATAACTTT GTGCAATAAC 
TGTTTTAACA CCTAATAAGT TTGTACCTTT TGCTGCCCAG TCACGAGATG AACCCATACC 
ATAATCGTTA CCAGCTAATA CAACTAAACC TGTACCATCT TCTTTATATT TCATTGCAGC 
ATCAAAGATA GGCATTACTT CATTTGTTGG CCAATAAGTT GTAAAACCAC CTTCAGTACC 
TGGCGCTAAT TGG T TTTTAA TACGTATATT AGCAAACGTA CCTCGAACCA TTACTTCGTG 
ATTACCACGT CTTGAACCAT ATGAATTAAA TTCACGAATA GGCACTTGAT GATCTTGTAA 
ATATTTACCA GCTGGCGTAT CTTTACCAAT TGCACCTGCT GGAGAGATGT GGTCAGTTGT 
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10 



15 



20 



25 



30 



35 



40 



45 



SO 



TTCTTTAGAT 
GTCATATAGA 
GTTATTATAT 
TTCTTTAATT 
AATAGGTTCA 
CTGTGGTGAA 
GTTACCAGAT 
TGGTAATAAA 
GCCTAAATCA 
TGATCCTGGT 
TTTTTTAGCA 
TGTAATTGCT 
GTTAATTTCA 
TGGAGCTGTT 
AGGACGTTTT 
TGTATAATTA 
TTTTACTAGC 
ATCATCAACT 
AGCACGGTCT 
AACAACACCT 
GCCTTGTGGT 
AGGTTGTCCA 
AATACCATTT 
TTTTTCAOCA 
GTGAACTATT 
ATTTAAAAAC 
AGCTTCTGGA 
TACTTCTGGA 
AGCTAAATCA 



AATCCTTGGA 
GGTTGATCAG 
TCTTCAATAA 
GATGGCCAAA 
TTTTGTAAAT 
GCTAGGTAAT 
AATACAGATG 
GGACCTGAAT 
TCTAAATAAG 
GCTAGAGAAG 
ACTAAACCTG 
GCTATTGCAA 
GCTTTCTTAT 
ACAGAATTTT 
GGTCCTGAAA 
GGATCTTCTT 
GCGATATGTT 
GGGAAGAATC 
GCTAGTGGTA 
TTTTTACGTA 
AATGAATTTA 
AGCATTCCAG 
ATCATTGTTG 
TCTACATCAC 
CCAGTTGCAG 
TGATAACX5TT 
TTTGCATAGC 
TTAATTTTAG 
ACTACGGCTG 



AGAATGATGG 
TTACATCAAT 
ATAATTCAGG 
TATCTTTCAA 
CAATATCAAC 
TGGCTTTAAC 
TCACTAATAG 
TACCGATACA 
GTTGTAAGCC 
TTTTAACGTA 
CACCTAACAT 
TATCACCTGT 
CAAATTCACT 
CAAATGATGA 
GCGATGCTTC 
TCTCAACATC 
CGTCTGATCT 
CGCAAGTTGC 
AATGTTGTAC 
GCTCTTGAGT 
CTAGTCGTAC 
CTTCAGCTTC 
TATGTGAATC 
GAACATGTAC 
GAGGAACTGC 
CATAGTTACG 
TATCCACTTG 
TAATATCTCC 
GTACACCTGT 



ATTTTGAATG 
CTCATTCCAT 
TGTTACAACA 
ATATACATCT 
CGTTCCAGCT 
AAGAGGATGG 
GTCCTCATCA 
AGTTGTACAT 
AGCATCTCTT 
TTCAGGAACT 
TACATAAGGG 
TTTCATTGTA 
TTTATCTAAA 
TTTCATATCA 
AACTGTTGAT 
AAAGAACATA 
ACCAGTTAAC 
TCCATACTCT 
ACCTGGACCA 
TACTCTTAAC 
ACCAATAACC 
AATACCACCA 
AGTACCAACT 
AACACTTGCT 
ATTGTAATTA 
TTCAAATTCT 
AACTGAGTGG 
CCCAACGTCA 
GAAATCTTGT 



TATGTTGAAT 
AATTCGTTGT 
CTATCAACGG 
TCACCGTCAT 
AATGCATAAG 
ATACGACCTT 
GCAATCGCTT 
CCATAACCAA 
AAATATCCGG 
TTCAAGCCTT 
TTAGATGTAT 
GCTTTTGATC 
CCGTGTCCTT 
CTTAAGAAAA 
AAATCCAATT 
TGGTTTTGTT 
TTCATATATT 
GGTGCCATGT 
AAGAACTCCA 
GCTAAATCAG 
TCTGGAATTG 
ACACCCCATC 
AATGTATCTG 
AAATATTCTA 
TCAAATGCTT 
AATTTCATAT 
TCAATAACTA 
TCCATTGCTT 
AAAATAACAC 



TAGGATCAA^ 
TATTGTATAC 
TATCTGAAAC 
TACCTTTACC 
CAACAACTAA 
CAAAGTTACG 
TTTCAATTTC 
CCAAGTTGAA 
TAACAACTTT 
TTTCAACTGC 
TTGTACATGA 
CATCTTTAAA 
GGTTGCCTGC 
TTAAATCTTG 
CGATAACATC 
TCAAATATTC 
TAAGAGATTC 
TTGCAATTGT 
CAAATTTACC 
TTGCTGTTGC 
GGAAATAAGA 
CTAGTACGCC 
GAAATGCAGT 
AGTTAACTTG 
TCGTTGCCCA 
TACGTTCAAG 
AATCCACCGG 
TACGTAAAGA 
GAGAAGGTTT 



1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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GTCTGTAATT ACAAAATCAT CTTCTTGACG AAGTAAAGAT TCTAACAAAA CACGAATTGA 3060 

ATAAGGTAAA TTGGAAACTT TAGTAATACC TTGCTCTTCT ACAGCTTTTA AATCATAGTA 3120 

AGTATAACTT TGGCCATTCA AGTCAAAATG TTTTTTTGAT TGCTCTTTAA AATTTGCAGC 3180 

CATTTAATGA TCCCCCTTGA TACATTTTTA TATTTATATG CCTTGATTAA ATTGTATTAT 3240 

TATATTTATT GATAAACAAC TCATCATGCT TAGAAAACGC TTAATTTAGG TTTTGACTTT 3 300 

TTAATCAGAG TATATAAGCA AAACTTATCA TACAGGTAAG GTGTAATAAG TATTTTTTAT 33 60 

TAATTGAGAA TAATTATCAA TTTCGCGAAT GATTCAATTC AATTTTTAAA CGTATTATTT 3420 

CATTGAGCAG AAAGAAAATT ATGGCACCAA ACTTTAATAT TTTTTTCAAT GTCATTCTTT 34 80 

TGATGGGAGT GGGACAGAAA TGATATTTTC GCAAAATTTA TTTCGTCGTC CCACCCCAAC 354 0 

TTGCATTGTC TGTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGGCCCA TCCCCAACTT 3600 

GCACATTATT GTAAGCTGAC TTTTCGTCAG CTTCTGTGTT GGGGCCCTCA CCCCAACTCG 3660 

CATTGCCTGT AGAATTTCTT TTCGAAATTC TCTGTGTTGG GGCCCCTGAC TAGAATTGAA 3720 

AAAAGCTTGT TACAAGCGCA TTTTCGTTCA GTCAACTACT GCCAATATAA CTTCGTAGAG 3780 

25 CATAGAATAT TGATTTATGT CCCAGCCTGA GTTAATTTTC TATAAAAGTA TATTTAATTT 3 84 0 

GCGTTTATAC CGTCAAACTT CACTTTAGCT TTGTCAAACC CCTTTCTATT AAGTTTTCAG 3 900 

AAATAAACCT ATCTTAAAAT ATAAAAAAAT CGAGAATTCX5 TAGTTTAATA ACGAAATTCT 3960 

30 CGTTCTTATC CTTTTGAATA TACTCAATTT TCCACAAAAA CAAACAAGTA GTATATCTGT 4 020 

TCTAGCTACT AGAATGACAT ACTACTTGTT ATTAAAATAC TTAACTAAAC TTTATTAGTT 4 080 

ATCTTTTTTC TCTATATTTC TACGTGACTG ACGCTTTTCA AGAATGTCAG ATTCATAATC 4140 

TTCTTGTTGA CTCTTGATAT ATTCTTGTAA GCGATGTTTA TTCGGAGTCA ATGTTAAACC 4200 * 

TAGGAATTTA CGTTCCTGGT TCGCATCCTT GTAGAAACTT ACCATCATGA GTATGACGAC 4 260 

AAAGGAGAAT GGGAATGCAC TTATAATTGC AGCACTTTGA ATCGCATTTA AAGCTTCAGC 4 320 

GCCGTTACCG CCACCAGCTA ATAAAAGTAC AAATGCTATT AAGGCCTGTG AAATTCCCCA 43 80 

AACAACTTTT ACCATACTAG ATGGATTTAA TGAACCAAAT GTTGTTTGCA TTCCTAATAC 44 4 0 

AAATGTTGCT GAGTCAGCAG ATGTAATAAA GAATGATGCA ATTAATAATA ATGCAATCAA 4 5 00 

CGATAAAACA ATGCCAAATG GCACATGATT AAACACTCCA AATAGCTGTG TTTCAGGAGT 4 56 0 

CATATCAAAA ATTTCTTTGT GTTTCTTACC TGTCTCGATG CCTAATACAC CAAAGACACT 46 20 

AAACCAAACA AAACTAACAA TTGCTGGAAC TAGCAAGACA CCAGAAATGA ACTCTCTAAT 4 6 80 

TGAACGTCCT TTTGAAACTC GTGCAATAAA CACTCCAACG AATGGACTCC AACTTAACCA 4 74 0 
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TGCTGTATCA AAACTATTAA ACAAGAATGT 


GTTTAGTAAA 


w * nv. ^ SvVJ X /War 


AGCTAGTTAA 


4860 




CATATTTAAA 


ATAAGAACAG TTGGTCCAAC 


AATTAAAGCA 




AAA. i. Avj i AUU. 


4920 


5 


TAAACCAATG 


TTCAAGTTAC TTAAGTATTG 


AATACCTTTA 




AU W\ 1\» W\L. 1 


49B0 




TGCTATAAAT 


AAGATAGTAA CAACAATGAT 


GATAATCGCT 




XAl 1 IVjO 


5040 


10 


AACATTGAAC 


AAGTAATGTA AACCACCATT 


AATTTGTAGA 


GCACCCATAC 


^ 1 AAL-VaAAAt. 


5100 


GGCTACCCCA 


ACGATTGTCG CAAATACAGA 


TAAAACGTCA 


ATAAAAATCC 


w\A 1 AoijAk.. (_ 


5160 




TTCTACTTTA 


TCACCTAAAA GAGGACGTAA 


AGTTCTAGAT 


AATAAACCTG 


f I — I TV t ** 1 * 1 T • 

u 1 1 wAww X 1 1 


5220 


IS 


ACGGAATTGC 


GAATATGCCA ACGTAACGCA ACAACACCAT AAACAGCCCA 


A« UATUuAAT 


5280 




CCCCAATGGA 


AAAATGTTGA ACGTAGAGCT 


TCAGTATAAG 


CTTCAGTAGT 


TTTGGGATCT 


5340 




GCTGTAGGTG 


GCGTAGCAAA GTGCGCCATC 


GGTTCAGCTG 


CACCATAAAA 




5400 


20 


ATCCCCATAC 


CAGCACTAAA CAACATAGCA 


AACCATGAAA 


TTGTATTAAA 


WT.CAGGTTTG 


5460 




TCATTTGGTT 


TACCTAGTTT AAGTTTTCCA 


ATAGGACTAA 


AAATAAGGAA 


TATaCAGAAG 


5520 




AACACGATAA 


TCGTAGTAAG AATAAGATAA 


TACCAACCTA 


ACTTTTCTGT 


AaTC CACATT 


5580 


25 


TTAATATTAT 


TGGTAACATA GTTGAATTGT 


TCAGGTAAAA 


ATGCACCAAG 


TAATACGACT 


5640 




ATAGCAACAA 


CAATTGCACT ATAGATGAAG 


ACTGGTGAAT 


ACTTCTTTCC 


ATTTGGATTC 


5700 




TCTGGTGAAG 


AAGAATTCAT AATTAATTAC 


TCCCTTCAAT 


TCTATATTTA 


ATTTTATGTA 


5760 


30 


GTAGAATAAA 


AATATTATCT AAACATTTTA 


TTCAATAACT 


CACG 




5804 




(2) INFORMATION FOR SEQ ID NO: 250: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



40 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 



4S 



ACCCGCGAAT 


ATGGTCCATC 


CTATCGATTT 


ATTTTTAACT 


GGTTTGACAA 


TATTTAATTT 


60 


TTCATAATCA 


TTCTTAGTGA 


TTTTGACATA 


TGTTTTCGGT 


ATGAGCCAGT 


TAATAAATGG 


120 


AAAGAAGAAG 


ACAATCCAAT 


TACTTGCCAA 


ATCaATCATT 


AAATATTCAC 


TATCGTATTT 


180 


GATTATTCGA 


TATTTAGGGT 


TTTTATTAAT 


AACTTTAGAT 


TCGCAAAGCA 


ATGTCTCCAC 


240 


ATCCCTTTAA 


TTTTATGTGT AATACATTTT 


TCGATACTTC 


AAAAGACATT 


CAAATACTAT 


300 


CAAGTTACTG 


TCATCAAAGG 


TTTTATTAAC 


TGATATTtTC 


ATATTTTTAa 


TCTGAATTTA 


360 
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(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

CCAGGGTGCG GAAAGCTTTA AAATTTGGAC TAATAAAGAT GCTGATATTA ATTCTATGAA 60 

AACAGCAGTT TTACAACAAT TAAAAGGAGA ATAACATATG CTTACTGGcA AACAAAAAAG 120 

ATACTTAAGA AGTTTAGCAC ACAATATTGA TCCGATTTTT CAAATTGGAA AAGGCGGTAT 180 

CAACGAAAAT ATGATTAAAC AAATAGATGA TACGTTAGAA AACAGAGAAT TGATTAAAGT 24 0 

ACATGTACTA CAAAATAACT TTGATGATAA AAAAGAATTA GCTGAAACAT TAAGCGAAGC 300 

TACTCATAGT GAATTAGTGC AAGTGATTGG ATCTATGATA GTGATTTATA GAGAATCTAA 360 

AGATAATAAA GAAATTGAAT TGCCATAATA ATGAAAAAGA TAryACTTTA CGGCGGTCAG 420 

25 TTTAACCCTA TCCATACTGC ACATATGATA GTAGCTAGOG AAGTATTTCA T6AATTACAG 4 80 

CCAGATGAAT TTTATTTTTT ACCTAGTTTT ATGTCTCCAT TGAAAAAGCA CCATGATTTT 54 0 

ATAGACGTTC AGCACAGATT AACAATGATA CAGATGATTA TCGACGAGCT TGGTTTTGGA 600 

GATATTTGTG ACGATGAAAT TAAACGTGGT GGTGAAAGTT ATACCTATGA CACGATCAAG 660 

GCATTCAAGG AGCAACACAA AGACAGTGAG TTGTACTTTG TTATTGGGAC GGATCAGTAT 720 

AACCAACTAG AGAAATGGTA TCAAATTGAA TACTTAAAAG AAATGGTTAC TTTTGTAGTT 780 

GTAAATCGAG ACAAAAATAG TCAAAATGTT GAAAATGCTA TGATTGCAAT TCAGATACCT 840 

AGGOTAGATA TAAGTTCGAC AATGATTCGA CAAAGAGTTA GTGAAGGGAA ATCTATCCAA 900 

GTTCTTGTTC CTAAATCCGT TGAAAACTAT ATTAAGGGGG AAGGATTATA TGAACATTGA 960 

AAAA 964 
(2) INFORMATION FOR SEQ ID NO: 252: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 
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TTGTCATGAA ATAAATGGGA TGAATATCAC GACTAGAAGT AATGTTACGA ACAGGAGCGT ' 120 

ATAAACTAGA GACGCTAAAT TCGACATAGT ATGTnGCTCA ATTATCGCTG ATGATGAATT I80 

TAAAGTATGT GCGTTGGAAC TGTCGGATTT TTGTTCATAA TGTTTTGCAT ATTGCGCCAT 24 0 

GATGAATAGT GTAAAAATAA ATAAAACAAT AAGAGATATA ATGCCCATAA TCAAAAGTAT 300 

TTGTTTAGAG CCTTTCATTA TTTCACATCC TTTCTAAAAT ATATTTGTAA CTAAATTTAA 360 

AATAGTTATT TTTGTAATTC TAAACCTTTT TCATCX5CGAA AACAATTAAA. TAGGTCGCGG 42 0 

TATTAATTAT TATATTATTA CCGCTTAATA TGAAAAATAC ATGAaAATTA ATTTTCTAAT 480 

ATACTTTTQA AAAATTATTA CAAATTAGCC CCTTCAAAAC GCGAAAACAT AAGGATTCTA 540 

GTTTCAAAAG GGCTGATAAG CATAAAATGA AATGTAATAT TTCX3ATGTAT AAAATTTTTA 600 

ATTAGCTAAA AATCATCGCA TTAATTTTTT GAGCTACATC ATCAAAATTC GGACATTTTA 660 

20 ACGACACATA TAATTTAATT TTAGGTTCAG TACCAGAAGG ACGTAAAGCG ATAAATCCTT 720 

CGTCAAATAA GACACGAATA ACATTTGATT TAGGAGAATT AATCTGCGAC GTTGTATCTT 780 

TATCCAAATG ATAAACCTCX3 CTAGTTAAAT AATCTTCAAT TGCTTTCACT TTGAGTCCTT 840 

25 GAATCTCTTG CGGTGGATTT GAACGGAATT TGGTCATTAT TGCATTAATT TTCTTTTTCC 900 

CTTTCAAATCC TTCTAGCGTA TGCGgAATAA TGTATCCTCA TGTCTACCAA CAGTTTGATA 960 

AATCTGTTCr AATTCATCTT TCAATGTTTT GCCATATAAT TTTAACTCAG AAGCGTATTT 1020 

TATAATGAGT GGCACAATTT GTACX3GCATC TTTATCACGT ACAAAAGGCT CTGATAGAAA 1080 

ACCGTAACTC TCTTCAAATG CGAAAATCAT ATTTGATGAT CATCCAGTTG TCTTATTTCC 1140 

TGAGCAATAA ATTTAAGCCC GTCAGCACCT CTTTGGTATT CAACATTATT ATA 1193 
(2) INFORMATION FOR SEQ ID NO: 253: 



30 



3S 



CI) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



45 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

TATCAGGATG ATTATGnCGG GGTTTTATTA AGTCTGAGCG TATATnCCTT TTATTCTCCT 60 

AAATGGTTTG GAATTCAAGG AGAAGACGAT GAAATGGTAT CCAAAAACCA AnGAATATAT 120 

SO nGCATTAAGT CTGAGGATGA TAGTGCGGTG GCAATTCGTT CATTAATTTT GCATAAAGAT 180 

GAACCTATGT ATTTAAAAAA ACGTACATGT GTACCTACTT TGTTAATTAA TGGGGAACAT 24 0 
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AAAAAAATCT 


TCGAACAITC 


AGGACATGCA 


CCGCATATTG 


AAGAACCAGA 


AGCATTTATG 


360 




AATTATTATT 


TAAAATTTTT 


AAAAAGCGTA 


TCATAATATG 


TGATATATAA 


ACCTAGGGCA 


420 


5 


TAAAGTCCTT 


AGGCAATGTG 


AAAAAGCTGA 


TTACTATTCA 


TTATTTGATA 


GAAATCAGCT 


480 






TGTATTTGAT 


ATATACTGCT 


CGTTATGCGG 


CTATCTTCCT 


TATATTAAGT 


54 0 


70 


GCCATTAGTG 


CAAAACCTCT 


TAACAATTAG 


GTAAAAAGAG 


CATAAAAAAA 


GGAAGTTTAA 


600 


TAGAATGTAT 


CATCTATCAA 


ACTTCACCPA 


ATTGCGCTAA 


ACAAAATTAT 


AGTTCAATTT 


660 




CGTTGTTTGC 


TTCAGTGATT 


CGTTTATTTA 


CTCGACTCAA 


TAATGATTCG 


ATrrrrriAC 


720 


IS 


GTTGTTGTGC 


ATTAACAAGA 


ATTAATACAG 


TTCTTTCATC 


ATGCTCATTA 


CGTTTTTTAT 


780 




CGAAGTAATC 


TTCTTGAGAT 


AAAATTTTAA 


CTGCTTTAAC 


AACTTGTGGT 


TGTTTGTAGT 


840 




TTAAATGATT 


AATAATATCt 


TTAAGATAGT 


ATTCTTTCTC 


TTTGTTTTCG 


CTGATGTATG 


900 


20 


TCAATACAGC 


GAATTCTTCA 


AAGCTAATTG 


AnAATTCCTT 


TTTAATTAAA 


CTTTTTAATT 


960 




TGTCAGCATA 


AGTGACCATT 


GATAACAACT 


CAAAGCAATC 


ATTGATTTTT 


GTAATTGCCA 


1020 




TGTTTAAAAC 


CTCCCTATTT 


GATGCATCTT 


GCTCGATACA 


TTTGCCCCGA 


TAATATAnTG 


1080 


2S 


TATCTAATCT 


TTATGnAT 










1098 



(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 



40 



4S 



SO 



CCAGGTAAAA 


TTGTGCAATC 


ATTTGACGCA 


TTAATGGACG 


CCTTGGACAA 


TGAAGATTAT 


60 


GAAGGAGAAA 


AAGTCATTCC 


ATTCTTAGAT 


AAACATTTTA 


AATATCAAGA 


TGGCCGATCA 


120 


AGTGAGCGTT 


TAGTCAGAAA 


TTTATTTGGT 


AGCTAAGTTT 


ATATAGTAGT 


CAAAGTGGGA 


180 


GAGGTATAAT 


GATGAAATTT 


TCAGTAATAG 


TTCCAACATr 


CAATTCAGAA 


AAGTATATAA 


240 


CAGAATTACT 


TAATAGCCTT 


GCGAAACAAG 


ATTTTCCGAA 


AACTGAATTT 


GAAGTGGTTG 


300 


TAGTTGATGA 


CTGTTCAACA 


GATCAAACGT 


TACAAATAGT 


TGAAAAGTAT 


CGCAATAAAT 


360 


TGAACTTGAA 


AGTAAGTCAA 


CTCGAAACAA 


ATTCTGGTGG 


TCCAGGTAAA 


CCTAGAAATG 


420 


kGGCGTTAAA 


ACAAGCAGAA 


GGTGAATTTG 


TATTATTTGT 


GGACTCCGAT 


GACTATATAA 


480 


ACAAAGAGAC 


TTTAAAGGAT 


GCAGCAGCAT 


TTATTGATGA 


ACATCACTCA 


GATGTCTTAT 


540 
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CACCTGAAGT TACrrTGTTA AATTCAAGAA TTATCTATAC TTTAAGCCCG ACTAAAATCT 
ATAGAACAGC ATTACTAAAA GATAATQACA TTTATTTrCC AGAAGAATTA AAGAGTGCAG 
5 AAGATCAATT ATTTACAATG AAAGCATATT TAAATGCAAA TCGAATCAGT GTGTTAAGTG 

ATAAAGCGTA TTATTATGCT ACAAAGCGTG AAGGTCAACA TATCAGTAGT GCGTATGTTT 
CACCTGAAGA CTTrTATGAA GTCATCAGAT TGATTGCTCT AGAAATATTA AATCCAGATT 
TAGAAGAAGC CCATAAAAAT CAAATCTTAG CAGAATTTTT AAATCGTCAT TTTAGTTTrr 
CTCGTACGAA TGGCTTCTCA CTTAAAGTTA AACTAGAAGA TCAACCACAA TGGATTAATG 
CTCTAGGAGA CTTTATACAA GCAGTTCCAG AACGTGTAGA TGCArPGGTG ATGAGTAAAT 
TACGACCATT GTTCCACTAC GCGAGAGCGA AAGATATAGA CAACTATAGA ACTGTGGAAG 
AAAGTTACCG TCAAGGTCAA TACTACCGTT TTGATATTGT AGATX3GTAAA TTAAACATTC 
AAITCAATGA AGGCGAACCA TACTTTAAAG GCATTGATAT CGCTAAGCCA AAAGTGAAAA 
TGACAGCATT TAAAITTTGAT AATCATAAAA TTGTTACAGA GCTAACGTTA AATGAATTTA 
TGATTGGCGA AGGACATTAT GATGTCAGAC TTAAATTACA TTCACGAAAC AAGAAGCACA 
2S CAATGTATGT ACCTTTAAGT GTCAATGCGA ATAAACAATA TCGTTTTAAC ATTATGTTAG 

MGATATTAA AGCGTATTTA CCTAAAGAAA AAATTTGGGA TGTTTTCTTA GAAGTCCAAA 
TAGGTACGGA AGTATTTGAA GTGCGTGTTG GTAATCAACG TAATAAATAT GCATATACTC 
30 CAGAAACAAG TGCATTAATT CATTTGAATA ATGATTTTTA TAGATTAACA CCGTATTTCA 

CAAAAGACTT TAATAACATT TCGTTATACT TTACAGCTAT TACATTAACG GATTCAATCT 
CATTGAAGTT AAAAGGTAAA AACAAAATCA TTTTAACTGG TCTGGATCGT GGTTATGTAT 
TTGAAGAAGG TATGGCTAGT GTCGTACTAA AAGACGACAT GGTGATGGGA ATC3TTAAGCC 
AAACffrCAGA AAACGAAGTG CnAAATCTTA CTTAGCAAAG ATATTAAAAA GCGAGACTTC 
AAAAATATTG TTAAGTTAAA CACTGCACAT ATCACTTATC CACTAAATAA ATAATAAATG 

40 

CCCTCAAATC ATTGXGAGCC AACATGATTT GAGGGCTTTA TTTTXSCTGrr TATGACAT6A 
TTATGACArr TCCCTGATTT TCATTTTCAT ATACATTAAA TTGTATACAC TX3GAAATGAG 
GAGGTTATCT ATAATGATAA ATAAAAATGA CATAGTAGCA GATGTAGTAA CTGATTATCC 
GAAAGCAGCG GATATTTTTA GAAGTGTGGG AATAGATTTT TGTTGTCGCG QACAAGTAAG 
TATAGAAGCA GCAGCCTTAG AAAAGAAAAA TGTAGATTTX3 AACGAATTAT TACAGCGTCT 
CAACGACGTT GAACAAACGA ATACACCAGG ITCGTTAAAT CCTAAATTTT TAAATGTTTC 
ATCACTTATT CAATATATTC AATCAGCATA TCATGAACCT CTAAOAOAAG AATTTAAAAA 
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TGAGTTAAAA GAAACATACG ATACATTTAA AAATGGCATG TTAGAGCATA TGCAAAAAGA 24 60 

AGACGATGTC GATTTTCCAA AACTCATTAA ATATGAGCAA GGTGAGGTAG TAGACGATAT 2520 

^ TAATACTGTG ATAGATGATT TAGTTTCAGA CCACATTGCA ACGGGAGAAT TGTTAGTAAA 2580 

AATGAGCGAA TTAACATCTA GTTATGAACC TCCGATAGAA GCGTGTGGTA CTTGGCGACT 264 0 

TGTTTATCAG AGATTAAAAG CACTTGAAGT GTTAACACAT GAACACGTAC ATTTAGAGAA 2700 

TCACGTATTA TTTAAAAAAG TATCATAAAT AACGCGATTA GAAACTGTTG GCAAAAATAA 2760 

GTCCAGCAGT TTTTCGCTAT GTATAAAAGT CATAATAGTG ACATAAACAG CATTATTTGA 2 8 20 

AAAGAAnAAT GGTCAACTTA GCATAAAAAT TGATATGAAn ATTTAATGGT ATAGATAATT 2 880 

15 

A 

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 . , 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 





ACCGTCGAAT 


ATCGCTTGTG 


ATTTACAATT 


TGTGTATTAA 


GATGCTCAAC 


TAATTTGGGT 


60 


30 


ACATATTCCG 


AATTTAGATT 


TGCAAGTACA ACAATTCCAT AATTTTGTTT 


TGGATTTAGT 


120 




AAAATAAATG 


ATGAAAAGTT 


ATCTAGCGTT 


CCTGAATGAA 


ACACTAAATG 


TTCATCATTA 


180 




TTGGTAAACC 


AGCCGGAAGC 


ATATGCATTG 


GCATTAGGTT 


CACCAATTGT 


TGAAGATAAA 


240 


35 


TTTTTATGTG 


ATTGTTGAAC 


TAATGATTTG 


TATTTATCAG 


GTGGATTAAG 


TTGGAATTTT 


300 




ATCC^TGTT CCAAATCTTC AGTTGATGTC ATCATATATG 


CTGATGGTGT 


ATCCCAAAGG 


360 




TTAAATTCAG 


GTTTAGAGAC 


GACAGGTGTC 


GAACCTTGTA 


ATTCATAGCC 


AATAGCATCA 


420 


40 


TCrm-i'GATT 


TGTAATTGGT 


TTGTTTGAAT 


GATGTATGTG 


TCATATGCAA 


AGGCTTGAGC 


460 




CATGAATTTG 


TAATATATTT 


TGTATAGGAT 


TGCTTCGTAA 


CGTTTTGGAT 


AATTAAACCT 


540 


45 


AATAAATCAT 


AGTTCATATT 


TGAGTATTCA 


AATTCTTCTC 


CGGGCTTATG 


ATGTAATTCA 


600 


TCACCCATAA 


TTGCATGGGT 


TACATCATTT 


AAACGATTAT 


TTTTGCITGT 


CACAGAATCT 


660 




tCGCTTGTAA 


TAtCACTAGG 


TATACCACTT 


GTTTGAGCCA 


AAAGTTGCTT 


AATCOTAATA 


720 


SO 


GTTTCATTTT 


GACCATTATA 


GTTCATTTTA 


AAATGAGGCA 


CATGTTTGGA 


TACGGCATCA 


780 




TTTAAGTTTA 


ATCGACCTTC 


TTGAGCTAAT 


TTTAAAATTG 


CAAGACCTGT 


GAAAGCTTTC 


840 
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10 



15 



40 



45 



TGATAACCAT AACCTTTATr TAAAAAAACT TTGCCATTTT TTACTAyTAA AATTGATGCT 
CCAGGAATGT GTCCCTTTTG TAAATCATGC TCGATAATTG TATCTATTTG TTGTTGCGAA 
TCATTGGTTA ACCGTGTCTT CX3TATTGCTA TTTAAT 
(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 



960 
1020 
1056 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

ATGCCaCACT TATTGGTGGC AGGATCGACG GGTAGTGGTA AATCTGTTTG TATAAATGGT 60 

20 ATTATTACAA GTATTTTATT AAATGCTAAG CCGCATGAAG TTAAACTTAT GTTAATCGAT 120 

CCGAAAATGG TTGAACTAAA TGTTTATAAC GGAATTCCAC ACTTATTAAT TCCGGTTGTT 180 

ACAAATCCTC ATAAAGCTGC TCAAGCTTTA GAAAAAATTG TAGCTGAGAT GGAAAGACGT 24 0 

2s TATGATTTAT TCCAACATTC ATCAACTAGA AACATTAAAG GTTATAACGA ATTAATCCGT 3 00 

AAGCAAAATC AAGAATTAGA TGAGAAGCAA CCAGAATTAC CTTATATCGT TGTTATTGTA 3 60 

GATGAGCTTG CAGATTTAAT GATGGTAGCT GGTAAAGAAG TTGAAAATGC GATTCAACGT 4 20 

30 ATTACACAAA TGGCACGTGC AGCAGGTATA CATTTAATTG TAGCGACACA AAGACCTTCT 4 80 

GTGGATGTAA TTACAGGTAT CATTAAAAAT AATATTCCAT CTAGAATAGC TTTTGCTGTG 540 

AGTTCTCAAA CAGATTCAAG AACTATTATT GGTACTGGCG GCGCAGAAAA GTkACTTGGT 600 

AAAGGTGACA TGTTATACGT TGGAAATGGT GACTCATCAC AAACACGTAT TCAAGGGGCG 660 

TTTTTAAGTG ACCAAGAGGT GCAAGATGTT GTAAATTATG TAGTAGAACA ACAACAGGCA 720 

AATTATGTAA AAGAAATGGA ACCAGATGCA CCAGTGGATA AATCGGAAAT GAAAAGTGAA 780 

GATGCTTTAT ATGaTGAAGC GTATTTGTTT GTTGTTGaAC AACAAAAGGC aAGTACATCA 840 

TTGTTACAAC GCCAATTTaG AATTGGtTAT AATAGAGCAT CTAGGTTGAT GGATGATTTA 900 

GAACGCAATC AGGTAATCGG TCCACAAAAA GGAAGCAAGC CTAGACAAGT TTTAATAGAT 960 

CTTAATAATG ACGAGGTGTA AAAAAATGTC AGAAATGAAT GCGGTATATA ACGTTAAACA 1020 

ATaCATTTtA AATTTgATTA AGCAAAATAA ATTGGAATAT GGTGACCAAC TTCCAAGTAA 1080 
TTTATCAATT GCCAGAQAAT TAAATGTAAA AACCGACGAT GTTTATGAAG CAATTCAGCA 



35 



1140 



TTGATTACTG AACAAGTCAT TAAAGATAtT TTGAAGAGGG CACAAGTGTT AAGTCACTGC 1200 
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10 



IS 



2S 



40 



45 



SO 



GrtTTGAATG CGGAACT 1277 
(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

TACCGTCGTT TTATGyGTCA AATTTTACAG TAAATTTTGC TTCATCAAAA GAAATAACCT 60 

TTAACAAGTA TAGTAATTTT ACATTTTACA ATGTTACAAA ATATAATTTT TTATAATTAG 120 

TTAAAATCAC TAAAACGCTT TTATACACTA TCAAATCAGC ATTTATAAAA ATATGAACCG 180 

ATATCCTAAA ATGTTAATAA TATTACAAGA TAATAACArA CCACACAAAG CTACTTATTT 24 0 

TTGATAATAT GGAAATCGTA ATATAAAACn AAAACTTAAT TTACTATATA AATTGTCTTA 300 

ATAATTTTTA AAAGTAGTAA AACATAATTT TAAGGAGGAG TCCCTTTGAA AAAATTAGCA 360 

TTTGCAATAA CAGCAACATC TGGTGCAGCT GCATTTTTAA CGCATCATGA TGCACAAGCT 420 

TCTACACAAC ATACAGTACA ATCTGGTGAA TCATTATGGA GTATTGCTCA AAAATACAAC 4 80 

ACTTCAGTAG AGAGTATTAA ACAAAATAAC CAATTAGATA ACAACTTGGT ATTCCCTGGT 54 0 

30 CAAGTTATCT CAGTAGGTGG AAGTGATGCA CAAAATACGT CAAACACTTC TCCACAAGCT 600 

GGTTCAGCAT CATCTCATAC TGTACAAGCT GGTGAATCAT TAAATATCAT TGCTAGCAGA 660 

TATGGTGTTT CAGTTGATCA ATTAATGGCA GCCAATAACT TACGTGGTTA TTTAATTATG 720 

CCTAACCAAA CATTACAAAT TCCTAATGGT GGATCAGGTG GTACAACACC AACAGCTACA 780 

ACACS&TAGCA ATGGCAATGC ATCATCTTTT AATCACCAAA ATTTATACAC TGCTGGTCAA 84 0 

TGTACATGGT ACGTATTTGA CCGTCGTGCT CAAGCTGGTA GTCCAATTAG CACATATTGG 900 

TCAGACGCTA AGTATTGGGC TGGTAACGCA GCTAATGATG GTTACCAAGT AAACAACACA 960 

CCATCAGTTG GTTCAATTAT GCAAAGCACA CCTGGTCCAT ATGGTCATGT TGCTTATGTT 1020 

GAACGTGTCA ATGGTGATGG TAGTATCTTG ATTTCTGAAA TGAATTACAC ATATGGTCCA 1080 

TACAATATGA ACTACCGTAC AATTCCAGCT TCAGAAGTTT CTAGCTATGC ATTCATCCAT 114 0 

TAATTAAATA AATTGTACTG ATATATACTA GCAATTCACA TCATGTGAGA TTGCTAGTTT 1200 

TTTATTTTTG AAAAAAATTT TCATTTTGGT ACAAAAAATT ATCTCACCCT TCCCTATCAT 1260 

ACATATTTAT ATTTTGTATG AATGGTAGTT AGGTAAAAAT TAACAACCTA CCTATTTGAT 1320 
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ATTTAATTTG TTATACCACrT ATTTTACGCT TTTTCGTCTA CATATACAAA TTTATATTAA 
ATAAAGCCCA ATACAATTTA GGTTAATTAA ACAAGTTGAT AACTATTTAA TTATTCCTTC 
ATTGAAGAAT ATAAACTATT AAATCATTAT TTTGCTCTTA CATATATTTT AATGACCTAA 
CTGaTTATGT TCCATCGAAT ACATTTATAA TATAGCCTCC TAATTAArAT GCyrTGTCTT 
GGTCATTCTA CGTAAATTCT ATAAAATATG TTATCTACTT ACATAArArn CTGrACTTCA 
ATACCACCAT ATGTTTGTGA TACTGAAGTT CAGTTTAGTT TTATTTTCAA TTAGAAAAAT 
AAGTTAAGTA TATAGAATAG TAAACCTGCT AACAATGCTG AAATAGGTAA TGTAATCACC 

CATGTAATGA TCATTCGTTG CGCAgTGCTC CATTTTACAC CTTTAGCTCX3 GTTAGAAGCA 1860 

CCAACACCTA AGATTGATGA TGACACAACG TQAGTTGTTG ATAATGGGAA ATGTAGCGAT 1920 
GATGCAACAA AAATTGTTAA TGCAGATGAT AAATCGGCCG CAGCACCATT TGCTGGACGT 
ATTTTCATAA TATTACCACC TACAGTTTTG ATAATTTTCC AGCCACCAAT TGCAGTACCA 

AGCCCCATTG cTGTCGCACA GGCAAATTTT ACCCATAACT GTGGTTCAAC ACTGCCATCA 2100 

TTCTGTACAT TAGCGACAAT CAATGCCAAC GTAATAATAC CCATTGATTT TTGCGCATCA 2160 

25 TTCGTACCGT GAGAGAATGA TTGTAACGCT GCTGTGAAAA TTTGGAAAAA TCTAAAGTTA 2220 

CGATTCGCTC TTGTTAAATT TGCATTTTTA AAGATAACTT TAAAAATTGA ATACATCAAG 2280 

AAACCAACAC AAAATGCGAT AATCGGTGAA ACGATTAATA CAATAATAAT TTTTGTGAAA 234 0 

CCTTGGTAAT GTAACACTCC AAATGAGCCT TCAGATGCGA TTGCTGCACC CGCAATTGAA 24 00 

CCTATAAGTG CATGTGAAGA CGAACTTGGA ATTCCGTAAA ACCAAGTAGC TAAATTCCAA 2460 

ATAATAGCCG CAAGTATTGC AGCTAACACA ACAACTAATC CATTTTCCAA TTTAAATGGA 2520 

TCGACAATGT CTTTAGTAAT GGTGCCTGCA ACGCCCGTAA ATGTTAAAGC ACCTATAAAG 2580 

TTCM-CACTG CTGCCATTAA AATTGCCX3TT TTAGGgTTAA CGCTCTAGTA GATACAGCAG 264 0 

TAGCTACTGC ATTGGcTGTA TCATGGAaTC CcATTGATAA AGTCAAATAT CAGCGAGAAA 2700 

ATAACTACAG CTATAGTGAC GATGATTATA TATGACATAA ATATATACTC CCCTTAGCTA 2760 

TTTTTCATAA TAATAGTTTC AAAATTATTT GCTACGATTT GACATTTATC AGCGATTTCT 2820 

TCCATGCTTT CATAAATATC TTTTATTTTA ATTAAAGTGA TTGGATCTGT TTCGCTATTG 2880 

AAAATATGTT TAATTGACTG TCTTAAAATA CCATCACAGT TTGTTTCAAA TTCTTTAATA 2 940 

TTAATTGAAT GAATACGCAT ATGTGATAAT TTTTTATCGA CTAATAAGCC GACAGCAAGT 3 000 

so TTC7VTTTCTG CAACTGCTTT TTGAATGTTA TCAACAAACT CAGCCATATA TTCATCTGTG 3 060 

TATTCGATTG AATACATTTC AAACATrGCT GCCXSTTTCTT CAATTGCATC TAAAACATCA 3120 
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TTTAAATCAG TAATTACTTG ATGTACTAAt tCGcACCATG TGACTCATAA GTTTTAATGT *3240 

TGTCTGAGTA TGCTTTTAAA TCTAAATGTG TATTGAAATC CATTTTACCG AATTCAATAG 3300 

5 CAGCACGATC CAGATTGAAA ACCATCTCTT CTAATTGAAC CATAAACTTA TCTTTTTTCT 3360 

TACTAAACAT TTAAAATCCT CCATTTAAGC GATTGTCACC AATCACATTC AGTTATAATT 34 20 

TGTTTCAAAT TAAGACAAGT GAATTTACAA ACTAATGATA CAAATTTGTT ATTATCAATC 34 80 

GTCAGTATAA TTTTAGTGTA CTGATATTAA TTTCAAAAAT GCCTCACAGT AAACAATTTA 354 0 

CTGTATTTGC CCTTATA 3557 



(2) INFORMATION FOR SEQ ID NO: 258: 

IS 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1631 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
AACTATACAT TTCGAAAAAT TCTTCTAGTG AACCTGCGCC ACCAGGAGCC ATGACAAATG 60 

25 

CATCTGCAAG TTCTGCCATT TTATTTTnAC GTTCATGCAT AGAATCAACT AAAATTAATT 120 
CAGTTAAACG TTGGCTTGTG ATTTCATGTT CATCTAACAT TTTAGGCATG ACGCCAATAG 180 

30 CTTTGCCGCC ATGATCTAAT ACACCATCTT GaATGGCACC CATAATGCCA ATTGACCCTG 24 0 

CACCAAATAC TAATTCATAA CCTTGTTCAG CAAAATATTT ACCTAAATCG TATGCTTTTT 300 
GTACATATGA AGGGTCATGA CCTTTGCTTG CACCACAATA AACTGCGATT CGTTTCATGT 360 

3S TAATCCAGCT CCTTAATTCG ATGAATGACT TTTAATAGTG ATTGTTCAAA CACTTTTTGA 420 
TCTTGCTTTG TAAAAGGTGG GGGACCTTTG TGGCGACCAC CTTGTTTTCT AATTTGTGCA 480 
TTCATATATC GTTTATCTAA TAGTTGTTGA ATATTTTTGG AATTGTATAT CTTCCCATTA 540 
TGATGCATGA CAATTAAGAC TTTGTCGACT AATAAACTTG CGAGTCCATA ATCTTGAGTG 600 
ACTACGATAT CATCCTTCGT TGATAATTGA ACAATTTTGT AATCAACTGC ATCTGGTCCA 660 
TCATCAACAT ATAATGTTGA TACATGTGGA GGATATAATT GGTTCGAAAA ATGGCTGAAG 720 

45 

CTCCGAATAA TTGTCACAAA AATGCCTGTC TCAGTTGTTA AATCTATAAT AGAATCAACA 7 80 

ACAGGACAAG CATCTCCATC AATAATAATA TGTGTCACAA TTATGCCTCT GTATTGTTTT 84 0 

CTTTATTTTG TTGAGAGGCG CTTTTGGCAA CATAATCTTT ATATTTTTTA AATGACTTGA 900 

SO 

TGCGTGCTTT ATCAGCTTCT TGTTGGCGTT TTTGTTCTTC TTTGTGTCGT TTTTCAATAT 960 

55 
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CGCCTTTTTT CTCAGTTTTC TCATCTAATT TATTAGGTGT TAAGCCTGCT TTTTCTTCGT 1080 

ATTTTTGTGA TTTTTTCATA TCTTTAATAC GTTGTATTTC ATTCTTTTCG CGGGCTTTTT X14 0 

^ GCTCTTCTTT ATGACGCTTT TCX3ATATTTT TTTGAAGTAT TTTATTCATT TTATCAGCGT 1200 

CTTTACGATT TTGTTTAGCT AATTTTTCGC CTTTTTTCTC AATATAGGCA GGATCATGTT 1260 

CTCTAGCAAA CTTTTTAAGT TCACGTTTAT TTTCAAAATC TTGTTTTTTA TCGCCGACAT 1320 

10 

ATTCTTTAAC ATCACTCGCT GTGTTACTGA TTGCTGCAGA TGTTTTTGAA GCAACTTTAC 13 80 

TTGTAGCATC TGTAACTTTT TGTACGTCCG GATGTTGTTT GATACGTTTA CGTTCAACAA 144 0 

TTAACGGTAC CAATACAATT GGTAATACAT TAATCATAAA TTTGATGACT TTTTTCTTAT ' 1500 

75 

CCATAGATCT TGcCTCCaTA ATTACTTTAT TAAtTTTACa TACCCtATGa TACATCAATA 1560 
TAAACGATGA TAGTAGTGAA TCACTATTAA GTATTTCAGA TGTTTtTTAA AAgaAGaCCC 1620 
AATTAnAAAA A 1631 

20 

(2) INFORMATION FOR SEQ ID NO: 25 9: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6645 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 



35 



40 



45 



50 



CGAAATCATG 


ATTTAATGCT 


TTTTCATATA 


AGCTTTTCCA 


ATTAATCTTT 


CGTCCATGAT 


60 


ATTCTTCAAC 


TGTTGCTAGA 


TATTGTGCAA 


TTTTAGTTAC 


TTTAAAGGAG 


TGTGCTGCAA 


120 


CaTTGTGkTC 


mAAATATTTA 


AATTTTCCaG 


GtAATCTTAT 


AAGTCTTTCC 


aTATCTGATA 


180 


ATCT£TTAAA ATATTGATGT ACACCCATTT 


CAATTACCTC 


CTCCATTAAT 


TAATCATAAA 


240 


TTATACTTTC 


TTTTTACATA 


TCAATCAATT 


AAATATCATT 


TaAATATCTT 


CTTTaTATAA 


300 


cTCTGATTAA 


ATGATACCAA 


AAAATCCtCT 


CAACCTGTTA 


CTTAAACAGG 


CTAAGAGGAT 


360 


AGTCTTGTCT 


TGATATATTA 


CTTAGTGGAT 


GTAATTATAT 


TTTCCTGGAT 


TTAAAATTGT 


420 


TCTTGAAGAT 


TTAACATTAA 


ATCCAGCATA 


GTTCATTTCA 


GAAACAGTAA 


TTGTTCCATT 


480 


AGGGTTTACA 


GATTCAACAA 


CACCAACATG 


TCCATATGGA 


CCAGCAGCTG 


TTTGGAAAAT 


540 


AGCGCCAACT 


TCTGGTGTTT 


TATCTACTTT 


AAATCCTGCA 


ACI"m"GCTG 


CGTAATTCCA 


600 


GTTATTTGCA 


TTGCCCCATA 


AACTTCCTAT 


ACTTCTACCT 


AATTGTGCAC 


GACGATCGAA 


660 


AGCATAATAT 


GTGCAGTTTC 


CATAAGCATA 


TAAGTTTCCT 


CTGTTAGCAA 


CTGATTTATT 


720 
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TACATTAACT GTCTTAGTTA CTGCTTGCTT 
TGCTTGTTGT ACAGGTTGTT TTACTACCTT 
^ TTTAACCGCT TTAGTTTGTT TCACTTTATT 

AAAAATTAAA GGTGTTACAC CAGGATTGTA 
CTCTAAAGCA ATCTTATATA ATGAATCGCC 

w 

CGTTTGTGCT TGAACATTTG ATACATAATT 
ATGTTGTTGT GCATGTGCTG CATTATTTAA 
CACTGTAAGA GATTTTTTCA TCTTGCTGTC 

75 

AAATACTATA GCACAATACA TTTTGTCAAA 
TTAACAATAT AAAAAGAATA CAACCTTTTA 
ATTGAGAACG TTTTCAATAA TTAATTCAAT 

20 

AAATGCACAC TGCATATGTT GTTTT AACAA 
TATCTACCGA ATTGTACTTA AATATTCATA 

25 ATTTATTTAG CTACCTTTAA AAAAACCAAA 

AAATATATAT TAATCGTGCG ACATTGTCTG 
GTCTCCATCC ATCACTGCAT CAACCTTACC 

^ CATTGAGTAT GGATGGAAAA CATATGATCT 

TTCGCCACGA ATTTCAGCCA TTTCACGTGC 
AGACTTTAAC ATTTTCATAG CTGCTTCACG 
ATTAACAACT ATACCTGAGG GGTGGTGGGT 
ATGCTGACCA CCTGCACCAG AAGCTCTGAA 
TTCAATCTCT ATTTCATCAT TATTAAAATC 

40 

ATGACGACGT CCTGATGAAT CAAATGGAGA 
AGCTTTTAAA TAACCATAAG CATTATGCCC 
CGCTTCATCC CCAGGTAGAT AATCAACAGT 

45 

ACGTTGATAC ATTCTAAATA GCATATTAGC 
AGGATGTAAC TCTAGAATTG CGTTATTGGC 
5^ TTCGTATTCA TCCACTTTAG CCTTAAAATT 

TTCTTCATCA AATTCTTCTT GTAATAAATC 

55 



AGGTGCTTGC TTAACTACTA CTTTTTTAGA 840 

TTTAGCTTGG CTTGCTTTTC TTACTGGTGA 900 

TTGAGGCACA AGTGAAATCA CGTCACCAGG 960 

TGAATATAAT TGATTCAACG TTAAGTGATG X020 

AGCAACTACT GTATAAGTTG TCGGTGATTG 1080 

ATGTTGAACA GGTGTTTTTA CTTGTGTGCC 114 0 

AGCTAAAAAA GCTAACACTG ACGAAACCGT 1200 

ATTCCTTTGC TGTTAGTATT TTAAGTATGC 1260 

AGCTATTGTT ATAACGATGT AATCAAATGG 13 20 

TCATAGTGTA AAATGTATTC ATACCATGTA 13 80 

ACCTTGAAAA TCGCCATAGG TAATATTACT 144 0 

ACACAACTTT TAAAAAATAT ATTCTAACTC 1500 

AACAAATCAT ATTCCAAAAT CTAATTTACA 1560 

AACCGACGCC CTTTTAGAGC CTCGGTTTTA 1620 

TCTTAAATAT GATTCGATAA ATGGTCCAAT 1680 

TGTTTCTTCG TTCGTACGAT GATCTTTCAC 174 0 

AATTTGGCTT CCCCAGCCGA TTTCTTTTT G 1800 

CTGCTCTTCC AATTTTAATT GATATAATTT 186 0 

GTTTTTAATT TGAGAACGTT CATTTTGGTT 1920 

AATTCGTATT GCCGATTCAG TTTTGTTAAT 1980 

TGTATCAACT GTAATATCAT CCGGATTGAT 204 0 

TGGAATAACG TCGCATGATG CAAATGATGT 2100 

AATTCGTACT AGTCGGTGTA . CACCTTTTTC 2160 

TTTGATGAGC AATGTTACAC TTTTAATCCC 2220 

TTCAACTTTA AAGCCTTTCT TCTCACAATA 2280 

CCAATCTTGA GACTCCGTGC CACCTGCACC 234 0 

ATCGTGAGGC CCATCTAATA ATAATTGCAA 2400 

AATGACCTCT TGCTCTAAGT CTTCTTTCAT 2460 

CCAAGTAGCA TCCATGTCAT CTACTTCTGC 2520 
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TTGCGCTTTC 


GTTTGGTTAT 


CCCAAAAATT 


AGGTTCTGCC 


ATCATTTCTT 


CATATTCTTG 


2640 




AATATTAGTT 


TCTTTGTTCT 


CTAAGTCAAA 


GAGACCCCCT 


AATTTGTGTT 


AAATCTTGAT 


2700 


c 


TATACTTATC 


TATATTTCGT 


TTGATTTCTG 


ATAATTCCAT 


AGCATTCGCT 


CCTATTTATA 


2760 




TTTCAATTCA 


AGTCATTGAT 


TTGCATCTTT 


TATAATGCTA 


AATTTTAACA 


TAATTTTGTT 


2820 


10 


AAATAACAAT 


GTTAAGAAAT 


ATAAGCACAC 


TGACAATTAG 


TTTATGCATT 


TATTGTTAAA 


2880 


AATCAGTACA 


TTTATCATCG 


ACATATGCCT 


AAACCGATTT 


TTTAAAACTA 


AGTACATAAC 


2940 




AACGTTTAAC 


AACTTCTTCA 


CArmTTAA 


AGTATTTAAC 


GCTTGTAAAA 


TAAAAAGACT 


3000 


IS 


CCTCCCATAA 


CACAAACTAT 


AGGTGTTTAA 


TTGGAAGGAG 


TTATTTTATA 


TCATTTATTT 


3060 


TCCATGGCAA 


TTTTTGAATT 


TTTTACCACT 


ACCACATGGA 


CAATCATCGT 


TACGACCAAC 


3120 




TTGATCGCCT 


TTAACGATTG 


GTTTCGGTTT 


CAcrrrrrcT 


TTACCATCTT 


CAGCTGAAAC 


3180 


20 


GTGcTTCGCT 


TCACCAAACT 


CTGTTGTTTT 


TTCACGTTCA 


ATATTATCTT 


CAACTTGTAC 


3240 




TACAGATm 


AAAATGAATT 


TACAAGTATC 


TTCTTCAATA 


TTTTGCATCA 


TGATATCAAA 


3300 




TAATTCATGA 


CCTTCATTTT 


GATAGTCACG 


TAATGGATTT 


TGTTGTGCAT 


AAGAACGTAA 


3360 


25 


GTGAATACCT 


TGACGTAATT 


GATCCATTGT 


GTCGATATGA 


TCAGTCCAAT 


GGCTATCAAT 


3420 




AGAACGAaGT 


AAAATCATAC 


GCTCAAACTC ATTCATTTGT 


TCTTCTAAGA 


TATCTTTTTG 


3480 




ACrri'GATAT 


GCTGCTTCAA 


TCTTAGCCCA 


AACGACTTCG 


AAAATATCTT 


CAGCATCTTT 


3540 


30 


ACCTTTGATA 


TCATCCTCTG 


TAATGTCACC 


TTCTTGTAAG 


AAGATGTCAT 


TAATGTAGTC 


3600 




GATGAATGGT 


TGATATTCAG 


GCTCGTCATC 


TGCTGTATTA 


ATATAGTAAT 


TGATACTACG 


3660 




TTGTAACGTT 


GAACGTAGCA 


TTGCATCTAC 


AACTTGAGAG 


CTGTCTTCTT 


CATCAATAAT 


3720 


35 


ACTATTTCTT 


TCGTTATAGA 


TAATTTCACG 


TTGTTTACGT 


AATACTTCAT 


CGTATTCTAA 


3780 




GATA&GTTTA CGCGCGTCGA AGTTATTACC TTCTACACGT 


TTTTGTGCTG 


ATTCTACAGC 


3840 




TCTTGATACC 


ArrrrixsATT 


CAATTGGTGT 


AGAGTCATCT 


AAACCTAGTC 


GGCTCATCAT 


3900 


40 


TTTCTGTAAA 


CX3TTCAGAAC 


CAAAACGAAT 


CATTAATTCA 


TCTTGTAATG 


ATAAATAGAA 


3960 




GCGACTATCC 


CCTTTATCAC 


CTTGACGTCC 


AGAACGACCA 


CGTAACTGGT 


CATCAATAOG 


4020 


45 


ACGAGATTCA 


TGTCGCTCTG 


TACCTATTAC 


TGCTAAACCG 


CCTAATTCCT 


CTACGCCTTC 


4080 


ACCTAATTTG 


ATATCTGTAC 


CACX3ACCAGC 


CATGTTAGTG 


GCAATAGTAA 


CGGCACCTTT 


4140 




TTGTCCAGCG 


CCTGCAACAA 


TTTCAGCTTC 


; ACGTTCATGA 


. TTTTTCGCAT 


TTAACACATC 


4200 


SO 


ATGACGGATA 


CCACGrmT 


TAAGTAAATT 


' TGAAATATAT 


' TCAGAAGTCT 


CAACTGCAAC 


4260 




AGTACCTAAT 


AGCACTGGTT 


GCCCTGCCTT GTCTTTTTCA ACAACATCTT 


' CTACTACTGC 


4320 



55 
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TTTATTTGTC GGAATTTGAG TTACTGTCAT GTTATAAATA TTTCTAAATT CTTCTTCTTC 4440 

AGTTTTAGCT GTACCTGTCA TACCCGCAAG TTTATTGTAC ATTCTGAAAT AGTTTTGGAA 4 500 

^ TGTAATAGAC GCCATAGTTT TAGATTCATT TTGAATTTGA ACGCCTTCCT TCGCTTCAAT 4 560 

AGCTTGGTGT AAACCTTCCG AGAAACGACG GCCTGGCATT GTACGTCCTG TAAATTGATC 4620 

GACAATTAAT ACTTCGCCAT CAACAACCAT ATAGTCTACG TCACGTTGTA ATGTAACGTG 4 680 

10 

CGCACGTAAA GCTGTGTTGA TATGACTAAT AACATCAACA TTTTGTACAT CATATAAGTT 4 74 0 

TTCAACTTTG AACATACGTT CAGCTTTATC CGCACCTTGT TCTGTTAAAT GTACAGCTTT 4 800 

CGTTTTTTCA TCGTATTTAT AATCTTCGTC CTGTTTTAAC ATTTTCGCAA AAACATTTGC 4 860 

IS 

TTGTGTATAA AGTGACGTTG ACTTTTCAGC TTCACCAGAA ATAATTAATG GCGTACX5TGC 4920 

CTCGTCGATT AAAATTGAGT CAACCTCATC AATGATTGCA AAATGTAATG GACGCATTAC 4 980 

TCTATCTTCA GAATAATTCA CCATGTTATC TCGTAAGTAA TCAAAACCTA GCTCATTATT 504 0 

2G 

AGTACTGTAA GTAATGTCTT GTGCGTATGC TTCACGTTTT TCTTCTGTCG TCTTACTGTT 5100 

TAAGTTTAAT CCGACAGTCA IKACCTPkAGPiA GTTATATAAC TCAGCCATTT CTTCACTTTG 516 0 

25 AACACTTGAT AAGTATTCAT TGACTGTAAT AACGTGAACA CCTCTACCAG CTAATGCATT 5220 

TAAGTATGTT GGCATTGTCG CTGTTAATGT TTTACCTTCA CCTGTTCTCA TCTCAGCX3AT 52 8 0 

ATCACCTTTA TGAATTGCAA TACCACCCAT AATTTGAACT TTATATGGTG TCATATTGAA 5340 

30 TACACGTTTA GAGCCTTCTC TAACAAGTGC ATATGCTTCT GGTAAAATTT TATCTAAATA 54 00 

ATCATTTTGC TTTTTGACAT TATCAATGTC AGCTAATTCT GTTTGGAATT GTTTCGTTTT 546 0 

ATTACGAATT TCTTCATCAG TTAAAATTGC CGTTTTTTCT TCTAAAGCGA TTACTTTATC 5520 

AGCAAGTTTA CCTAACTGTT TAATTTCTTT ATTATTGCCA TCAAGAATTT TTGATAAAAA 5580 

TCCC^TTTCG TTCGCTCCTT TAGCTAAAAA ACTGTTTGGC CTACAACAAT ATATCTTATC 5640 

ATTTATAGTT AGAAAATTAT ACTTATTTAC TCATTTGTAG AATCAATATA AATATATTTA 5700 

40 

TGACATACTT CATTCACATT CTGTTGTCAA CAAGTTTATC ACTAATAAAT ATATTCTCAA 5760 

TACGCAATTA TACTTCCTAA TAAATTATAT TATAAATATT TTACXaATTTT CGACTCGGAC 5820 

TATACAATAG ACTGACATAC TATTATTAAC TTAACATTCA AATATATACA TCCATTAACA 5880 

45 

TTAGCATAGT CACTATGTTT CATTCAACAA ATTACATTAT CGAACTATGA AATAGTCATA 594 0 

ATTTGCTTTT GGAGTATAAA AAAGCACTTG TGCAAAAACA CAAGTGCTTT AAACTTAATT 6000 

TATTGTTCAC TAGTTTGAAT CAAGCCATAT TTACCGTCTT TACGGCGGTA AACGATACTT 6060 

SO 

GTTCCATCAG TTTCTCTGTC TGTGAATACA AAGAAGTCAT GACCTAATAG ATXCATTTGT 6120 

55 



1057 



10 



EP0 786 519 A2 

ATCTCGTTAT CATCGTAAGC GTCATTATCA ACTTGTCTTT CTTGCATTTC TTGTAATTCG ~ 6240 

GCAACAAACA CTTCTTGATC TCCTCGATCA CGGCTCTTAC GATTAATACG TGTTTTATAT 6300 

TTTCGAACTT GTCTTTCAAG TTTATTATTA ATTAAATcAA TACCTGCGTA TAAATCATCG 6360 

TTTCGCTCTT CAGCTCTTAA CGTAACATTT TTCAATGGAA rrGTTACTTC AATTTTAGTA 6420 

GCTGAATTTG AATAAGTTTr AACTTTAACA TGCGCCACTG CATTTGGTAC GTCATTAAAA 6480 

TAACGTTCCA ACTTACCAAT TTTTTCCkCA ATATAGTTGC GAATAGCATC TGTGATAGTG 6540 

AGGTTATCTC CATGAATTTC AAATCTAATC ATAGTAAATC TCTCCTTAAA CCTCTTTATn 6600 

GGnAACTCnT TATTATATTT AACATTTTTA CGCCAATCGT GC7VAA 664 5 

(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7430 base pairs 
20 TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 . (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

CAGTTCCAGC ACAtCTATTG GGGATCAACA AACTAGGGAA AATGCTAATT ATCAACGTGA 60 

AAACGGTGTT GACGAACAGC AACATACTGA AAATTTAACT AAGAACTTGC ATAATGATAA 120 

AACAATATCA GAAGAAAATC ATCGTAAAAC AGATGATTTG AATAAAGATC AACTAAAGGA 180 

TGATAAAAAA TCATCGCTTA ATAATAAAAA TATTCAACGT GATACAACAA AAAATAACAA 240 

TGCTAATCCT AGCGATGTAA ATCAAGGGTT AGAACAGGCT ATTAATGATG GTAAACAAAG 300 

TAAAGTGGCG TCACAGCAAC AGTCAAAAGA GGCAGATAAT AGTCAAGATT CAAACGCTAA 360 

TAACSATCTA CCTTCACAAA GTCGAATAAA GGAAGCACCA TCATTAAATA AGTTAGATCA 420 

AACAAGTCAA CGAGAAATTG TTAATGAGAC AGAAATAGAG AAAGTACAAC CACAACAAAA 4 80 

TAATCAAGCG AATGATAAAA TTACTAACTA CAATTTTAAC AATGAACAAG AAGTGAAaCC 54 0 

TCAAAAAOVC GAAAAAACAC TATCAGTTTC AGATTTAAAA AACAATCAAA AATCACCAGT 600 

AGAACCAACA AAGGACAAT6 ACAAGAAAAA TGGATTAAAT TTATTAAAAA GTAGTCCAGT 660 

AGCAACGTTA CCAAACAAAG GGACAAAGGA ACTTACTGCA AAAGCGAAAG ATGATCAAAC 720 

GAATAAAGTT GCCAAACAAG GGCAGTATAA AAATCAGGAT CCTATCGTTT TAGTGCATGG 780 

TTTCAATGGG TTTACAGATG ATATTAATCC TTCAGTGTTA GCTCATTATT GGGGCGGTAA 84 0 

TAAAATGAAC ATTCGCCAAG ATTTAGAAGA AAATGGTTAC AAAGCTTATG AAGCAAGTAT 900 



30 



35 



40 



45 



55 
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10 



15 



20 



25 



TCGTGTAGAT TATGGTGCAG CACATGCAGC AAAATATGGA CATGAACGTT ATGGAAAAAC 1020 

ATACGAaGGA ATTTACAAAG ACTGGAAACC AGGACAGAAG GTACACCTAG TTGGACATAG 1080 

TATGGGCGGT CAAACGATAC GTCAACTAGA AGAATTACTG CGTAATGGTA ATCGTGAAGA 114 0 

AATAGAGTAT CAAAAGAAAC ATGGTGGcGA AATTTCTCCA CTATTC7VAAG GTAATCATGA 1200 

CAATATGATT TCATCAATTA CTACTTTAGG AACACCACAT AATGGTACAC ACGCATCAGA 1260 

TTTAGCTGGT AATGAAGCTT TAGTGAGACA AATCGTATTT GATATCGGTA AAATGTTTGG 1320 

TAATAAAAAT TCAAGAGTAG ACTTCGGGTT GGCTCAATGG GGTCTAAAAC AGAAGCCAAA 1380 

TGAATCATAT ATTGATTATG TCAAACGCGT TAAACAATCT AATTTATGGA AATCAAAAGA 144 0 

TAATGGATTT TACGATCTGA CGCGTGAGGG TGCAnCAGAT TTAAATCX3TA AAACGTCGTT 1500 

GAACCCTAAC ATTGTGTATA AAACATACAC TGGTGAAGCA ACGCACAAAG CATTAAATAG 1560 

CGATAGACAA AAAGCAGACT TAAATATGTT TTTCCCATTT GTGATTACTG GTAACTTAAT 1620 

COOTAAAGCT ACTGAAAAAG AATGGCGAGA AAACGATGGT TTAGTATCCG TTATTTCTTC 16 80 

TCAACATCCA TTTAATCAAG CTTATACAAA AGCGACAGAT AAAATTCAAA AAGGCATTTG 174 0 

GCAAGTGACG CCTACAAAAC ATGATTGGGA TCATGTTGAC TTTGTAGGAC AAGACAGTTC 1800 

TGATACAGTG CGCACAAGAG AAGAATTACA AGATTTTTGG CATCATTTAG CAGACGATTT 18 60 

AGTGAAAACT GAAAAGCTGA CTGATACTAA GCAAGCATAA TTTATAAAGT AAAGGGAGGA 1920 

30 ATTAATAATG ACTGCAGACT TCTTTCAATT AATCGGATCA TTATTTAGAA TTCTAAAAGA 1980 

ATTATTCAAG TAAAACATTG GCGAGGCCCC AACATAAAGA ATTTCGAAAA GAAATTCTAC 204 0 

AAACAATGCA AGTTGGCGGG GCCCCAACAA AGAAGCTGGC GGAAAGTCAG CTTACAATAA 2100 

55 TGTGCAAGTT GGCGGGGCCC CAACATAGAA GCTGGCGGAA AGTCAGGTTA CAATAATGTG 2160 

CAAGTTGGGG TGGGACGACG AAATAAATTT TGCGAAAATA TCATTTCTGT CCCACTCCCA 2220 

TTGGCATTTA CGAAGTTTAA ATGTGCAATT AGAATATATG TATAACAATA TTAAACACGC 2280 

GGTAAAACGA AGTCAGTCAA TTCAAACTGA TTTCGCCCAC CGCGTGTTTT TAACATAGCT 2340 

TAATAATTAA TAAGCATTAA TGTTCAATTT ATATGGTTGT TTTCCAATAA TAAACCTAAA 2400 

GATATAGAAT TCACGCAATA TCATGCCGAC ACCTATACAT AATCCTAAAA TGAATAGTAG 2460 

TGATATCGCT AGAAAGACCA TTGTATTATC CTCAAATATA TTTGTATATX3 CAAACAATGA 2520 

GTCTAGAATG ATTGGATGTA ATAAATAAAT AAAGAATGAG AAAGCACTAA TCATTTGAAT 2580 

CGTATTAAAT AACATTGTTT TAAAATGCGT GCAAATACCC AAGATAACAA TAAACATAAT 264 0 

ACTATTATAT GGTGTTAATG AATATGAAAA GCTGGTAACG TTCCAATAGT CTCCaTTTGT 2700 



40 



45 



SO 



55 
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10 



IS 



20 . 



25 



30 



35 



40 



4S 



SO 



TCTAAGAAAT TTAATACACG TTCGTAGTTA TAACCCATAT ATGCACCTAA GAAGAAATAA 2820 

AAAATCCATC CGAATATTAT AGTATTTTCA CTTAATGGAT AATAGTGTAG CACGGTATCG 2880 

TGAAACGCTG TGTTGTTCGT AAAGTAATAT AAAAATGATT GCTGTAAAAT AAAAGATAAC 294 0 

AATAATAATA TTTTACTGTT GAATAGGTTA TAGTTAATTT TAAAAATGAT ATAACTCAAA 3000 

ATAAAGAATT GCATGATAAC AACGATAAAA TAGCCATACC ATTGACCTAA TAGGACATTT 3060 

TCAATGAATT GTTTATTGAA ACTTGAATCT GTTAATAATG ATTCACTATA ACTGTAAAAC 3120 

AATCCCATTA ATATGTAAGG AATAAGTATA TATTTTACGC GTGTAGTTAA GTATCTATAG 3180 

GTGACTTTTT GGTAATTCAA GGTTGTCAGT AACTGTGACA AGATAATAAA GCAAGGTGTA 324 0 

CCAAAAATCA CAATATTACG AATGTAAAAT TGTAACACTA AGGATCCACC CTCCATATTT 3300 

TCATGTTTTA AAGTAATTTG TGTAAGTAAA TGTGTGATAA TAATAATTGG ACATATAATA 3360 

GCACGTAAAT ATACGAGTTC AAGTCTAATC TTTTTCATGG AATCCGTCCC ATCTCTTAAT 3420 

TAAATGCTCA AAAGCATCAT CACTAATTAA TATTCTAGGG ATGTAATAAT CATTGGAGTT 34 80 

CGGAGTGACT GCmTTCCT CTAATGAAAA ACCGTATTTT AACCCAGCTT TTTTGATTAC 3 540 

CGGTAATTTA TCGTCATTCA TCAAGCCATA AGGATAGGCT ATAGTTTTCT GCGACTTTTT 3600 

AAAGTTTTTA GTTAGATATT TTTCACTTTT GTTTAAATCT TTTATGATTG TAGCTTCAGA 3 660 

AGCTTTCATT AATTTTGACT TATTATTTTT AGATAAGTTA TGCAAATCGT GGGTATGTGT 3720 

TTCAAATTCC CATAACCCAG TTTTATACAT TTCTTTTAGT TCTTTTTTAC TAATCATATC 3 780 

GAGGTTGTGA AAGTTTTCTT CCCCAACATG ACCTGTGATA ATAAACCCAG TTGCCGGTAT 384 0 

TTTATATTTT TTTAAGATTG GATAAGCATT TTCATAAATA GTTTCATCCA TATCATCAAA 3 900 

GTTAATCCAT ACACTTCGTT TTGGaAACTT ACCTTTTTTC TTGtAATATA AAAATTCTTT 3960 

CAAC^STTAAA AATTTAGCAT CATGTGATTT TAGCCATTTT ATTTGAGATT CAAATTGTGA 4020 

TTGACTAACA CTATAATTTT TAATTTCTTT ACTACTAGAA AAGAAGTAAA TAAAATTATT 4080 

CAGAAAATTC GCTTTTCTTA CACGGTGATA ATTTAATGCC AGAGCACTAT TTTCTTTATA 4140 

TTTCAGTCTT TTAGGTGAAT CGTCATCTGC ATTTGCAATA TGATGACCAT CCAGTGTGCT 4200 

TACAGGCAAT ATGATCAAGA TACTCAACAC TAAAATTATA AATTTTCTAT ACTTCACGAT 4260 

TCTCTTCCTC TCTGCCATTT TTGAATCAAT ATGCTAATTG TAAAAAATAC AAAAATGATA 43 2 0 

ATCGCGAAAA TGCCCATAGT TTCAAATATA TCTAAAATTT CAGTATTTTC AATGTTTAAA 43 80 

GCAACACGTA TTGTATTGAT ACTTTCGTCA TGAATTTCAA ATATAGTACC AATATAAACG 4440 

AGTAGAACAA CTAAACAATA TATCCAAAAG ACACAAGATA TAGCGATAAG TGCTGTTTCT 4500 
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TTGOGTAACC ACCTTTCTTA CGTTTTAATG CTTTTGGAAA TGCGACAAGA ACTACTGCTG 4620 

CGTTAATAAT CCAGTATACT GTCGGATACC AACTTACAAA TATGAGTCCA GCCATATTCT 4680 

5 TTTTCTCGTA GCGACTATCA ATAAAGAGTG CGACTGTAAA TTGAATAACG TTTATAAAAG 4740 

TCATAGTAAA TGATGATAGT AGAAATATTG AAAAACTATA TGTCATAAAT GTATAGTCTA 4 800 

AGAAGTTTGC TGTTATGAAC AAATAGCCTA AATATAGAAG CACTATATAT ACCCATAAAA 4 860 

10 

TCGAGATGAT TTGCTCAAAC ATCAAAATAT ATAAAGGAAA CCTTTTCGTT TTCATTGTGC 4 92 0 

TAAAAAAGTC TCGTAGTAAT ACTTCGTGTC CCCCTTGAGC CCATCTCACG CGTTGCTTCC 4 98 0 

AAAGACCTCC CAATGTTTCT GGAACCAACA TCCAACACAT GGCAAGCGGT TCATACTTAA 504 0 

IS 

TACGATATCC ACGTAAATGC AATTTCCAAG AAACTGCAAT ATCTTCGGTA ATCATATCAG 5100 

TATCCCAGTA GCCAACGTCG ACAACTGCAC TTTTTTTAAA TAGAGTGAAG ACACCCGAAA 5160 

TAGTATTGAC TGCGCCAGCA AGTGTCTGAC TtCGCTTAAT ACAGCCAATT AAACTTGCAT 5220 

20 

ATTCTATCGT TTGAATTTTA CCTAAAATAG AACTCTTATT TCGAATTCTA GGATTACCTG 5280 

TAACTGCACC AAGTTTTGGA TCATGTTTGA AATTCTCAAT CATATAATAT GGTGCATCTT 534 0 

25 GATCAACGAT AGTATCTGCA TCCAAGCACA TTACATAATC ATATGAAGCC TGTTTAATGC 5400 

CTTGATTGAG TGCGTTGGCT TTACCTCTGT TTTCTTGTAA ATCGACGAAA ATAAAGTCAT 54 60 

TATTTTCTTT GATTTTATAG ATGAGTTCTG CTGTATTATC TGAACTTCCA TCATTAATGA 5520 

30 TAATAATTTC TTTCTTCTCG TATTTGAGTG CAAGAACATT AGACAACGTA TCTTCAAyCG 5530 

TTTCACTTTC GTTATAACAG GCAAGTAAAA ATGTAATGCC TTCTAATTCA TCCACATTTA 564 0 

TGTCAGGCTT CTTGTTCAAT GAATATCTAA TTTCTCTGGT AAAATAGAAA TAAATTGAAC 5700 

CGACAATCCA GTAAATAGAC ATAAATACAG GATAAAAAAG CAAAAAGTTA AAAAATTGCA 5760 

ATriCTTTAC CTACCTTTCG TTAGTTAGGT TGTAAGCCAT ATGGTAATTG ATAGTATTTT 5820 

AATTTGCAAT AGATTGTTGT TATAATTAAA CGGAAATATT TGTAATTGCA ACTTAATTTT 5880 

40 

CCTGTAACAT AGTGTGATTA ATTTTCAGTA GGGGGTTATA AAAATTGAAG GATAAGATTA 5940 

TTGATAACGC AATAACCTTA TTTTCAGAGA AGGGGTATGA CGGTACAACA CTTGATGATA 6000 

TAGCTAAAAG TGTAAATATA AAGAAAGCGA GTTTATATTA CCATTTTGAC TCXJAAAAAAA 6060 

45 

GTATTTACGA ACAAAGTGTT AAATGTTGTT TTGATTACCT TAATAATATT ATTATGATGA 6120 

ATCAAAATAA ATCGAACTAT TCAATTGATG CTTTATATCA ATTCTTATTT GAGTTTATTT 6180 

TCGACATCGA AGAAAGGTAT ATTAGAATGT ACGTTCAATT ATCTAATACG CCTGAGGAAT 6240 

SO 

TTTCTGGAAA TATTTACGGA CAAATACAAG ATTTAAATCA ATCATTAAGT AAAGAGATAG 6300 

55 
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TGCTGTTTCT 


TGAAAGTTGG 


TATTTGAAAG 


CATCCTTTTC 


GCAAAAATTT 


GGAGCAGTGG 


6420 




AAGAAAGTAA 


AAGTCAATTC 


AAAGATGAAG 


TGTATTCGCT 


ACTAAATATA 


rrriTGAAGA 


€480 


5 


AATAArrnT 


GTTACTAGTT 


TGTAATAATT 


AACTTACTTT 


TGTAAAAAAA 


GACATGAGAT 


6540 




TA'rrrTTTTA 


AATCTATATA 


AAGTTGACAA 


TACAAATCGA 


TATTGAGAAT 


ATTAAGATGT 


6600 


10 


ATATGAATTT 


TATAAATTAA 


ATGCAATACA 


TTAATATAAA 


TATCAATTGT 


TGCAAAATAC 


6660 


GATTTGTTCA 


ATGATTTGAT 


AATATTATTC 


TTTATATTTG 


TGAATGGTTA 


AGTTTGTCTT 


6720 




TGAACATATT 


ATAAAAGTGT 


AATGTTCCCC 


TGAAAAGAAT 


AAGTTGTCAT 


CTAATTACAG 


6780 


IS 


GAAATCCGCA 


TAAATTAGAT 


GAAATGGAAA 


GTAATAAGTA 


ATAATTTATT 


GATAAGCX3CC 


6840 




TATGTGATGG 


TAAATCATGA 


CATAGGCGCT 


TTTTTTTATA 


AGTTAAAAAT 


GTAAATAAAA 


6900 




ATTATATAAA 


TTACCCACAT 


CTTTTTAAAA 


GGTGTGGGCT 


TTATTATCAT 


TAACCCAACT 


6960 


20 


CACAGTGACG 


GGTTACGCAA 


GGTATTGAAT 


TACCGAGTAC 


GGGCACGCTC 


GGTGTTGTAA 


7020 




AGAGCAAATA 


ATCAAGTAAT 


GATGATGCTT 


CTACTCGATT 


ATAAGAAAGC 


CATGATAGAG 


7080 




TACGATGGTA 


TCTAGTTTTA 


TTATTAATAG 


GTTTGGATAT 


TTAAAGTTGG 


ACAATATTAT 


7140 


25 


ATCTTGTGCA 


AAAATATAAA 


TAAGTTATAC 


ATAATGGTAG 


AGAATCATGA 


TATAATTTTA 


7200 




AACGATAAAA 


TATTTATATA 


AATAATTAGA 


GaAAATGTAG 


TTGTGTATGT yTTGTGGtCG 


7260 


30 


TTAAACTAGA 


TATAATTGTC 


CGATTTATAA 


AACATACATA 


ATGAATACaA 


TGATTGATTA 


7320 


TGTGGAGGAA 


ACCATGAmAG 


AmAAGTTTgA 


TTTAGTAAAA 


CTATTAAATA 


TTCTAAAGAA 


7380 




GAATATTAAA 


TTATTGCTTA 


TTTTACCGGC 


AATATGTCTT 


GTAGTAAGTG 




7430 



55 (2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4082 base pairs 
r (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
^'^ (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

45 

ATTGTTACTC ATTATTTTTT CATATTCACA CAAATGATCT TGTTTATATT TAGCTAATTG 60 

ATTTTTATCT AGCATTTTAT CCTCCTGCTG AGTTTGTAAC CTTTAATAAT TTATTTTCTA 120 

TAAAAACTTA GTATTCCAGT TGCTTATTAT ATCATTGATG AAAGGCTGAA ATAAAACATA 180 

SO 

AACTGTTCCG ACCATTAACG CTGTAGCTAA AGATAAGTCT ACAAGTCCAC CTGTTTTAAA 24 0 

TTGAATCGGT GTCTTCACAT TAAACGGTAA gGAknAAAAT AATTTCACGC CTTTTGGTGT 300 

55 
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AGGCGTCTGA ATAATTTGCA ATAAAAATGC 
AAAGGTTCGA TGTCCAAAGA TCAATCTCAC 
^ AATCTTACTT TGAGTGTGAC ATATATCGGG 

AACCGTAACC GACGAAAAAA TATCCGTTTG 
CATGCCGCAT GAAGCATGTG TTTTACCTGT 

10 

TACCACATCC CTAACAAAAA CACGAACATA 
AATTTAAAAA AGTATCACAT AACTCTTGAA 

,5 ACAAAATATT TAAAGGATGT GTTTGAATAA 

AAGTGCGTAT TCAATGGAGA GTTGCTGATA 
TTACACAAGA CCAAGATATT CATGCAGTTC 

20 TCGGCTCAAC GTTTGGTAAA ACGAATCGCG 

TTATTTATAC TCAAAATGAT CAAAAGGTTT 
AaAATCATTC ATGGTGAGGG CTTCATGAAT 
AAACAAATAC AAAAGGACAA CTGTTCCCAT 
TTTATAACAA AAGATGTGCC ATCAAAGAAA 
AGAAAATCAT AAACAATTTG CCGATGCTTA 

30 

CTACTTCAGA CAAGTATATT AACTGCGATA 
TATCATTTTG TACACCTTCA ATTAATATAG 
^5 TAATCGTTTG AGAAGCCTGT GCCGCTTCAG 

AAGGTTTACC CAATATGACA AAAAAGGGCG 
CAATACTCAT TACTACAGGC AAAATAACAA 
ACTTAAAAAA GTCAATAACG CCCGGTGCTT 
CATGTCTCAA TGCCGTCTTT CCTTCTGGCA 
CCTTAGCATA CTCATCAGGA ATTTTATTTA 

45 

CAACGAGGCA GGATACTATC ACTGATAAAT 
CTGTTTCAGC AACTACAATT GCAAAGGTGA 
CTGTTGCCTC ACGACGAGAA TAATATCCTT 

50 

CAACAGTTCC GTCTCCAATA AATGATGCTA 
TAAATAAAGG TCTCATAACC GGTCTAAAAA 

55 



TATGATTGCG ATAAATAATA TTGAATGCGT 420 

AAAAAAACTA ATTACCTTAA ACCTTCTGCC 4 80 

TAATAAGCTA GCTAGAGTTG CTAGAATGAT 54 0 

AAAATATTGT GTTGTTAGCG CTCCAACGAG 600 

CATATTTGTT CTCCTTTAAT ACTCACATTT 6 60 

TTTTCGGGTT AAAATTCATT AGTATGACAC 72 0 

AACGATTACA AAATCGTTTA TGATGTATTT 78 0 

TGGCAATGAC AGTAAAAAAG GATAATAATG 84 0 

TCAAAATTCC TACAAGTGAA ATTAAAAATA 900 

CTAAATTAGA CAGCAAAGAT GTATCTAGAA 960 

TTATTATCGA TACTGAAGAC CACGAATACA 1020 

ACAATGAATT AACTAAATAA ATTGTATAAA 1080 

GATTTTTTTA ATTGATTCAA CACCCAGCAT 114 0 

AATTTTAACA GTTGTCCTTT TTCACATATA 12 00 

TAATTGGTAG TGTAATGATT GTTCTAATCA 1260 

CAGGAATCTT CGAACCAAGT ATGACGCCAC 132 0 

TACTAAGTGC CCCAATAACA AAACGAGTTA 1380 

AAGGTAAAAA CATATCGGCA AAACCGATAA 144 0 

GTATTTGCAT TAACTCTAAA AATGGAACAA 1500 

TGTAGTTCX3C AATAATGGTA GCAATAGTAC 1560 

ACCACATATC AATGACTGTT TTTAATCCTG 162 0 

TAATACCTAC TTCTGTTGCA GTATCAAAGC 168 0 

ATGCCTCAGT ACGCGCACTT TCAGGTACCT 174 0 

AAGGCCAAAT TCTTGGCATA ATGACTGCTG 180 0 

AGAAAGCAAA AAATTGATTT TGCATGTGCA 1860 

TAGAAACTAC ACTAAATGTC GTTGAAATAA 1920 

CACCATATTG TCTACTTGTA ATTAAGACAC 1980 

AATTATCTAC CGTCGAACGT CCTGGCAATG 204 0 

TAGGACCCAA CATCTCTAAC AAACCGTATT 2100 
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AACTTGAGAA CAATAATCCA CCCGTTTCAT CTGAGTAAAT AACCTTTGAA CCAATTCGTA 2220 

AAAATGTCAT CCATGCAAAA ACAACTGCTA ATATTCGTAA AATTAACCAA CCAATTCTAA 2280 

CGTTAAAAGC ATTGTTCATT AGCCCGTCAG GTTTCAATTT ATCTTTTAAA ATAGTTGAAC 2340 

AAATCAGAGT TATGATACCC GATAAAGTAA TTATCGTCAC AATTAAAAAT GGCATTACGC 2400 

CACCTAATAC ATCTTTAAGC ACGCCTGCTA AAAATGCCAC GGGCAACGTT GTTTGCTTCT 2460 

GTCCATCTTG TTCGACTGGA ATTGGTACTA AAAATAATAA GATACCAATT AAAGACATCG 2520 

TAATAAACTT AAGTCTCCCA ATAACTATCT CTTTCCTTGA AAAGCTATCC ATAAAATCAA 2 580 

TCCATTTCTC TATGTATTCG TTTTAAGTAT ATACAGAATT CTATTCAGTT AACAAACATA 264 0 

TTCCTTATCA TTCTATCTTT CAAAATGTTT ATGTATGCAA AATAATGAAT AATTACAGTT 2700 

ATTAAATATA CGCTATTTCT TGTAATTTTT CAAGATGAAT TCAAAAAAGG TTAAGTACAA 2760 

20 TTACTGATTT CGTACTTAAC CTTTTTTAAA CTCTAATCAT ATGTTAGTTA TTTCATTCTT 2820 

CGTAATAATA TTAAGAAGTA TGGTGCACCG ATAATTGCAA TGATAACCCC AACAGGAATA 2880 

TCCAGTGGCG GATGAATGCC ACGGGCTAAA CCATCTCCAA ATGTTAACAA TATAGCACCA 2 940 

ATTAACCCCG ACATGATAAT AACGTGTAAT GTTTTATTTC CTATTAATTG TGTCGCAATA 3 000 

TGAGGTGCAA TTAATCCTAA AAAGCTAATA CCACCGACAA CTGAAATTGC GGATCCTGCT 3 060 

AATAITACTG CTAAAATTAA CAATAGCATT TTAATAGTTT TAACTTTTAA ACCGAGTGCG 3120 

GTTGCAACAG CATCACCTAG ATTC7VATACA TCTAATTGAT AACTCAATAA AATGATGATT 3180 

GGTATCGTTA TTAAAAACCA AGGTAATATA GTATAAATAT TCGACATATC ATGTCCATAT 3 24 0 

AGACTACCTG TCAACCAAAC AAGCGCTTTG TTTGCTTCCA GTGGATTTCT GATTAATAAG 3300 

AACTGCACAA TCGCCGTACA TATTGCGCCT ATTGCTAAAC CAATTAAGGC AAGCTTTGAA 3360 

CCnlAACAT CATATTTTGA AATTAAAAAT GATAAAAATA AACTTACTGC AAAGGCACCT 3420 

40 AAGAATGAAC CTATAGGTAA TACAAACAAT GGTGCTGTTG GAAAGGTCAT AATAATAATC 34 80 

ACAGCAGCTA AACTGGCACC TTTAGAAATA CCTATAACAT CAGGTGAGGC TAACGGGTTT 3 540 

CTTATTACAG CTTGTATAAT TGCACCTGAA ATAGCCAAGC TACTACCGAT AATAATACCA 3600 

AGTAATGTTC TAGGTATACG ATACTCATTT AAAATAAAAT CATCTTGTGT AAAGATTCCC 3660 

TTAATAGCAT CAATCGGATG AATCATGACA GACCCTACAC ATAAACTTAT GAATATACTC 3720 

ACAATTAAAA GGATTGTGAT TAAACTATAA CGACGTATAA TTTTCGTTGT CATCATATTC 3780 

TTTTCACCCC TTTAATCGTT ATAAATAAGA AGTAAAGTGC ACCTACGAAT GATGTAACAA 3 840 

TCCCTACTGG TGATTCATAA GGATATGTAA TTAAACX3ACT TAATACATCT GATAGTAGTA 3 900 

55 
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AGCGTTTGAC TATATGCGGT ACGATTAAGC CAACAAATCC AATTGGTCCT GCCACTGACA 4020 
CCGACATACC TGTAAGAATA ATGACTAATA GTCCAATGAT AATTCTAACT TTATTTATAT 4080 

5 

TT 4082 

(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 
^0 (A) LENGTH: 1145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

75 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 



20 



25 



30 



35 



40 



45 



50 



TATTAGAAGG 


TCGTTCGGAT 


GAACAATTAA 


AAAATTTAGT 


TAGCGAAGTA 


ACTGACGCCG 


60 


TAGAAAAAAC 


AACGGGGGCA 


AATAGACAAG 


CAATTCACGT 


TGTTATAGAA 


GAAATGAAAC 


120 


CAAACCATTA 


TGGTGTGGCT 


GGCGTAAGAA 


AGTCAGATCA 


ATAATTCTTC 


ATAAGATGCA 


180 


TGCCAATTAA 


TTCTTTGAAA 


ACGAACAAGG 


CGACTTCTAT 


CTGaGTATGA 


TAGAAATCGC 


240 


CTTGTTTATT 


TTTAATCTTC 


ATCTAAAAAG 


TCTTTAATAG 


CTTGTTTATT 


TGTGTTTTTA 


300 


TTAATCTGTA 


ATGCACTACC 


ATCAGTATTT 


GTATTGACAT 


CyTCGTATGA 


GTTCTTGATT 


360 


GGCaCAGTCA 


ATGACTTAAC 


ATCTTTTTCA 


CCTCGGATAC 


CaAAACTCAA 


ACCTGTTTGG 


420 


AAAATCCCTG 


AATCAGGAAT 


GTTTGTATTC 


ACATAGCCTC 


TTAAAATACC 


TGCAACTTTT 


480 


GGTAATTTAA 


CAACTGTTCT 


AAAATTAACC 


ATTTCTTTTT 


TCAATGTTTG 


CATCACTTGT 


540 


TGCTGACGTC 


GCACGCGTCC 


GAAGTCACCT 


TCAGGGTCGT 


GACGGAATCT 


TGCATAACCA 


600 


AGTAATTCTT 


TACCATTCAA 


CCTATGGTTA 


CCCTTTTTCA 


AAGATACACC 


AATATTTTTC 


660 


GACATATCTT 


TTTCGACATT 


AATTGGTACA 


CCTTCAGGCA 


TTAATTCATC 


AATCATTTTC 


720 


TCAAATCCAG 


TAAAATCAAC 


TACTGCATAA 


TATTCAGGAT 


TAATTCCTAA 


ATTTTTATCA 


780 


AGTGTTTTTC 


TAAGTAGCTC 


TGGACCACCT 


AAAGCGTATG 


CTGAATTAAT 




840 


CCATATCCTG 


GAATATCTGC 


ATAAATATCA 


CGCATGACAG 


ACATCATTTT 


CATCTTTTTA 


900 


TTGATAAAGT 


CATATTGAAC 


AACCATGATA 


GAATCTGTTC 


TTGATTGTCC 


ACCTTGTGCT 


960 


TTATCTGCAC 


CGAGTACAAG 


AATAGAAATT 


nTACCATCAT 


TTTTTACTGG 


TCCATTAAAT 


1020 


TGATGTACTT 


TAACATCTTT 


CGCATGTTTC 


TTGGCATATT 


CTACACCGCT 


ATTGTAACTA 


1080 


TGTACAATAT 


ATACAACTAA 


TGCCGATAAG 


TAAAATTACA 


ACAATCAGAA 


GAATGATAGG 


1140 


TAATT 












1145 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7075 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

10 



15 



20 



25 



30 



35 



40 



4$ 



50 



TATGGCTCAT 


CATTAATGCA 


CGTATCGGGT 


AGCGTTTACC 


ATTGATAAGT 


GCTTCATGTT 


60 


TAGCACGAGT 


TCTTAAAATT 


CCATCGCCAT 


AACCGATATC 


AACTACAGCT 


AATTTTGTAT 


120 


ixj'rrrrrAGT 


CACTTCAAAG 


GCAAAGCTAT 


AACCGCAATA 


ATCACCAGCT 


TGTACTTCGC 


180 


GCACTTGAAT 


AACATGTGCT 


TTTAAAGTTA 


ATGACTGAAC 


TATATCATGT 


TGATTCAGTG 


240 


AACTATATGG 


TCTTGAACCG 


TATAACGCAA 


TACCTACACG 


CGCATGTGTA 


TGGTGGGGTA 


300. 


GTAATAyvyg 


TCCTTCCCGA 


TAAAAACTCG 


CACTATTTTG 


AGCATGGATT 


AGGTCGAACT 


360 


GATAACCTTC 


AGATAAAAGT 


GCTTCAACAA 


TTTCCATCCA 


TTGTGAACGT 


TCAACATTAT 


420 


AAtCTGACAC 


ATCGAATTCA 


TCAGCATATC 


CAAAATGGGk 


CCATAAACCA 


CTAATAATCA 


480 


TTTTTGCATT 


TTGATTATGA 


TGGTGATCTT 


TCAATACTTC 


TTTAATTTCG 


TTTAAATCTT 


540 


TAAATCCAGA 


CCGATGTAAT 


AAATTTTCAA 


ATTCTAAGTG 


AACATGAATA 


CCAGCTAAAT 


600 


CATTTTTATG 


GTTATAGTAA 


TATGTCAACG 


ACGGCAAAGT 


CATGTGTATT 


TGATGTTCAC 


660 


GGACTAAATC 


AAACTCGTAA 


ACTGCATTCA 


TTAAAAAGAT 


TGTTGCATCT 


GGAGCAAGTT 


720 


GTCTAATTTG 


AATTGCTTCT 


CGTAGTGATG 


TTGTGCTAAA 


TGTATCTATA 


CCTGCATGGA 


780 


TAAACTGAGT 


TACAGCAAAT 


TCTAGGTCAT 


AGTGATATGC 


ATTaTTTTTA ACAACTGCCA 


640 


TTAATGGCTG 


ATTGTTTTTG 


ACTGTGATTG 


CATTTTGTAA 


AAATATTTTC 


TTATTTACAG 


900 


ACC^TGTTGC 


TGTCAATGTA 


TTACACCTCT 


TTCjlAAl lAl 


TTAATAAATT 


TTCGTAAAAA 


7 w U 


TTAACCACGT 


TTATTAACAC 


TTTTTCATCA 


AAATTTAAAT 


GTGATGTGTG 


CAAACCAGTT 


1020 


ACAAAACCTT 


TATCTTCATT 


TCGTGTTCCT 


ATAAAAACAA 


AGTAAGCTGG 


AGCTAGTTGT 


1080 


TGACCATAAA 


AACTAAAATC 


TTCCCCAAAT 


AAGAATGGCG 


TTGGTTTGTC ATAGACATTT 


1140 


AAATCAGCTT 


TTATTAAGGC 


GTCCTCTATT 


TGAGTACGTA 


ATTTCGGACT 


ATTGATTGTA 


1200 


GGGGGATAAC 


CTTCTGCAAA 


TTTAACTTCA 


CAATCTACAT 


TAAACAGAAG 


CTTGACACTT 


1260 


TCTGCTATCT 


TGTGCATTTG 


ATTTTTAACG 


ATTGTTAAAT 


CATCAATATC 


ATATGTACGA 


1320 


ATAGTACCTT 


CTAAATAGCC 


ATTACTTGGT 


ACAGTGTTAA 


TCGCTTCACC 


AGCTTTAAAA 


1380 


TGACCAATAT 


GAACAATATT 


TCGTTTCAAA 


CCGTTAAGGT 


GAAATTGTTG 


AATTTGTGAT 


1440 
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ACATGACTTG ACAGGCCTGT TAAGAAAAAG CGATACTCTG TTGCGCTGGC CGTAATTTCT 1560 

TCATCTCTTA TCACTGCAAT GCCTTCATCA GCAAATGGGT TAACATGAAT ACCAAATACC 1620 

5 

GCTTCAATTG GATACTTATC AAAGGCACCG GCTTTTATTA ATCGATTTGC ACCGCCACCA 1680 

GTTTCTTCTG CAGGTTGGAA AATGAAAACG ACATTTTGCG GTAATTGACC TGCATCTTGC 174 0 

ATGTCTTTGC AACGTTGTAC AAAAAGCATT AATGCAGTTG TATGACCATC ATGTCCACAA 1800 

GCATGCATCA CATGATCAGA TTGACTGCGA TAAGGCACAT CATTTTCCTC TAAAATAGGT 1860 

AACGCATCAA TATCAGCTCT ATACGCTATC GTATGTGAGC CATTACCTTC TAAGTATGCA 1920 

IS ATGACGCCAG TTTCCAATGG GCAATCGTAT TTAATATTTA AACTATCTAA AAACGCTTTA 1980 

ATATAAGCAG TTGTTTCAAA TTCATGTAAG CTTAATTCAG GATGTTGATG TAAATGACGG 204 0 

CGATGTTTCG TAACAAATTC TAATTCATTC ATAATTATCA ATCCTTTGTG TTAAATTACT 2100 

ATATAAATAG TGTAACGtAT TTCGAAATTT GTGATCATAA GTTTATTCAA TGCTAAACAA 2160 

TAAGGTTGAG ACATAATCGT ATCTCAACCT TGAAATTATT ATACGTTGAC GTCACTAGTC 2220 

ATTCAGTTTT CTTAATGCTG CTACAATCTC TTTTTTAGTA TCTTGTACTT CAGAAGCTTG 2280 

2S 

CTTAATCACT TTTGCAGGTG TACCAGCAAC AACTGCACCA GCTGGTACAT CTTGTGTCAC 234 0 

AATCGCGCCA GCTGcAACAA TAGCACCTTT ACCAACACGT ACACCTTCTA AAATAACTGC 24 00 

ATTTGCACCG ATTAATACAT CATCCTCGAT TATAACCGGT GAAGCACTAG GGGGTTCAAT 24 60 

30 

CACACCTGCT AATACTGCGC CAGCCCcTAC ATGTACATTT TTACCAGTTG TAGCACGACC 2520 

ACCGAGAGTA GCATTCATAT CAATCATTGT ACCTTCGCCA ACGACTGCGC CAATATTAAT 2580 

35 TGTTGCGCCC ATCATAACGA CAGCACCATC TTCAATAATG GCTTGTTCTC TAATAAACGC 2640 

ACCTGGTTCA ArTCGTGCAT TCGTATTTGT TAAGTCTTTT AATGGAATAG CAGAATTGCG 2700 

ACGATCCATT TCAATTTCTA TATCTTCGAA TTGACTACCA TATGCTTCGT AAAAAGGTTT 2760 

'^^ CCAATCATCC GCTTCACAAA AGATTACTTT AGATTGTTCT GAACCAAATA CTTTAAAACT 2820 

TTCTGGATAT GTGATGCCTT CAAAATTACC ATTTAAATAT ACmTATTG GTGTAGACTT 2880 

TTTAGCATCA CTTATATATT GAATAATTTC TTCAGCTGTT AAATGTTGTA CCATAAAATA 294 0 

45 

ATCGATCTCC TTTAATATGT TTATAAGTTG TCAAACGTAT AAAAGCCGTT TGGTTTATTA 3000 

ACTAAGCGTT CTGCTGCTTG TATTGCACCA TTCGCAAAAA TATCTTTTGA TTGTGCACGA 3060 

TGCGTGATTT GAATCGTTTC ATCAGTGCCA GCAAATAGAA CTTCATGTTC ACCGACAATC 3120 

SO 

GTACCTCCAC GAATAGAATG TATACCAATA TCTTGTGGCT GGCGTTTTTC ATTTAATTCA 3180 

TGTCTATCAT ACACAGGTGT TACATTTTCT TTCAAAGATA CX3ATCACATC ATACAATTTT 3240 
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TCGAAATCAT CAAGTAGGGG AACAGCAGCT GCTAAAATTT TAGTCAATGC ATGAACX3CCA 336o 

TAACTCATGT TCGCGCTGAA AAACACAGGC ATATTTTGAC TCAATTCATC TAACTTATTA 3420 

AGTAGTTTTT CTTTCTCGnC CAGTgTTGCC ACAACTAATG GCAAATGAAA ATCTTCATCT 34 80 

AATAAAGGGA AAAGCAGATT TGGATTTGAA AAATCTATTG CAACATCGGC ACCTTTAACA 3 540 

TCTGCAATAT GTTGATATTG TTGATATGGC GTTGTTGCTT TCGGTGTATT TTCAATGACC 3600 

CCAACGATTT CATGTCCTTT TTCTTCTGCT AATCTAGCAA CGCGTTGATT CATTGCGCCA 3 660 

TAGCCAATTA GTAATATTTT CACTCATTTT CACCCGCTTT AAATGTGTCA TATGTTTCAC 3 720 

IS GAAGCACTTT AGTATCTGTA TCTTCTAGGC TAACCAATGG TAGACGTAAT TCATAATTTC 3780 

CAAATCCTAA ATAACTTGTT AGAGCTTTAA TAGGAATTGG GTTAATATCA ACTGATAAAG 3 840 

CTGATAACAG TGTGCCGATT GGTTTAAATT GATCTTGAAT ATCTAATCCA CTTTGTTGAG 3 900 

CATCGTATAA CGCTTGAAAT TCTTTAGGAA TGACATTGGC AATAACAGAG ATAACCCCTT 3960 

GACCGCCACG TTGATAGTAT TCGACGACGT TGTCATCATT GCCACTATAT AATGCAAATG 4 020 

AATTTGTATC AATGCGCTTT TTCACTTCTT CTAAATACTC AAAATCATTC GTAGCATCTT 4 080 

TTAAAGCAAC TATATAAGGA TGTTGACTTA ATATTTCTAC AGTTTCTGGT TCAATTGTCA 4140 

TGTTCGTTCT TGAAGGAACA TTGTACAGCA CGACTGGTAA TTTCACAGCA TCTGCAATCG 4 200 

CTTCAAAGTG TTTGACTAAA CCACGTTGGT TCGTTTTGTT GTAGTAGGGC GTAATTAACA 4 260 

TAATTGCATC AGCCCCTAAG GCTTTAGCTT GGATTGAAGC TTGGATTGAC TTTTCAGTAT 4 320 

CATTAGTGCC AGTTCCTGCT ATGACAGGAA CACGTTTATC TACAAGATCA ATAACTGTTT 4 3 80 

35 TTAGAATGCG TTCTTTTTCA TCTGTTGTTA AAGTAGGGCT CTCAGCAGTA GTTCCATTAA 4440 

CGATGATTGC TTGGGCATTA TTTTCTAGTA AAAAATTAAC GTGT G TTTTC AAAGCTTCAA 4 500 

TATKUVCTTT GTTATTTGTA AAAGGGGTTG TAAGTGCaAC SCCAACACCC TCAAATAAAT 4560 

GTGTCATTTT AATTCGCTCC TTTTAAACGC ATAACTTGTT CCAATACTTG TACAGCATTT 4 620 

AATGCAGGAC CTTTTAATAA ATTGTCTGAT GTACACCATA CATGGAAAGT ATTTTCTAAT 4680 

GAATCATCTC TACGTATACG GCCAACAAAC ACTTCATCTT TATTAGTAGA ATTGATTGCC 4740 

ATTGGATATT CATTGTTCTC TGGATTGTCT ACTAAAACAA CGCGGTCATC TTGATCAAAT 4 800 

AACGCTTTAA TATCTTCTGC TGTTGTTTCT TTGTCAAGCG TTACATCAAT TTCAACACTA 4 860 

TGACTATCTT GAACAGGCAC ACGTGCX3CAT GTTGCTGTTA CTTTTAAGTC TGGCGCATTT 4 920 

AAAATTTTTC TCGTCTCATC AATCATTTTT TGTTCTTCTT TTGTATATCC GTTTTCTAAA 4 980 

AACACATCAA TATGCGGTAA CACATTATTA TAAATTGGAT GTGGATATGC TTCTGGTGCT 504 0 
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TGATATGTTG TATATGCCAC TCGTTTTAAA CCATAAGCAT CTTGCAATAC TTTTAGAGGT 5160 

ACAACAGATT GAATCGTAGA GCAGTTTGGA TTGGCAATGA TACCTCTTGT AAATGTAGGT 5220 

5 

TCATTGACTT CCXX3AACGAT TAAATCAATA TCTTCTGCCA TACGCCATTG ACTTGAATTG 5280 

TCTATAACGA TTGCACCAGC TTTTTCAAAA AGTGGGGCAA AGTGTTCGCT TGTACCGCCA 534 0 

CCAGCACrCA TTAATACATA ATCGAAATGT TCACTTGCAC GAGCATCAGT TAATTCTTGA 54 0 0 

10 

ACTGTATATG TTTTTCCTTG AAATTCAACT TCTTGCCCTG CAGAACGTGC TGATGAAAAT 54 6 0 

AATACTAATT CATCGAAAGG AATATTTTTA CGATTTAATG TCTCCAACAT TTTTGTACCT 552 0 

75 ACTAATCCTG TTGCACCCAC AACTGCTAAC TTTGTCATAA CTTGTCACTC CATTTTATAA 5580 

TAATTTCCaA TTTTTAGAAT ATTTTAACAA TCATTTTACC ATTAAATGTT AAATGCGTCA 564 0 

TATAGTTTTT CTACCGCTTG TTGCCCATTA AAATCATCAA TGACGTATGA AATACTTATT 5700 

TCAGATGTTG TTGTTTGGTA GAAAGGTATA TTATTTTGAA TTAATGTGAA AAATGCTTTT 576 0 

GATGCCACAC CTGACATATC ACGCATGCCT GAGCCAATTA ATGAAATTTT GACATAATGC 5820 

TCATTGATTT TATAAGCTAA TGClTCATAT TGATTCTTTA ATGTTTCAAG AATCATAGAA 5880 

25 

ATTTGATGAA AATCACTATC TTTAATCGTG AAGGATAGTT GTAGCCCATC CAAGTTGACG 594 0 

ATTTGTGAAA TCATATCAAC ATTTACAGCA CCTTCTTCAA GTTCCGTAAA TAGTTGGGTA 6000 

AGTAGCTGAT TGTCAGGTAG GGGATAACTA ATTGTTACAT GCATCATATG TTTATCCAAA 6060 

30 

GCCACACCAG TAACTGCTTT TTTCTCTAAT ATTTCTTCAT TTGACATAAT CCATGTTCCT 6X20 

TTCACGTTCG ATAAAGTTTT TCCTAAATAT AAAGGGATAT TATAOTTTTT AGCTAATTCA 6180 

35 ACACTTCTTG TTTCAAGTAC ACCAGCACCT AAAGCGCTCA TTTCCATCAT TTCTTCATAT 6240 

GAGACGATGT CTAGTCGTTT AGC CT TTGGT AAAAGTCTTG GGTCAGTGGC ATACACACCA 6300 

TCA^rcrGTCGG TATAAATTTC ACAAGGTATT TGATTACTAA CAGCAAGTGC CACAGCGGTC 6360 

GTATCAGAAC CACCTCTGCC TAAAGTTGTT AATTCCTGAT GTTCATTGAT GCCTTGAAAT 6420 

CCAGCAACTA CTAAAATATC GTTTTCTTGA AAGGCTTGTT CAAATGTTTG AGGATTAATT 648 0 

TGAGCAATTT TACTTTTTAA ATGATGGCCA ATGGTTTTAA TACCCGCTTG ATAGCCAGTC 654 0 

45 

ATTGCTTTGG CATTCATACC GATATCATTT AATACCATTG ATAAATAAGA TACAGTTTGT 6600 

TGCTCTCCGG TTGTCAATAA TAATGCCAGT TCTTGTTGTT TTGQTGCTTT AGTCAAGGTT 6660 

GATACATTCG TCATTAATTG ATCTGTTGTG TTACCCATAG CACTTACAAC GACAATTAAa 6720 

50 

TTGTTCATCT TGATTGACTC GCTCCTTTAA CATTTCAGCG ATCCTTTTTA TTriTGTAAA 67 80 

ATCACTGACG GATGATCCGC CAAATTTCAA CACACTTCTT GTTACCATAT AATCCTCCTA 6 84 0 
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TAATCTATAT ACAAGTGATG CACTCCATTA TTTTTAAATA ATGACAAACT CTCAGCTCTT 6960 
AACCAAAAAG TCCAACAAAT TATAACTGCT ATTATAATTG CTTCGGCATC GCACCCTTTC 7020 

5 

AAATTTAGCT GTTAGCAGAC AGTAATCTAa ACTTTACTCA TGATTGATGC GCCTC 7075 
(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; doiible 

(D) TOPOLOGY: linear 

IS 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 
AGACGTACTT TGTGATTmCG AAGyrCGTAC TGmAGCACTT GTCGACGTTG ATGTACTTGT 60 

20 -'— 

TGAACCTGAT TGACTAGTAC TTTGTGATAA TGACTTACTA TCAGAATCAG ATGTACTTTG 120 

TGAATCACTT AATGATTCTG ATGTACTACC TGACTGAGAC GTGCTCATTG AACTACTTAC 180 

GGACATTGAT TTACTGTCTG ATGCAGATAA TGACCCACTT GTACTGATAG AGTCACTTAC 24 0 

25 

TATCTCTGAA GTACTCATCG AGTCTGATGT ACTTGTTGAG ACACTTTGTG ATGCTGCTAT 30 0 

GCTTAGTGAT CCAGAAACAG AACCACTTGT GCTCGTCGAA TCGCTCAATG ATTCTGATGT 360 

ACTCATCGAT TTTGAATCAC TTGTACTTAA TGATATTGAT GTACTTTGTG AATCTGATTT 420 

30 

GCTTGTTGAC GCACTTTGAG AGTTGGCTAT GCTATTTGAA ATACTGATAG AGTCCGAGGT 4 80 

GCTAGCTGAC TCGCTCAATG ATGTTGATGT ACTAATTGCA TTCGATGTAC TGTCACTTAA 54 0 

35 TGATGCTGAT GTACTAGACG ACCCTGATAT ACTCGTTGAT AAGCTTTGTG ACTTAGACAA 600 

GCTTCCTGAT GTACTCATAC TTAATGAGTC ACTGAGTGAT GTTGATGTAC GCAATGAATC 660 

AGATSTACTT GTTGATAGAC TTTCGGATTT TTCAGTACTG CTAGAGTTTG AAATAGAATC 720 

GCTTAATGAT GTTGATTTAC TAGCTGAATC CGACATGCTT GATGATACAC TTTGTGAATT 780 

CACTAAfiCTT GTGCTTGTTG AGCTTGATAC ACTATTACTT TCAGATGTGC TTAATGACTT 84 0 

AGATGCACTC ACAGAATCAG ATAGGCTTAC ACTTGTCGAT TTCGAGGTAC TAGCTGATGT 900 

45 

AGATACCACA ATCGATCCTG ATGTACTCGT TGATGCACTT TGTGAGTCAG CTTTACTTGT 960 

TGACACACTT TGAGATTGTT GTGTACTTCC TGATGTTGAT ACGGAATCAC TCATGCTATT 102 0 

TCTTGTTACT TCATATTTAA AAGTTGTCGT CGTTTTGTTA CCGCTCGGAT CTGTAGAAAC 108 0 

SO 

GATTGATATA GTACTTGTAC CAATGTTTGT TGGTGTACCA CTAATAGTAT TATTTGTACT 114 0 

ATCAAATGTT AGTCCGGATG GCAATCCAGT CACTGTATTC GTCACCGCAT TTCCACTGTT 1200 

55 
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ATTGGTGTCA CTGTTGGTGC TGTCGTATCC ACAACATTTA TTGTAAAAGT TGTCGTCGAT 1320 

TTGTTATTTG CTTGGTCAGT AGACACAACT GTCACTGTTG ATTGACCAAT TTTTGTTGGT 1380 

GTCCCAATGA TTGAATTCGT TGCACTATCG TAACTTAATC CGCTTGGTAA TCCTGTAACT 1440 

GTATTTGTCA CAGTCCCAGT ACCATTATCC GTTGTAGTCA ATACAATAGG ATTCATTGTT 1500 

TTACCCACTT CTATGGTTTG ATTGCCTACA GTTACAGTTG CTGCTTTTAC ATCAGTAAAA 1560 

TAATATGTCA CTGATTGTCC AGCATTCGTC ATTTTTACAG TTTTATTTGT ATCATTATAA 1620 

GTTGACGCAT ATGAACTATC GACGGACGTG TAGTTATATC CTTTAGCAGT CAATGCAGAT 16 80 

IS TGCTGaTTAT CGATTGTCAC GACTTGATCA ACATTTCCTG AATATGTTTT TGGTGGAATA 174 0 

ATATCTTTAC CTGTTGTTAC ATCAACGTAT CTCACTTGTG TAnCAGCAGA CTCTGTATAT 1800 

TCGAATGTTC CAAATTGTAC TTGTTGTAAA TTTGTCGCGC CACCTGTTGA GGCTGTCATT 1860 

GATAATGAAA AGTTGGTCGT ACCACTTTTC GCAATCCAAT CTGAAATATT ACGTGTCCAT 1920 

GTTTGACCTG CATATTTGAC AGTCATAACC TTTGTATCAC CATTATAGTT AATATCAAAA 1980 

TCTTGGaACG TGTTATTTGT AGGTTGaACA TTTAACTTCG CAGCATTATC AGCTGTTGAA 2040 

CTTGATGTAT ACGTTGTCGC AACACCATAA CTATCTGTTG TTACAAATGC ACCAAACGCA 2100 

CCTCCACCAG CTACATTAGA TGGGTCAGCA TTCGCCTTTG CAGCTGAATT TGGTTTAGAT 2160 

GTATTGTGAT ACGTATCCAA TTTGAAGCCA AATGCGTTAC TTAAGCCACC AATACCTACT 2220 

GCGGCACCGT TTAACCCTGT TTCACCTAAT ACACCTGGTG AAAAGGCAAA ACCGATACCA 2280 

TCTCCACCAT TTCCATGCCC TTCATATTTG TTACCTAAAT TTACTTTTCC AGAAAAATGA 234 0 

55 AAACTCTTAT TAGAGTCAAT ACGTGTTCCT AATGTAATAG CACCTTTTTG GCTGTATGCA 2400 

TCCTGTGTTA ACGTCACAAT ACCGGTACTT TGATCATAGG TAGCATTACC TGACX3TTGTC 2460 

ATATATTGTT TTAAGTTATC TTTATTAACT GTAATTGTAT TAGCAGTTAC TGCGGTTGTC 2520 

GTCgCTGctG ACGCAAATGT TGACATAGCT AAGCGACTGA AAGTTCGAAG TTTTACTGGT 2580 

GCGGTGCTAG TTGACGTTGT GCTAGTTTTG TTTAAGTTGA. CCGAAGATGG CGTTGTGCTT 264 0 

TGTGAAGTGT TATTTGATGC AGTACTTTGA TITGTTGATG TATTAATTGG TTGTTCTGTA 2700 

CTTGAAGTTG AAGCTACAGA TTTAGTATCA GAACTTGATG TAGTATTCTT TGAGGATGTT 2760 

GATTCTGATG TAGATGTCAA TTTCTCTTGT TGATTGCTTG TACTATTAGT TGTCGAAGTG 2820 

ACCTTTTCAG ACTTTTCACT TGAGACTGTG TCACTATTTG ATGTTTGTAC CGAACTACTA 2880 

TTTtTCGTTA CACTTGTGGA ATCGGCTGTT GATGTTGATG CTTCGATTGT CGTTGAGTTT 2940 

TGATTACCTA CTGTTTCACT TTGTGTGTTT AATTCAGAAG TTAATGGTGC ATCAGAAGCC 3000 

55 



30 



45 



SO 



1071 
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GTTTTCAGTC CGTATCCCGT CATTTTTTTA CTAATGCTTT GATTATCTTG ACTCACTAAA 3120 

CTATGACTAA TAAATGGTAG CCCCATAATT TTGAACATTT CTATTTCTTT AATTCCGGAT 3180 

5 

TTTACCCAAT TTTTTCCAGA TTTATAAAGT CTTACTCTTG TTTTTTCX3TT TGCTAAGCTG 324 0 

TCATGAAATG CTTTCTGTCT TTTACTCATG TAATAACTCC TTGTATTATC TTTACATTCA 33 00 

TTAGATTATA ATATATGCCA CTATTCAATT TAATACAACT CTTTTTTGAT ACAAAAATAC 33 60 

TCATTTTGTT AAAATTTGTA AAAATTCaTT TTTATTCGTC TAAATGTAAT CGTTTTCATA 34 20 

TTTTTAAAAT TACTTTTTCT CGTTTATGCG TATAATCTTT TTTTATATAA ATTTGGCTAA 34 80 

IS TTGGCTTTAT GTTTAATCAT TATAATTGTT TCGTTTTTAA AATAATTATT GTATTAATAT 3540 

ATCTATACCA TCCACCTTTT ATTTATAAAT AGTTAATTTA CAACTAAACG ATAAATATTA 3 600 

TATGCAAAAT ACATCTTTAA TATTAAAGTA ATACCAATAT TTTTTCAATA AACCTAGTGT 3660 

20 

AATATATGTG TAATTCTAAA AGATTCTTCT TTAAAAATAT AAATACCACG ACATATTGCT 3720 

TTAACATTTT CATTTATAAA GCX3AAAAAAT GCATCGGTAC TAAGTTGAAT GTTTAGTAAG 3780 

GATGCATTGA ATTCACTAAA ATGATTAAAT TACTTATATC TTTTCATCTG ATTGATTATC 3 84 0 

GAAATTTCTT CCTTCTAAAC CTGCTAACTC TTCTTTAGAA GCTGCAGGTG CTTTCATTTC 3 900 

AAATATCTCA TTCACTACTG TGTAATCGTA ATATCCTAAT CTGGCAATAG GTTTAATCGA 3 960 

CTTAATGTCC AATTTACCAT TATCAAGAAT AACCTTATCG TCAATATGAA CTTGGGCAAC 4 020 

30 

TCTTCCTATA ACAATATCTA CGGTAGATAC TGGATCTCCA GTTGGAATAC GAATCGTTTG 4 080 

AACGTACTCA CATTCAAAAT GAACTGGCGA TTCTTTTACA CGATATCCTG GAGCTTCTAT 4140 

35 ACATTTTTCC TTTGTTACAC CTGCAAAATT AAATTCATCC TCTTCTGGTG GCAATGCTTT 4 200 

CX3ATGATAAA TTAACTGCTT CTCTTAAATC ATACGTTGCC ATATTCCACA CAAACCAACC 4260 

TGTCTCTTCA GCATTTTTCA CTGTATCTTT ACGTTCGTGA TCACCAAGAA CGGATTGATT 4 320 

TGCTGCGAAC ATAACCATAG GCGGATCCCA AGTTAAGTTT TGATACTGAC TATAAGGCGC 4 380 

TAAATTATGt TTCCCATCTT TCGATACAGT AGAGATCCAC CCTATTGGAC GTGGTACrTGT 444 0 

ACTACTTTTA AATGGGTCGT GCGGTAAACC ATGACTTCTT ACACCTTGTT TTGGCX3AATA 4 500 

45 

ATTCATACTA TCTTCACCCC TTATAAGTAA TTACATTTAA GGTTACGCCC TCTTTACATA 4 560 

AGCGTCTAAT ATAAATAAAC AATTTATTTA TAAGTAGAAA CTATATATGA CGTGGTTGCT 4620 

TATAATTTGC GTTCTTGATT CGAAAAATTC AGATAAGGAT TTATACAATT AATATTTATG 46 80 

SO 

ATATCTTTTG TAAATTTAAT TAATTATAGT TACTTCAATC ATGATTAGTT TATAATAATA 4 74 0 

AAGTGAAATT GAAAAAGACA GCTATTATGC GATGAGCGAA AAACTTCAAG TAAAACAAGA 4 8 00 

55 
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TAAGTTCAAA AAAGAATTCA AACCTGTTAT GCACTTAAAA GGTGATGCAT TCAATCAACA 4 920 

GTTACAATCT TTGATTAACA AATATCCACA AATACAAAAA AATATGAAAT CAGAGTTCAT 4 980 

5 

TGCTTATTAT GATAAAGAAA AAAATAGAGA AACAGTAAAA AACTATGCTT GGAACCTTCA 504 0 

AAAATCTATA AATGACATTA TGCAATCATA TCCTAGCACA AAATTTGTAC AGTTTTATAA 5100 

AAGATGATGT TTCCCCGTCA ATGGTAGATG GAAATGGCCG TTTAAAATCG GGATACTAAT 516 0 

10 

GTATTTCCAT C 5171 
(2) INFORMATION FOR SEQ ID NO: 265: 

75 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 



2S 



30 



35 



45 



50 





AACCTATTTC 


AGTTATGGGT 


GGTACAGTGA 


TTCTCTTTTC 


ATTTTTAATA 


60 


GGTATTTGGA 


TTGGTCATCC 


TATTGAAACA 


GAAATCAAAC 


CACTTATTAT 


TGGTGCGATT 


120 


ATTATGTACG 


TACTTGGGCT 


TGTAGATGAT 


ATCTACGATT 


TGAAACCGTA 


TATAAAATTG 


IBO 


GCTGGTCAAA 


TTGCCGCTGC 


CTTAGTAGTT 


GCriTTTATG 


GTGTGACTAT 


TGATTTTATT 


240 


TCGTTGCCAA 


TGGGTACAAC 


GATTCATTTT 


GGATTTCTTA 


GTATTCCAAT 


TACTGTGATT 


300 


TGGATTGTTG 


CTATTACAAA 


TGCAATTAAC 


TTAATTGATG 


GACTCGATGG 


TTTGGCGTCG 


360 


GGTGTTTCkG 


CAATCGGACT 


CATTACAATA 


GGGTTCATTG 


CAATTTTACA 


AGCTAATATT 


420 


TTCATAACGA 


TGATTTGTTG 


TGTTTTATTA 


GG CTCTTTAA 


TTGGGTTTTT 


ATTTTACAAT 


480 


TTCCATCCTG 


CCAAAATATT 


TTTAGGTGAT 


AGTGGGGCTT 


TAATGATTGG 


ATTTATCATC 


540 


GGATTCCTTT 


CTTTACTCGG 


ATTCAAAAAT 


ATTACAATTA 


TTGCATTGTT 


CTTCCCAATT 


600 


GTTATCTTAG 


CAGTTCCATT 


CATTGATACT 


TTGTTCGCAA 


TGATTCGACG 


TGTGAAAAAA 


660 


GGGCAGCATA 


TAATGCAAGC 


TGATAAATCG 


CATTTGCATC 


ATAAACTATT 


AGCTTTAGGC 


720 


TACACACATA 


GACAAACAGT 


ATTATTAATC 


TATTCAATCT 


CTATTTTATT 


TAGTCTTTCG 


780 


AGCATTATTT 


TGTATGTATC 


GCCACCATTA 


GGTGTTGTAT 


TAATGTTTGT 


ATTAATCATA 


840 


TTTAGTATTG 


AATTAATTGT 


TGAATTTACA 


GGATTAATAG 


ATAACAACTA 


CCGACCAATA 


900 


TTAAATTTAA 


TTAGTCGTAA 


GTCATCTCAT 


AAAGAGGAAT 


AGGGAATGAA 


AGCATAGCTG 


960 


TATGGGATAA 


TTTGTATTAT 


ATGGCTTTAC 


TCTTTACAAT 


TTTTTTGTAT 


TAAATTTCAA 


1020 
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ATTTACCGTC TTATGATAGT GCTTTTTATT TTTATTCAGT TGGTATATCG MAGGTPACT 
GCTTTGGAGT TTCTTCAGTC AAATCGAAAT TTCCTGCAGT CATTTGATTT AAAAAGTTAA 
TAAACGCTTC ATAGTCACTT TTAACGACAT CGATATAGTA GCTTACCTTA TCAGTGTAAG 
TTTGGTTTCT TAACATAAAA TGAGTTGAAG CTAATTCATA TTCAAATTTA CCAGTTTGAT 
CATAATTCAG TGTTACTATA CATGGTACTG CTTCTCGTAG TTCGACACGC CCGATATCAT 
AAATGACGTC TCTAACAGCA CCGCTATAGG CGCGAATTAA ACCGCCACCA CCTAATTTAA ^ 
TACCACCAAA ATATCTTGTT ACTACGACAC ACGCATTATG aACATCGrGC TTTTTTaATA 
,5 TGTCTAACAT TGGGaCACCG GcAGTTCCtG TCGGTTCACC ATCATCATTC GCnTTTTGAA 

TATTCATTtC AGGTCCAATA GTATATGGAG AACAATTATG AGTGGCATCT TTATGTTCTT 
TTTTTATTGC AGCAATAAAT GCTTTaGCTT CATCTTCATT TTGAACAGGT TTGATATCAG 
20 CAATGAATCT TGATTTACTA ATCACATTTT CAATAATGTG TTCTTTTTTA ACAGTAATGA 

TATTTTGTGT CATAATAACT CCTTAATTCA TAAGCTTAAG ATTATTTAAT CTTCATTATA 
CACTGAAAAT GACATGACTA TAAATCGTTT GATTGCCATT TTCTTTTTAA CTGAAATATT 
GTATCATTGC TATGAGTATA TTTTAGGAGG ACGACTATGA AAATTGCTGT GATGACCX^VT 
TCTACAAGTT ATCTGTCGCA GGACTTAATC GATAAATATA ATATTcAAAT AGCGCCATTA 
AGTGTGACTT TTGAAGATGG CAAGATTATA CCAGAAGAAA AAGTTCGTAC TAAAAAGCGT 2040 
GCCATTCAAA CATTAGAAAA GAAAGTATTA GATATTGTAA AAGACTTTGA AGAAGTAACT 2100 
TTATTTGTCA TAAATGGAGA TCATTTCGAA GATGGTCAAG CGTTATACAA AAAGTTACAA 2160 
GATGATTGTC CTTCAGCTTA TCAAGTAGCA TACTCTGAGT TTGGTCCAGT TGTTGCAGCA 2220 
CATTTAGGTT CTGGTGGATT AGGTTTAGGC TATGTTGGCA GAAAAATAAG ATTAACATAA 2280 
TTATAAAATT TTAATAAAAG AGTCTATATT GTAATTGGAA ATTATCTCTC GTATACATGG 234 0 

40 CTTTAAATCT tCATCATTTG AAAGCCAAAA TGCTAAAGAT ATAAGAAAAT CATTATAATA 

TTAGGCTCTT TTTTACX3TTG AAATGAGGTT TTAAGCATTA AACATTACX3G GAAATTAATT 
CATCCTCATA CTTCACTTAC TAATGAAAAA ATTAAAAAAG AAGTAACAGG TGTCATCAAA 2520 
CAAAATTCAA ACTATTATTG TGTTCAATGT GAAAGTACAA ATCCAAAGCA TTTTTATCAG 2580 
TATGATTCCT CAGTACATTC CAAGAAAATT GTATATTGCA GAAATTGTAT ATCACTGGGT 2640 
CGAATGGATA ATGTAACAAG ATATAAAATA ACAGAGAGTT CGCAAAGTTC ATCACAAGCA 2700 
TATTATCATC TCTCATTTGA ATTGTCGGAA CAGCAGTCTT ATGCCTCAGA ACATATTGTT 2760 
CGAGCCATTA GAAAGAGACA AACGATTTTG TTATATGCCG TAACAGGTGC AGGTAAGACA 2820 
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45 
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1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1660 
1920 
1980 



2400 
2460 
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IS 



20 



TCACCACGTG 


TAGATGTTGT 


TGTAGAAATT 


AGTAAACGTA 


TTAAAGACGC 


ATTTCTTAAT 


2940 


GAAGATATAG 


ACATACTACA 


CCAGCAATCA 


AGACAACAAT 


TTGAAGGGCA 


TTTTGTTGTA 


3000 


TGCACAGTGC 


ATCAACTTTA 


CCGATTCAAA 


CAGCACTTTG 


ATACTATTTT 


TATTGATGAA 


3060 


GTCGATGCCT 


TTCCTTTATC 


AATGGATAAA 


AATTTACAAC 


AAGCATTGAA 


GTCATCTTCT 


3120 


AAAGTTGAAC 


ATGCAACAAT 


TTATATGACA 


GCAACACCAC 


CGAAACAACT 


TCTGTCAGAG 


3180 


ATTCCCCACG 


AAAATATAAT 


TAAATTGCCA 


GCTCGCTTTC 


ATAAAAAATC 


ACTTCCAGTT 


3240 


CCTAAATATC 


GTTATTTCAA 


ACTTAATAAT 


AAGAAGATTC 


AGAAAATGTT 


ATACCGAATT 


3300 


TTACAAGATC 


AAATTAATAA 


TCAACGTTAT 


ACACTGGTGT 


TTTTTAACAA 


TATAGAAACA 


3360 


ATGATTAAAA 


CATTTTCGGT 


TTATAAGCAG 


AAAATTACTA 


AATTAACATA 


CXrrCCATAGC 


3420 


GAGGATGTTT 


TTCGCTTTGA 


AAAAGTTGAA 


CAATTAAGGA 


ATGGACATTT 


CGATGTCATT 


3480 


TTTACTACGA 


CAATATTAGA 


ACGTGGATTT 


ACAATGGCAA 


ATTTGGATGT 


TGTTGTTATC 


3540 


GATGCACATC 


AATATACTCA 


AGAGGCTTTA 


ATACAAATTG 


CTGGACGTG 




3589 



(2) INFORMATION FOR SEQ ID NO: 266: 

^5 . (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 
(Cj STRANDEDNESS : double 
CD) TOPOLOGY: linear 

30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 



35 


TTTCCAAGAA 


GGcTTgAAAA AtGTTwCaAG 


TGGTGCGAmC 


CCAGTTGGTT 


TACGACAAGG 


60 


TATCGACAAA 


GCAGTTAAAG 


TTGCTGTTGA 


AGCGTTACAT 


GAAAATTCTC 


AAAAAGTTGA 


120 




AAATAAAAAT 


GAAATTGCGC 


AAGTAGGTGC 


GATTTCAGCA 


GCAGATGAAG 


AAATTGGACG 


180 


40 


TTATATTTCT 


GAAGCTATGG 


AAAAAGTAGG 


TAACGATGGT 


GTCATTACAA 


TTGAAGAATC 


240 




AAATGGACTA 


AACACTGAAC 


TAGAAGTGGT 


TGAAGGTATG 


CAATTTGATC 


GTGGTTATCA 


300 




ATCACCGTAT 


ATGGTTACTG 


ATTCAGATAA 


AATGGTTGCT 


GAATTAGAAC 


GCCCATACAT 


360 


45 


TTTAGTAACA 


GATAAGAAAA 


TCTCGTCTTT 


CCAAGATATC 


TTACCTTTAT 


TAGAACAAGT 


420 




GGTTCAATCT 


AATCGTCCAA 


TCTTAATTGT 


AGCTGATGAA 


GTTGAAGGCG 


ATGCATTAAC 


480 




AAATATCGTG 


CTAAACCGTA 


TGCGTGGCAC 


ATTTACAGCT 


GTTGCAGTAA 


AAGCACCTGG 


540 


50 


TTTTGGTGAT 


CGTAGAAAAG 


CGATGCTTGA 


AGATTTAGCT 


ATTTTAACTG 


GTGCGCAAGT 


600 




GATTACTGAT 


GATTTAGGCT 


TAGATTTAAA 


AGATGCATCA 


ATTGATATGT 


TAGGTACTGC 


660 
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CAGCATTGAT GCACGTGTTA GCCAATTGAA ATCTCAAATT GAAGMACTG AATCTCACTT 
TGATCGTGAA AAATTACAAG AGCGCTTAGC TAAATTAGCA GGTGGTGTTG CAGTTATCAA 
AgTAgGTGCA GCAAGTGAAA CAGAGCTTAA AGAACGTAAA TTACGTATTG AAGATGCATT 
AAATTCTACA CGTGCAGCAG TTGAAGAAGG TATTGTTGCA GGTGGTGGTA CTGCATTAGT 
AAATGTTTAC CAAAAAGTAA GTGAAATTGA AGCTGAAGGT GACATTGAAA CAGGTGT ioi7 

(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 09 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

TTAATCCAGC GTTAACTGTA TTTGCATTTA TTATGATTAT TTCGATTCTT TTAGCGTATG 60 

TATTTAAATG GCTTGGATTA GTGGATGATG TGTTATTAAT GGTCATTATC ATTTGAACTA 120 

TTTCCTTAGG CGTAGTTGTT CCAACTTTAA AAGAAATGAA TATTATGAGA ACAACTATAG 180 

GGCAATTTAT CCTATTAGTA GCAGTACTTG CGGACTTAGT AACTATGATT TTATTAACGG 24 0 

TCTATGGCGC AATCAATGGT CAAGGCGGCA GTACAATATG GTTAATAGGT ATATTAGTTG 3 00 

TTTTCACAGC AATTTCATAT ATTTTAGGTG TTCAATTTAA AAGAATGTCA TTTTTACAAA 3 60 

AATTGATGGA TGGTACGACG CAAATCGGTA TTCGTGCGGT ATTTGCATTA ATAATATTAT 4 20 

TAGTAGCCCT AGCAGAGGGA GTTGGCGCAG AAAATATATT AGGTGCATTC TTAGCAGGTG 4 80 

TCGTTGTTTC ATTATTAAAT CCAGATGAAG AAATGGTTGA AAAGTTAGAC TCATTTGGTT 54 0 

ATGGGTTCTT TATTCCTATT TTCTTTATAA TGGnTGGTGT AGATTTAAAC ATACCTTCAT 600 

40 TAATTAAAGA ACCGAAATTA CTAATTATCA TACCGATTTT AATCGTnGCA TTTATCATTT 660 

CAAAATTAAT TCCAGTCATG TTTATTCGAC GTTGGTTTGA TATGAAAACA ACGATTGCAT 720 

CAGCATTTTT ATTAACATCA ACATTATCGC TCGTGATAGC TGCAGCCAAA ATTTCAGAAA 780 

GATTAAATGC TATTTCAGCT GAAACGTCAG GTATATTAAT TTTAAGCGCA GTCATTACAT 84 0 
GTGTATTCGT TCCGATTATT TTCAAAAAAC TGTTTCCAGT TCCAGATGAG TTTAACCGTA 
AAATTGAAGT TAGTTTAATr GGTAAAAATC AATTAACGAT TCCTATAGCG CAAAATTTAA 
CATCTCAGTT ATATGACGTG ACATTATATT ATCGCAAAGA CTTGAGTGAT CGTCGTCAAT 
TGTCAGATGA TATCACGATG ATAGAAATTG CTGATTATGA ACAAGATGTT TTAGAACGAC 



45 



900 
960 
1020 
1080 
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AAGTTGCTAA ATTAGCCAAA GCACATCAAG TTGAGCGTGT CATTTGCAGA CTTGAAAGCA 1200 

CAACGGACGA TACAGAGTTA GTTGATTCAG GTATTGAAAT TTTCAGTAGC TACTTAAGTA 1260 

ATAAAATCTT ATTAAAAGGT TTAATTGAAA CACCTAACAT GTTGAATTTA TTAAGTAATG 13 20 

TTGAAACGTC ACTATATGAA ATTCAAATGT TAAATTATAA ATATGAAAAT ATTCAATTAC 13 80 



GTAATTTCCC ATTCGGAGGA GACATCATC 

10 

(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 702 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 



25 



30 



35 



40 



SO 



AAAGAGGGTT 


TTTTACTACT 


ATAATCCATC 


CTTTAATGGA 


ATTCCATTTG 


TCCCTTTGGC 


60 


CCGTTCATAA 


CCATAGGAAA 


ATATATGCnG 


haatcattga 


TAGTAAATAC 


ATCAATAATA 


120 


CGAATAATGT 


CGGTGAAATC 


ACGTCCGTAT 


TACCATTATT 


AGCTAAAACA 


TTTTCCAAGT 


180 


TTTCTTTTGA 


ACCGGATACT 


CATAAATGCT 


TTTAATGCnT 


GGTTTTGTCT 


GTCGCCATCT 


240 


TTAGAATTTG 


TAATACTTTT 


TTAAATTCTT 


TAGCGAACAA 


CTCATTATCT 


TTATCGTTTT 


300 


TAGCCATTTG 


ACGATTCAAA 


TCGTTAGCTC 


TTACGCCTAA 


TGCTCGACCT 


GTCGCATGTA 


360 


G CTT ATTCAC 


ATTATTGTCG 


ACTTGATTTA 


ATTGGCCACT 


AACAGAATCT 


GCAATTGATT 


420 


TTGATTCTTG 


TGTATCTGAT 


AGCAATTGCG 


TACTCTTTTC 


AGAAATTCTA 


CTAATTTCTT 


480 


TATCTAAATT 


TGAAGACATC 


GTATTAAATT 


CATCATTTTT 


GCCTTTATCA 


ATTTTTGGTT 


540 


CTTGTGGCTC 


TTCAGCAAAA 


GTCTTTTTAA 


CGTTTTCTAA 


CTGATCAATC 


AGCTTGGAAA 


600 


TATCTTCTTJc 


ArrrrriviT 


gTATTCTTTT 


TGTTATTTAA 


AATGTCATCA 


ATCAGTTTGT 


660 


CTGAGTTTTC 


TTCCATTGAA 


TCAATTTGaT 


GTAACACAGC 


TACTTTATCG 


TCTTTGAAAC 


720 


TTTCCATGTC 


ATTGATAACT 


TGGTCAACCA 


TCATATCAAT 


TAAACGTTTG 


TTGTCGAATG 


780 


GTTTATCTTC 


tCTGCcri-rr 


GTATCTGTGT 


ACATTTTATA 


ATGCGCATCA 


AACCTAGATA 


840 


ATGCACTCAA 


TTGCTGGCTT 


AATGCATCTT 


TCGATAAGCG 


ACCATCaAGG 


TTATGATTCA 


900 


ATGTTACATC 


CACAACACTC 


GTTGCTTTCT 


TATCATTTGG 


TTCATCTTGA 


CGATTTGCTT 


960 


GTCCAAATAA 


CAATTGTAAA 


TGCATTGTTT 


TATCTTTTAA 


GAAATCTTTC 


TCAGCATCCT 


1020 


TTTTCAATTT 


AGCAACGCCA 


TTGACTTCAA 


CTTTATATTC 


CTTGTTAGAT 


GTATCGAGTT 


1080 
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TAAAATGAGG AtCTGTTGCA ACA(m-AATT GATTAATATC ATTACITT^ ' 

TACGTTGCAC TTTGACACCA TCATTAATCA AACTACTTCT GTCTTCCGCA ACTTGGTTAT 126 

* CGTAATCTGT TAAATTAATG TGTTCTGCTA ACGGTTrPTT CAAATTATAT TCAITTTTAT 132 

AACGTTTTGC TTCTTTGACA ATTGCTTCGT ATTTATTAGC TTCATCCTCA rTTAAACCTG 138, 

CAGCTATAAA GTCTTGTTTA GACATGTTAT AGATAAATGT TGTATCTGTA TCAGGTTCTT 144, 

TGACAATATC ATCATGAAGT TGTTTCTCTA AGTTTrCAGC GAATTGAGCA rrGTTCATTT 150( 

TAATGCTATT TAGCGCATCT TGTAAGTCTT TGTTATTTGC AAGCTCATCT TGCAGTGATT 156< 

CTGTTAATTG CTTACGATAn TCTTCAATCA TACCTTTTGA AAATGGTGAC TCTTGrGAlT 162C 

GAATGATTTC TCTTAATTTA TCTAAGTTTT CTTTAACAGT TTGTTTATAT TCTTCriTAC 168( 

CTGTATCTTG CATACTTGAT TGTTGATCAA TTTGGCTGTC CATCTGTTTT AATGCATTGA 174 c 

20 TATAGTTATC AAGTTCCACG CTATCTTTTT GCGATTTATA ATCTTGTAAC ATTTTATCCA 180C 

TCGCTGTATT GTGCTCGTCA AATAATGAAT TTTGTTTTTC AATTAAAGTC GAAACATTAT I860 

AATCTGTGTT CACTCTGAAC GTATCTGAAT TCGCACTCAA TAAIXSATTTA TTCTATGTTT 1920 

25 GGAACCATTT TGTAATGTCT TTGTTTGCAG AAATTGAATT TACAAGCGTA TCTGTAAATA 1980 

ATTCCGGGAA GTCGTTAATT GGATTTAATA AGTAATTCGA GAATTTACTA TTCACACCAT 2040 

GTTCACGCGT CATAATAGCG CCAACATTTT TTTGTGCATT ATGTAAATTA TCAATGATGC 2100 

30 

TTGTTAAATA AATITCGACT AAGTTTTTGT TAAAGTCGTT AAGTACATTA CTTACAACTT 2160 

rrTCTGTGTT TTTAGCTACT TCTTCTTTTT GTCCTACAGC TCTTTTATAC TGTAGCGATA 2220 

^ TTTTCGATGG TGTTTTAGCG TCTAATTGCA TTGCCAATTT TCAAAAGTTT TCTCGGATAA 2280 

CAATCATGAC TTGGTATCCA CCATTTTTCA AACCAGACTC AGCAACGTTT CTTClTACrc 2340 

TTTCAAATTT ATAGTrTTTC TCATTTGCTA ACCTTTTAAT AAATGCTTGA CCCAGCTCAA 2400 

CTTTTTTACC GTTATATGTC GTTGGTTGAT CCTCGTTAAC AATTGCGATA TCTATTTTAT 2460 

TAi 1-1-1-1 ATT ACTTACACTT TGGGATCCTT TTTCTGATTG ATCTCCATAT TTTGTTTGAA 2520 

CAAAAAATAT CATACTAACT ATGGCAATTA TAATAATTAA AGTGACAATT AATGCATAAA 2580 

TCCAATTTTT CTTTTTCATG CTTATTTCCT TTCAGTTGTT TTCTTAAAAA AATGATAAGC 2640 

AAAGCCACAT TAGAAAATGT GACTTTGCCA ATTTCAGAAT GCTTATTGCA AACCX3AAATT 2700 

ATTAGAAAGT TGTTGGTCTT GTTCTTGAAC GGCATCAGCA GTGCtATTcA ATTGTTCTTr 2760 

AATTTCTTCT AATAATTGTG CAAATTTTTC TACTTTAGGA CTAAGTTGTT GGAATTGCTC 2820 

TTCGAAACGG CTGAAAGCTT GACCTTCCCA GTTCGCTGCA ATTTCACCTt GTGCACGkGT 2880 
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TCTGATTTCC TCTGGACTCA TCTTAATCAT TGCCATAACT AGAAACCTCC TGAATATTTT 3000 

AAGTTTATCa AAACTTTrrA GGGACACTAT TTTTTGAAAA AGTGCTCCTT ACTCAAATAA 3060 

TATATAAATT ATTAGTATAT GTATATAGTt TTTTAAGTAT TTTTAGCTTT TTTAAAATAA 3120 

ATATATTGAA TATAACCATA TATTTTTAAT TAACCATTCA TTTTTGTAAT ATAAATGTGT 3180 

ATACTAAAAT TAAATTAAAT ACATAAAGGA TTAAATGGTT ATTATGAAGA AAACAATTTT 324 0 

ACTGACGATG ACAACTCTTA CTTTATTTAG TATGTCGCCT AACTCGGCTC AAGCATATAC 33 00 

GAATGATAGC AAAACATTAG AAGAAGCAAA GAAAGCACAC CCAAACGCAC AGTTCAAAGT 33 60 

GAATAAAGAC ACCGGCGCGT ATACTTATAC ATATGACAAA AACAACACGC CAAACAACAA 3420 

TCATCAAAAC CAGTCACGTA CAAACGACAA TCATCAACAC GCAAATCAAC GTGATCTTAA 3480 

CAACAATCAG TACCATTCTT CATTAAGTGG TCAGTATACG CACATTAATG ACGCAATTGA 354 0 

20 TTCACACACA CCGCCTCAAA CGTCACCAAG CAATCCTTTG ACACCAGCAA TACCGAATGT 3600 

CGAAGACAAT GACGATGAAT TAAATAACGC TTTTTCAAAA GATAACAAAG GGCTTATTAC 3660 

AGGCATCGAT TTAGACGAAT TGTATGACGA ATTACAAATC GCCGAATTTA ATGACAAAGC 3720 

AAAGACCGCT GACGGTAAAC CTTTAGCATT AGGTAACGGT AAAATCATTC ATCAGCCTCT 3780 

TATCACAAGT AAGAACAACT TATATACTGC TGGACAATGT ACATGGTATG TCTTTGATAA 3 84 0 

ACGTGCCAAA GATGGACACA CGATTAGTAC ATTTTGGGGA GATGCTAAAA ACTGGGCAGG 3 900 

CCAAGCTTCA AGCAATGGCT TCAAAGTAGA TAGACACCCA ACACGAGGAT CAATTTTACA 3 960 

AACAGTAAAT GGTCCATTTG GTCATGTAGC CTACGTTGrA AAAGTTAATA TTGATGGAAG 4 020 

TATTCTAATT TCAGAAATGA ACTGGATTGG TGAATATATC GTTTCATCAA GAACCATCTC 4080 

TGCTTGAOAA GTTTCATCAT ATAATTACAT CCATTAAATT AATCATGACA TCAATAAAAA 4140 

GCGACCAGTT CGCAGTTTAC AATTCGTAAC ACTGCAAAAT TGGTCGCTTT ATTTTGTATG 4200 

TTATTCGATT ATAAAATTAC AAAGAAATGT TCTCTACATT CCCCATTAAT CAAAATCGTT 4260 

TACGAAAGTA TAATTGTAGC TATAATAATC CAAGTCGTAA CAACTAGTGG CACTATCGTC 4 3 20 

TTGAATAAGA ATATACCGTA TTTTTTCTTG CGATATATAT CCAGTACTAG CCAAATTAAA 4 380 

ATGATTATAA CACCAACAAA AATAAATACA GGATTCATCG ATATAGCATC TGCCrTGTAAC 444 0 

TCAGGTTGCA TTCTTAATTT AGTGATAATT AACATCACTA CTGAAATAAT GAAAAAGTAG 4 500 

ATACCTCTTA TCTTTGATGT CTGTAAATCT AATTCCTGCT CTTCAATGAC CTCTTTAGAT 4 560 

TCACCCAATT CTTTTGCAAT CAAATAATTT ACTACCTTAG GTTTCACCCA TAAACACTTA 4620 

ATTGCAAAGT ACATAAAAAT ATATGATCCA GTATCCATAA ACATTAAAAA GTTGCTTAAA 4 6 80 
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i 



(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


269: 






ACAAAAAATT 




i A 1 1 0 AAA 


CTAATGTTAT 


CGACAGCAAC 


ATGTTTGCCA 


60 


TAACGCTTAG 






ACTTTGCCAC 


CTCriTTTTT 


CTCATAGCAT 


120 


AAAACCGAQA 




GTATTCCCTA 


TTTAACCACG 


TTTATTACAA 


TTTTCAAATT 


180 


TAAATGATTT 




X X X I 1 X'AATA 


AAATAATGT^ 


TAAwAGGwAA 


TCwCCAGTTA 


240 


AGAAATAGTG 


1 X/xX X X XAv^U 


X XtjAArTCAA 


AAAAACACCC 


AGTAAAACAA 


GGAATGCTTA 


300 


CTAGGTGTCT 


X X A X *\\- 1 


1 X LjCjL. X 1 X AT 


AATTTTGAAT 


CGTTtCTAAA 


AATGCTGGAC 


360 


AATAATGTTT 


XAnX X X\^XAA 


U X AUCTACGC 


CATCmATATT 


AATCATATCT 


TGTTTCGAAG 


420 


CAGGCTTACG 


TTTAGCAAAT 


TCCTCCAACG 


TGTAATCAGA 


AAATATACTT 


ACAGGTGCTA 


480 


TCGTTAATTT 


GTCACTTAAC 


TTTTrACGAA 


CTTCTACCAA 


CTGACTGAAT 


AATACTCGGT 


540 


CAACCCCTTC 


AACCGTATTT 


ATAAATACTT 


TTTCAGTCGC 


TTl-lTGCTTA 


AATGGTGTTG 


600 


TGAATACTTC 


TACTTCATTA 


CTGAGTAATT 


TTTTAATTGA 


AGTATCACAC 


ATTAATATTT 


660 


CGTCATTTTC 


ATTTAAGAAC 


CCTTTGAATC 


TTAATTCATC 


TATTAAGTGA 


CTTAATTCTG 


720 


ATGTTGTGTA 


ACCTTTCATT 


AAACCATGGG 


TTGAAATTTG 


GTCATAACCT 


TTATACTTAA 


780 


TATAATCTGk 


TGACTCTCCT 


CTTAACACTT 


GAATGATAAC 


ACTATAACTC 


TCTTGTTGTT 


84 0 


TCATACGAGC 


GATGCAACTA 


ATAATCATCT 


TAGCTTCTTG 


TGTCATATTA 


TATGATTTAT 


900 


CTTGTTGAAC 


ACAATTACTA 


CATTGTTCAC 


ATTCTTCTAA 


TTTTTCATTC 


GGTTCAAAAT 


960 


AATGGACAAT TGTTGCTTCT AGACATTTTT 




ATATTGAATC 


ATTTTAGTTA 


1020 


ACTTTTCGCC 


CATTTTATCT 


TTATAGTCAT 


CATCAGCTTG 


AGAGACTGTT 


ATAAAATACT 


1080 


CGTGTAAATT 


GATAtCGCGT 


TCGCTAAATA 


ACAAAATACA 


TTCAcrri"i'c 


AACCCGTCAC 


1140 


GACCTGCACG 


ACCCGCTTCT 


TGATAATAAG 


ATTCTAAATC 


TCCAGGCATA 


TTATAATGAA 


1200 


TAACAAAGCG 


TACATTGGAT 


TTATCAATAC 


CCATACCAAA AGCATTTGTA GCAACGACTA 


1260 


CTTTAACACG 


ATCAAATAAG 


AAATCATTCT 


GCGCTTCTTC 


TCTTTCTTTA 


TTGCTCAAAC 


1320 


CTGCATGATA 


TATAACACTT 


TCAATTTTCT 


GACTTTCTAA 


GGCTTCriXiA 


AGCTCTTCAA 


1380 
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CCAATATAAA TTTTTGACGT TGATAAGTAG GATTTACTTT AAAAATTAAG TTTCTACGCT 
TAGTACTCGT TTTAATTTGA TCAGTTTGAG CGATATTTAA CTTTTCTCTA ATATCTTGCTT 
GTACTTCAAC CGTGGCAGTT GCTGTCAACG CTATTATTGT AAAATCTTGA GGTAACGTAA 
ATACTTTTGA AATAACATTT TGGTAACTCG GCCTGAAATC ATGACCCCAT TTAGAAATAC 
AATGCGCTTC ATCAAACGCG ACTAAGTGAA TCTTTATACG CTGAAGCATA TTTAAAAAAT - 
ATCGGTTTTC AAATCGTTCT GGTGCAACAT ACAAAAATTG AATTTCTCCA TTTGATAATG 
CTTTTTCAAT ACGTTGTTGC TCTTTTTGAG TCAAACTACT ATTTAAAAAA GCAGCTTGAA 
TTCCCATCGC TTTTAATTGA TCCACTTGAT CTTTCATTAA TGATATTAGT GGACTTATTA 
CAATTGTTGT ACCACCTAAC mATAAACCTG GTACTTGrTA GCmTATAGAC yTACCTCCAC 
CAGTtGGkAA GrCACCAAGC ACAT 
(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 
AAAGATTGCT TGCCTTGAGG GTTTATATAT CTGACTCAAT TGCCACATTT TTATCAAGAG 
TAGTTGATAA TACTCATCAT AATTATAGCT AATATTATAT TTTTTTAAAA GATAGTGTAT 
GATTTTCTGG TGTTTGTTGT ATACGTCATT AAATTTCAAG TAGTCATTCT CCAAGTTATA 
CGTATAACAA ATATTTCCGG ATAAAGTTAG AATAAAATAT TTAGAAAAAT CATTCATTTG 
CGTPATCGCT AAATTAAGTG TTAAATATAA GACATAAGTA ATTAATTTAA TGTGATATGA 
TGTATTATTA CTTTGCTAAA TAGTAGATAG AACAAAATTT GTAATCGGGA GGTAACAATG 
GATTACGCAC ATTTAAATTT AGAACATTTT TTTGCACGAA ACGACGATTT AGATGTTATA 
AGAGATCGCG CTGATTTCGT GATGATAAAT AACTTCACTA ATGAAATGAT GTATCGTGAT 
GGTCAAATTG AAGGCACGAT TGATTTAAAT CAGTACTATT ATAAAAATAG ATCAAATGCA 
GCAAGTTTTA TTATGATGGA TTATAAAAAA GAAACTAAGT AAACGAACAA AAGAATTTTT 
TGTTTTTTAA TACGTGAATA ATAAGATTAT TGATATAAAG GTTTTCAAAO GTTATACAAA 
AAGATAAAAC ATTTATGATT CGTAGATCAA CGTAAAGTAA TGTTGATAAA TGGTTTAAAA 
CGTTTCATTT ACATTACTGT TTATTTATGA ATATGTAACA ATOCATAGAT AAAATTGTTA 
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X Vj X X A X\v 


AATAAACACA 


AGAAAGGTTC 


TATTTTTGGA ATAATAGGAC 


900 






*»X X Xo^XVjX X 


GTCyCaTTTT TATTTTTCTC 


AATGATATCC 


GATCAGATAT 


960 


5 


TTTTC'AAACA 




GACATTAAGA 


TTGAAAAGTT 


AAATGTTACA 


TTAAACGATG 


1020 






ar^aaaTaaaT 

M^>1MAX AAAX 


AATTATACGA 


GTCAACAGGT 


ATCAAATAAA 


AAGAATGATG 


loao 


10 


nATfTTZA^A^^a 


TY^ r* a T'/"'nr'/^ a 
i\jLAX w Xvjl«A 


ACTGAAATTA 


AAAGTGCAAT 


GGATAGCGGT 


ACTl'ITATCG 


1140 




A T" A A TYS A A A a 


^\.J\AAAAX Al 


CAATTTITAG 


ATTTATCAAA 


GTATCAAGGG 


ATTGATAAAA 


1200 




AT B A a'T'T A a 


Av-GTATGTTA 


GTAGATAGAC 


CAACGTTATT 


GAAACATACG 


GATGATTTf*T 


1260 


IS 


T A A A Ani-'Tf^r* 


XAAAkxATAAG 


CACGTTAACG 


AAGTTTATTT 


AATTTCACAT 


GCATTATTAG 


1320 




A A A Off^/^ /"V^ r» 


ACj 1 TAAAAC^T 


GAATTAGCTA ATGGAGTCXy^ 


AATTGATGGC 


AAAAAGTACT 


1380 




/IL-AAX 1 X^-IA. 


TGeAGTAGGA 


GCCCTTGATA 


AAGACCCAAT 


TAAAACAGGT 


nPA^3AATATf2 


1440 


20 


^' "T* R R ?V JV a rf^^lV 


TGGTTGGGAT 


ACACCTGAAA 


AAGCTATTTC 


AGGCGGTGCT 


^snx X X v-xix X w 


1500 




A X AACj Q-ACTT 


CTTATCAAG C 


ACAGATCAAA 


ATACATTGTA 


TAGTATGAGA 


TGG AATnP A A 


1560 




& & a RT'/^^R/^/^ 
AAMA X t_t-AtiG 


AGAACATCAA 


TATGCTACAG ATATTAAGTG 


GGCAGAAAGT 


AATGCAAC!AA 


1620 


25 


X XAX v^'^ 


C X''1"X~X ATAAG 


AACATGAAGA 


CTGAAGGAAA 


ATACTTCAAA 


X X X XV^XwX 


16S0 




AX AAAwAXWA 


JV ("T'lV TV TV TV Frt 

CAGTAAACAT 


TTGAATAAGT 


AATTTGATAA 


GCTACGAGTT 


GTTTTTATGA 


1740 


30 


t. X vJt»<j ACAT A 


jv TV TV -IV TV ^ 

CTAAAAAGAC 


GCTTTCTATC 


TTGTTTTGAT 


AGAAAGCGTC 


rrriTGCATT 


1800 


Avj>v.xAAAAUA 


UA 1 TTGATKGA 


TAAtCCCaCC 


aATGCAAgTG 


GGGcAGGACa 


TCGATAAAGA 


1860 




AX XA^X X i X X 


CTTTAGAAAT 


TAGTATTTCT 


TATGCATGAG 


TTTTACTCAT 


GTATTCCTAT 


1920 


35 


XXX X An^j X A^ 


ALIA X l AG X X A 


TAGCTAATGA 


TAAAGAACCA 


CTACATAATA AATCATTAGT 


1980 




GTTTTTTTAT 


CATTTCTGTC 


CCaCTCTCAT 


CGTGATTTGA 


AATTTTCAAT 


TGCGATTTTA 


2040 




ATTTCATCTC 


TTACACGTTG 


GAACTCTGAC 


CAAGGCTTGC 


CTGCAGGATC 


ATCAAATCCC 


2100 


40 


CAATGTTCTT 


TCTTAACATT 


TGTTGGTAAA 


GAAGGGCAAT 


TTACGTCTGC 


ATCACTACAT 


2160 




AATGTAACAA 


CTAAATTTGA 


ATTTnTAATA 


ATATTATTAT 


CGGATTAAAA 


TCTGATGGAT 


2220 




GATTTGATAT 


ATCAATGCCT 


ACTT 








2244 


45 


(2) INFORMATION FOR SEQ ID NO: 271: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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ATAAGCAATT TAATTTTGAG TCTACAATGG AGGAATTATC ATCTTTATCA GAGACTTGCC 60 

AACTTGAAGT GTTGGGTCAA ATTACTCAAA ACAGAGATCG TGTAGATCGC AAATATTATG 120 

TTGGTAAAGG TAAAATTGAA GAAATTCAAG CATTTATTGA GTTCAAAGAT ATTGATGTAG 180 

TCATCACAAA TGATGAATTA ACGACTGCAC AATCCAAATC ACTAAATGAA GCTTTAGGTG 240 

TAAAAATTAT TGATAGAACT CAGTTGATTC TTGAAATATT TGCATTAAGA GCAAGAAGTA 300 

AAGAAGGTAA ATTGCAAGTA GAGCTAGCAC AACTTGATTA TTTATTACCT AGATTGCAAG 360 

GCCATGGTAA AAGCCTTTCT CGTTTAGGTG GCGGTATTGG AACTAGAGGC CCTGGTGAAA 4 20 

CGAAGTTAGA GATGGATCGC AGACATATTC GAACTCGTAT GAATGAAATT AAACATCAAT 4 80 

TGCGGACGGT AGAAGAACAT CGCGAAAGAT ATCGAAATAA AAGAAATCAA AATCAGGTGT 54 0 

TTCAAGTAGC TTTAGTTGGT TATACAAATG CTGGTAAATC ATCATGGTTT AATGTTTTAG 600 

20 CAAATGAAGA GACGTATGAA AAAGATCAAT TATTTGCAAC GTTAGATCCT AAAACACGAC 660 

AAATTCAAAT AAATGATGGA TTTAATTTAA TTATTTCAGA TACTGTTGGT TTTATACAGA 720 

AACTACCTAC GACXnTAATT GCAGCTTTTA AATCAACTTT AGAAGAGGCT AAAGGTGCAIS 780 

ATTTATTAGT ACATGTCGTA GATAGTAGCC ATCCTGAATA CCGTACGCAG TATGACACAG 84 0 

TTAATGATTT AATCAAACAA TTAGATATGA GTCATATTTC TCAAATAGTT ATTTTTAATA 900 

AAAAGGACTT ATGTGATCAT GCATCAAATC GTCCAGCAAG TGATTTGCCT AATGTTTTTG 960 

TTT C TTCTAA AAATGATGGT GATAAATTAC TTGTTAAGAC GTTATTTATT GATGAAATCA 1020 

AAAGGCAATT AACTTATTAT GATGAGACAA TTGCGACGAA TAATGCAGAT CGATTATATT 1080 

TTCTAAAACA ACATACATTA GTGACTGAAC TTAAATATGA TGAAATTGAA AATGTTTATC 1140 

GTATAAAAGG ATTTAAAAAA TAATAAAAGG ACGAAATTCA AATGAAAGAT ATAAGTAAGA 1200 

TAGTAGCTGA CGTCGAATCA ACGTTAGCAC CATATTTTAA AGAAATTGAA GAAACAGCAT 1260 

ATATTAATCA AGAAAAAGTA TTAAATGCAT TTCATCATGT CAAAGCAACC GAAAGTGATC 1320 

TACAAGGATC AACAGGATAC GGGTATGATG ACTTTGGACG TGATCATTTA G 13 71 
(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6035 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

so 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
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CGTAAACCTA TGCGTITTAA 


. TATTCTGAAG 


TTACTTTAGTT 


1.^— rvj X 1 lU-HXV^l.j\i, 1 


120 




TGTTTAATAT 


' AAATAATACA 


TCCAGCTGCT 


ACTAAAAATG 




180 


5 


ACAAATATTA 


. GAATACCGTT 


AGTAGCATCG 


ACCTCTTTTT 


TCATTTCATC ATArriTfiaTY^ 


240 




ACTTTGTCTC 


CAAACTGTTT 


TGCAATTGCT 


TGAGCmTT 


Sir A X ^ A Nj X vxn X vj XXX u XXX £\ 


300 


10 


ATATCATATC 


CATAAAAAGT 


ATGAACGTTA 


TTTTGTGTTT 


^— x\j\w XVJ MX/IWIXIXCa 


360 


GGACTTACTT 


CGATGACAGG 


TGAGTTGAAG 


CTTAGATTTA 


AAGG ATAAAr* f^TT* & cCTTny^ 

r*n\^\jn. a X X X X X\3 


420 




TCTTCTTGTG 


TAcACGGAAA 


GTTTCATTCT 


TAGTACCTTT 


TACTAtTTAAA TOfT^TYi'rn'^a. 


480 


IS 


AATGGATATT 


AATAATGTTA 


GGCAGCGATT 


TTGTATTTGT 


AATGA*IY^f3f A '1*Pr^T*'TY^/^r'**TV* 


540 




TtAACTTGTT 


ATTTGCACTT 


AAAATAGAAT 


TCX3TGCGACC 


TGAATf^APTA T*r*r*i*o^a 
x\armxv.AV«XA ^WVI 1 1 lWv,ZA 


600 




AAGTAATAAC 


CTGATCATTA 


ACATTATCTA 


CAGTAATAGT 


X X X MAV^v^ X X X X 1 ACxAAA 


660 


20 


ATGTAATTTG 


TTGTTGGCTT 


AGTTTAGTTT 


CAAATTGTTT 


AGCATT'T'TYS A f2Ta/"2r»r» & r»r^ii 
^ X xvaM. W7 Xn(«7<,*UAWV*>A 


720 




CGTTAAATTC 


ATTTGGTGCC 


ATAGATGTAA 


GGGTTTGATC 


■LVJXAX X xuAX 1 1A\jA1AA1\j 


780 




CCGCAAAACA 


CAATACAGTT 


ACTGTAACTG 


CAGAAATAAT 


TGCAATGATA m^Tnanr' JVir«iv 

xv^VnnnxVjnXA \a± XnnnSaAXmA 


840 


2S 


TGGCATTTTT 


CTTCATTCTG 


TACATAATAG 


ACGATGTGAA 


TAC^AAf^ATf f2 f^THHTa/^iiTa 


900 




CGCGTCCATT 


TTTTGATTTT 


TTCAATGTTT 


TAAAAATAAG 


TGAC!Af^fV^AA fT^IV^IY^Aiv^n. 


960 


30 


ATAAATAGGC 


GCCTACAACC 


GTTAAAAATA 


AAATGATAAA 


^«^-Avxf\x\* X\- /ilAi^V*\..Al 


1020 


TTAGTGCTTT 


GAACGTACCA 


AACATTTCTG 


TCGCCATATA 


ATAGCCTAGT GCAATCATAG 


1080 




CAATACCTAA 


TACGCCTGAA 


ATAACCTCTG 


CAGTCGTTAC 


TTTAGCAGTT GTGGCATCAG 


114 0 


3S 


TTTTAATTGA 


ATCTTTCATC 


ATTGATAAGA 


TACTACGTCT 


TTTTAGAAAT AAAGCACTTT 




GAAATAAAAT 


CAGTACATAC 


GCAATAATTA 


GCATGAAAAT AGTTAAAACA AGGGCCATAG 


1 T £ rt 




GTTCGAAATG TATCGATAAG TTAATCGATA ACGAGATCAA TTTAGATACT ATGGAAAGCA 




40 


ATAaTTOTGC 


ACCTGCAATG 


CCACATAATA 


CACCGACAAC 


ACCTGTGATT AAAAATACGA 


1 ^ a n 




TCATTtGTTC 


AAGTGCTAAC 


ATTTnCAAAA 


TGTTTTGTCG 


TGTTAAACCA ATCaACTGAA 


X^^ U 




ATAGCGCAAA 


TTCACGTGTA 


CGGCGTTTTA 


CGrmTAAATG 


ATTGGCATAC ATTAAAAAGA 




4S 


TGACAATAAT 


GATAAATAAA 


AATATTGATC 


CGACTAAAGC ACCTTTCTTA ATGATGGCCA 


1 C f A 




TCGAGTCGTC 


ATTATTTACA 


CCTTTAGTAA 


ACTGTAAGGT 


TGTAAAACTG AAATATAAGA 


1620 




CGATGCTAAA 


AAATAATGAA 


AATAAATACA 


TTGCATAATG 


TTTTAAGTTT TGTCGTAAGT 


1680 


SO 


TTTTGAAAAC 


GATATGGTTA 


AATGTCATTT 


GAGACACCAC 


CTAATACTGA TTGAAGATGT 


1740 




ACAATGTCTT 


CATAAAAGGC 


CTGTTTAGAA 


CGTCCTTCCT 


GATAAAGTTG TGTATGAATT 


1800 
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ACCATGACAA 


TAGTTGTATC 


AAACGATTTA 


TTCATTTCTT 


CCAAACGTl'G 


TAATAGGTCA 


1920 




TTTGCACTTT 


TCGAGTCGAG 


TGCGCCTGTT 


GGCTCATCTG 


CAAATATGAT 


TTGTGGTTTG 


1980 


5 


TGAACAAATG 


CTCTCGCTGC 


TGCAGTTCTT 


TGTTGTTGAC 


CACCAGATAA 


TTCGCTAGGG 


2040 




TATTTATTTC 


CTAGGTCATA 


AATACCTAAT 


GCTGTCGTGA 


TCGCTTTATA 


ATTTTCTTCC 


2100 


10 


ATTGTTGCCT 


TCGACATTTT 


TTGAACAGAT 


AAAGGTAACA 


TAATGTTTTC 


TTTAACGGTT 


2160 


AATGTCGGCA 


GAATACTGTA 


ATCTTGGAAG 


ATGAAACCTA 


ATGATTCTTT 


GCGGAATTTG 


2220 




GCAAGTGCTT 


TTTGATTAAG 


TTTATTAAGC 


TCTTGTCCGT 


TAGCAATCcg 


CTACCGCTAG 


2280 


IS 


AAATTTGGTC 


AATTGAACTT 


AGTACATTTA 


ATAAGGTTGT 


CTTACCTGAT 


CCAGAAGGCC 


2340 


CCATAATCGC 


AACGAATTCG 


CCTTTTTGTA 


TGTCAAAGTT 


AATATCTTTA 


AGTGCTTGAA 


2400 




ATGTGTGCTT 


TTTACCGTAT 


GTTTTTGAAA 


CATGTgCACT 


GATAATATCG 


TCATAGTCTC 


2460 


20 


ACTCCTTtTG 


TATTTAATTT 


CATTTTAAAT 


AATGTTTGGA 


GTAGTAGCCT 


TTATCTAAAC 


2520 




TTACAATTCA 


ATGAATGAAC 


CTTACAGAGT 


TGAAArcTAT 


CG CT ACTTAG 


TAGA'rrrriG 


2580 




AGTGAGGATA 


CAGATTCATC 


GTACATATTA 


GACAAAAGCA 


ATGGTGCTTT 


CTAAGTGATG 


2640 


2S 


ATGTTtGTGT AAATTGAGAA 


AAGGGAATTT 


AATTATTGTA 


TAATAAATTT 


TTTGTAAAAA 


2700 




TTAAAAGAGG 


GTTTTATTTG 


AAAGGAATTG 


ATTGTTATGG 


AAAAAGGAAA 


TCAAGGTATT 


2760 




AAATGGTCTA 


GTTTAATAAT 


GGGTGTATTA 


TTATTAATGT 


TGGCAGTCGT 


TA'rrriTACA 


2820 


30 


TTTCCAATTG 


AAAATTTTTA 


TGCTATTACC 


TGGTTGATTG 


GACTGTTTGT 


ATTAATTAAC 


2880 




GGTGTGATTC 


AAATCGTTTA 


CCGTAGAAAA 


GCAAAAGCTT 


TAGTAGGTGG 


TAACCAAAAT 


2940 


35 


TGGATTCTGT 


TTATGGGGAT 


TGTAGATATT 


CTATTTGGTC 


TATTAGTTAT 


TTTTAATGTT 


3000 


GGCGCAAGTT 


CAGCATTCTT 


TATTTATATG 


TTTGcrrnT 


GGTTTATTTT 


TAGTTCTATC 


3060 




TCTGGATTAT 


TTACX5TTTTC 


GGGTAGTGGT 


AGCTTAAAAC 


TAATTTCAGT 


GA'i-rrrrAAT 


3120 


40 


TTATTAGGTA 


TTGTTTTCGG 


TGTCATTTTA 


TTATTTAATC 


CATTAATGGG 


TATCGTCTTT 


3180 


ATTTCGACGA 


TGATTGCTAT 


TGCATTTGTA 


TTCGTAGGTG 


TCATTTATGT 


TGTAGATGCA 


3240 




CTTGCTTAAG 


TAAAATGAAG 


CGGTTCAAAA 


GAAGGGTGTG 


ACATGAAGTT 


TGTCTCATAT 


3300 


45 




GTGTTTATGA 


AGCATAAAAA 


AGGGGCGCTA 


CCTACAATAA 


GTAAGATACG 


3360 




CCCATATTTT 


TATATTTTAC 


TATTATTGTT 


TTTCAATACG 


ATTAATAGTT 


ACATTTAGTC 


3420 




CAAAATATTT 


TTCTAAAAAA 


TGTTTATAGT 


TATCTTTAGT 


GACATCAAAT 


TTTTCTGAGC 


3480 


SO 


TACCATTCCT 


TGTTAAAGTT 


AAATGATTTT 


CAGACATTGT 


AGCACGGCCA 


AATGATTGTG 


3540 




GCATTGTAAT 


TAATAAATGC 


TGTACAAATA 


TTGAATCTGG 


ATGCGTTTGA 


TTATATTCGA 


3600 
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AATGATCATT TTCC3AATTTT TGAACATAGA AAATATCCTT GTCTTCGTTC TTAAAAATAG ' 372, 
CGCGGAATGT ACCACTGATA TCAGTAATTG GTTGTCTATG CTCAGATCAA GTAATAGGAA 378( 
TGGCATGTAG AGGTAAGTCT CCAAAGCCAA CATCAGTTAC ATAGAATACA TCATTTATAG 384 c 

AAACAACAAG TGAAGCATGT GAACCGTTCA GACTACGACC GCCACCGGGw GTGTGAATAG 390c 

TAGCTGACAT TAATTCAGGA TTAAATCCTT TTTGTTGTAA ATAGGCTTTG AAAAATGTAT 39sc 

TTAATTCATA ACAAAAACCA CCACGTTTAT CATGAACAAT TTTATTAAAA AGTGCATCGA 4 02C 

TATTTATAGA TATCGGCTTA CTATTTTGAA CATCAATATT TTCAAAAGGT ACAGTTAACA 408C 

TAAAACGTGT TGCATAATAA TTTAATGCTT CAATACTCGG TCGATTATAA CGAGATCAAT 4140 

CAATTTGTAA ATAATTCTCT AACTTCGCAA TATTCATAAG CATAGCGCCT CCTGTATTAA 4 200 

AGATTATAAT TAAATTTTAA ACAGAAATAC TGAAATTTTA AATTCGAAAG CATTGAATTT 4260 

20 TGGATAAATA CATTTTAAAT AGAAAAATAC GCTCTCAAAA TGAAGTCATC TCTAAAAGAA 4320 

ACGATTTAAA GATGACTACT GAGAGCGTAg CATAATGGAA GAAGTGTGCA GGGTGTCTAA 4 3 80 

AAATGCAACA ATACAAAGGT AGTTGCAAGA CAAGTTGCCT TATCTAGACC ATTTGTGTTC 4440 

25 TATGCGACCA AACTTCCAAA TTAAACTTGA AATAAGCX^VA GTAATTAAAA ATAATCCAAC 4500 

TAAAATATAG CCTAAATAAT CAAATTCGAT CGAACCAATG AATGCCCAAA ACGCACCATG 4560 

TAAATCTAAC TTATCAGCAA GAATTTGTAG CAATTCAATC ATCCCAATCA CTAATGCTGC 4620 

CATGACTGAT ATCGCAGTAA TCGTTATATT GTAATAGATT TTGCGAATAG GATIGAAGAA 4 680 

TGCCCAATTA TAGGCATACT TCATTACAAC ACCATCTAAT GTATCCAATA AACTCATACC 4740 

TGATGCXSAAT AAAATTGGTA AAGATAAGAT TCCGATAAAT GAAATGGCTT GTTGTGATGC 4 800 

GCCTGAAGAA AGAGCGAGTA ACGCAATTTC ACTAGCTGTA TCAAAACCAA GTCCAAATAA 4860 

AAAGCCAAGT GGCAATACGT GCCAACTACG CGTGATTAAT TTCAAATAAG GTCCTACAAA 4 920 

40 TCGAGAAACC AATCCTCTAG ATTCAAGTAA TGCATCGACT TCAGCTTCTT CAATGTGTTC 4 980 

ACGACGTAAT TTAGCGAACA AGTTAATTAA AGAGATTAAA ATAATTAGAT TCAACACACC 5040 

GATAAGCACT AAAAAGAAAC CTGAAACTAG TGTACCAATC GITCCACCAA TATCTTGGAA SlOO 

45 ATGCGGTAAT TCATCTTTAG CCCATTTTAC AGATACCCCT AAAAAAACAG CCATTAAAAA 5160 

TACGACAGAT GAATGTCCAA TTGAAAAATA GAAACCCACA CCAGATCGAT CTrTGCCTTG 5220 

CTGTAATAAT TTGCGaACCG TATTATCTAT TGCAGCAATG TQATCTCCAT CAAATGCATG 5280 

ACGCAAACCT AATGTATATG CAAGAATCCC CaTACCAAAT AAGATATGAT GGTCmTCC 534 0 

AGCAATCCAT AAAAAACTAA ACCCAATAAC GTGTAACAAA ATGACAATAG CTATGTATGG 5400 



so 



30 



35 
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ATATTTAATC ATACTGTATG TTCAATGGGC ACTCTAGTAA TAAGTGTTCA TATAACAAAA 
ATGTTATGCC AAATTATTTG TTATATAAAA ATATACATGT AACCACAAAA GATTTTTTGC 
GATATATATA ATTTGATAAA TTAACCAACA ACAATGTAAG ATGTCACTTT GCTTAACTTG 
GCATCCTTTT TATGATTTTC AAATTCAAAA AAATGAGCAA AATGAATCTC TTTAcCAGTT 
TTTAATATTT CaATACCATG CATGGAACCT AAGCACCCAT GTGTGATGCT GGAATGGATA 
TTGAGACTAG CAACCTGATT GTAATGATTA GATAGTTCTT GAATTAATAT TTGAGGTCCG 
TATATGTCAA AGCGGCCAGG GACAGACCAA ATAAATTCTG TTGTAACCAG TGAACGTAAT 
AATTCAATAT CTAATGCTGC TGTAACAACT ATAAAATCTA TCATTTGTTG ACGTTTAGGC 
GCATGATTGC ATGACACATC TCCTGTTAAC TTAAAAGGTA ATGATGACTG AACTTCCGTT 
TTAAAATGTA GTTGGTGCTG AAATAAAGCT TGTTC 
(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
TTTTGAACAG CCATATTTAT TCACCCTCAA CATCATTATA ATGGTATTAG TCGCATTACC 
TTCACTTGTT TTAGCTATAT ATGATTATAT GAGTTTTAGA ATTTCTTCTG CTATTTTACA 
ATTTCTAGGG GCTATCTCTT GGTTCTTTTT ATCATTGATA TTATCGCTCA CACAATTTAC 
ACCTTTTACA TTAGCGTCAT TTATAACTTC AATTATTTTG TTCACAAGCA CAATTATCAC 
ATT^CCATT GGTGGTAAGT CTGTTGAAAA GAATGATTCC CCTTAAATTC CAAATGAAAA 
AAAGGTTCTG AAGGCCGCTA TAAAACACAG TTTTTCAGAA CCTCTATACT TCTATTCAAT 
GATATATGGT TTGCAATTTT CTACCTTTAA ATCCACAGCT TCTGCCCTTG AAACTTTGTT 
AAAATAAACC ATCAAACAAC GAATGACAAC TTGATGTGCA ACAATGACAA TATCATCTTT 
TTGTGTATCT TCATTGACAA CATGATTCAT AAAATGTTCT ACGCGTTGAT ATACATCTTC 
ATAACTTTCT CCTTCAGGCG CTTTXTGTGA AAAACTATGA CGAAAGTCTT TAAAGTTTGG 
ATCATTGAAA TATTTTTCAT ATTTCGGATT CGCACTGATT TCATCTTTAT ATTCACCCTC 
AAATACGCCA AGTGAACGTT CTCTTAATAG AGGGGTAGTC GTTGATGCAA TGTCATATGG 
AAAAATATGT TCAAACGTTT GCTGTGTTCT TAATAAGTCT GAAACATATA CATGTTTAAT 
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CGGCACATCT AATTGTCCAC AAAAATAAGA TCGAAAATGT TTATTATCAT AATTCGATTT 
TGATTCGCCA TGTCTAACTA AATAAATCGT CATAATATTA CTCCTTACCT TATGTATrTC 
ATATCTACCA TAACACTTTG ACTACTAATT CGATATCAAT CTTAATATTC TATTCTAAAA 
AAAGAATTAA TTCATATnT 

(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



900 
960 
1020 
1039 



60 
120 



300 
360 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274 : 
2^ GAGAGAATnT GCAATTAGTT ATTCAATTAG TTGATTTAAG ACATGATCCA ACACAAGATG 

ATATCTTAAT GTACAATTAT TTGAAACATT TTGATATTCC TACTTTAGTT ATATGCACTA 

ArGAaGACAA AATTCCaAAA GGTAAGGTyC AAAAGCATAT TAAAAATATT AAGACACAAT IBO 

25 TAGATATGGA CCCAGACGAT ACAATTGTAA GTTATTCATC AATTCAAAAT AATAmaCAAC 24 0 
AACAAATATG GAATTTAATT GAACCGTATA TTTCATAGTT TTTGTACGTC AAAACTTATA 
CAAAAATTTT AAAAATAATG TAAGCACGAA ACTTTTAATT AGTACACAAT TGATAACATT 

TTTCAACGTT CATCATTTTG TCAAAAACTC AAAAGTAAAT TAGAAAGATT ATAATTTATT 4 20 

TAAGCATCGT ACTTAATTGG ATTTTAAATT ATGTTATAAT ATTTGTATTG TTAGTATATA 4 80 

TGGGGGCTTT TCAAATGCAT TTTATTGCAA TTAGTATAAA TCATCGCACA GCTGATGTgC 54 0 

ACTAAGAGAG CAAGTTACTT TTAGAGATGA TGCGTTACGA ATTGCCCATG AAGATTTATA 600 

TGAAACTAAA TCTAyTTTAG AAAATGgTCA TATTaTCAAC ATGTAATCGA ACTGAAGTAT 660 

ATGCTGTTGT TGATCAAATT CACACAGGTC GTTACTATAT TCAACOATTT CTAGCTCGTG 720 

CATTTGGATT TGAAGTAGAT GATATTAAAG CAATGTCAGA AGTAAAAGTG GGGGACGAAG 780 

CaGTAGAACA TTTATTGCGT GTCACTTCTG GTTTAGATTC AATCGTACTT GGAGAAACTC 84 0 

AAATTTTAGG TCAAATAAGA GATGCATTTT TCTTAGCGCA AAGCACAGGT ACGACAGGrA 900 

CAATTTTTAA TCATCTATTT AAACAGGCAA TTACTTTTGC AAAAAGAGCA CATAATGAAA 960 

CAGATATAGC TGATAATGCT GTAAGTGTGT CTTATGCTGC GGTCGAGTTG GCGAAAAAAG 1020 

SO TATTTGGCAA ATTGAAAAGT AAGCAAGCTA TCATTATTGG TGCAGGGGAA ATGAGTGAAT 1080 

TATCACTATT AAATCTTCTT GGTTCTGGAA TTACTGaTAT TACAGTAGTA AATAGAACAA 114 0 
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TACCAAATTT ACTTGAAAGT GCAGATATTG TGATTAGTTC AACGAGTGCA CAATCTTATA 1260 

TCATTACAAA TGAAATGATA GAAAGAATTG CAGAAAATAG AAAGCAAGAT TCACTAGTAT 1320 

TGATTGATAT TGCAGTTCCT CXSAGATATTG AACCTGGTAT TAGTGCCATC ACAAACATCT 1380 

TTAATTATGA TGTTGATGAC TTAAAAGGTT TAGTTGATGC AAACTTACGT GAGCGACAAT 144 0 

TAGCGGCTGC AACAATTTCG GAACAAATTC CTACAGAAAT ACATGCACAC AATGAG 14 96 



10 

(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 826 base pairs 

(B) TYPE: nucleic acid 

'5 (C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

CTTGATTTTT TCCCcTTAGT ATTTTCCaTt TGanTGTCGC AGCTTCTAAA TCCTGCTTTG 60 

GTTCTCTAGT GAACTTCATA ATTAAAGCAG CTACAACGAA' TGATACAAGT GCAGCAAGGA 12 0 

25 AGACACCGAG TAACATGTGC AAGAATTCAC CTCTAGGTGC ATTTAAACAG TAAACTATAA 180 

ATGAACCTGG TGACGCGGGA CTTTTAAATC CAAATCCTGT TGCTTGATAA GTTGCAACAC 24 0 

CAGTCATTCC ACCTAAAATA ACAGCGATAA ATAATAAAGG ACGCATTAAT ACATATGGGA 300 

AATAAATTTC ATGAATACCA CCTAAGAAGT GGATAATTCC AGCACCATAT GACGTTGCTT 36 0 

TTGCAGTGCC TTTTCCAAAA ATCATATAAG CAAGTAAGAT ACCTAAACCT GGTCCAGGGT 420 

TAGATTCAAT TGTGTATAAA ATTGATTGAC CAGCTTTTGC AGCTTGATCT GCACCAAGCG 480 

35 

GTGTGAATAC ACCATGGTTA ATCGCATTGT TTAAAAATAC AATTTTTGCA GGCTCTACTA 54 0 

AAATACTTAC AAGTGGAAGT AGGTGTGCAT GTACTAATGC TTCAACTGCC ACTGATAAAA 600 

TATGCATAAT AAATTTCATA AGTGGTGCTA AAATTTTAAA TCCTGCAATC GTCATGATAA 660 

40 

ATCCTAAAAT ACCAGCAGAA AAGTTATTAA ATAACATTTC AAAACCTTGC GGCGTTCTAG 720 

GTTGAATCAA TTGGTCGGTC TTCTTCATTA ACCAACCAAC AAGTGGACCC ATAATCATTG 780 

^5 CACCAAGTAA CATTGGTGTA TCAGGTAATG CAACGATGAC CCCCATAGTT GCTGTTGCTG 840 

CGATGATACC ACCACGTAAA TCATAAATTA AACGACCACC ACTAAATGCG ATCAATAATG 900 

GGATTAAATA AGTAATCATT GGTCCTGCTA AAGTAGCTAA ATCTTTGTTA GGTAACCATC 960 

so CATTATCTAT AAAAATGGCC GCGATAAAAC CCCAAGCGAT GAAAGCGCCA ATGTTTGGCA 1020 

TGATCATACT ACTTAAGAAT GATCCAAATG CTTGAACACG ACGACCAATT CCTTTTTTCT 1080 
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GAGAGGTTAC 


TTGTTACTCA 


ATATAAACAA 


AAATCAACTT 


TGTCAAAATA 


AATGTGACAA 


1200 




AATTAAATAA 


AGTGTCATCA 


ATGTGACAGT 


ATAGATATTT 


TGAAAAAGTA 


AAACAAAAAA 


1260 


5 


ATTGTTTTAG 


GAT-rriTAAA 


ATTTTATTGT 


GAAAATATTT 


GCAAAACAAA 


ACAACACCGT 


1320 




gTACAATAAT 


GATTAATGGA 


AAGGGGGAAA 


GTTCGGCAGT 


ACAGTTAAAG 


CGCCTGTGCA 


13 80 


10 


AATAAATATT 


TGTATTTGAA 


GATTAAAGGT 


TAATATATGA 


GTGGCCTTTA 


TAGAGTGCAA 


1440 


TATATGTATT 


TGTAGACGAG 


GAGGATAGTG 


ATCGAATAGA 


TCGGCGGATG 


CTATCCCGGA 


1500 




TGTGGCTCAT 


TCGTTAGCTT 


ATTAAGTAAA 


ACATTAGGGT 


GACTTAATGG 


ACAAAGTTAA 


1560 


IS 


TAAGATCGCC 


AGAAATTGAA 


TATAAAAAAT 


ATTAATATGG 


AAAGTACAGT 


GTGAGCAATT 


1620 




TGTATAGTTG 


TAAAAATAAC 


TATGCTTAAT 


TTGTTATGGA 


TGAATGCGAT 


GATAGCATGT 


1680 




TCCTATTTAT 


ATTATGAAAG 


CAGATTGTCA 


ATCTAAATTA 


TCGGCAATAA 


ATCATAATTT 


1740 


20 


ACGCGTACTA 


TTCCAATATG 


GAGGAAAATG 


TCGTTATGTG 


TGGAATTGTT 


GGTTATATTG 


1800 




.GCTATGaTAA 


TGCCAAAGAA 


TTATTATTAA 


AAGGTTtAGA 


AAAATTAGAA 


TACAGAGGTT 


1860 




ATGACTCTGC 


AGGTATCGCA 


GTAGTAAATG 


ATGATAATAC 


AACTGTATTT 


AAAGAAAAAG 


1920 


25 


GTCGTATTGC 


AGAATTACGT 


AAAGTTGCTG 


ATAGTAGCGA 


TTTTGATGGA 


CCTGTTGGAA 


1580 




TCGGTCACAC 


ACGTTGGGCA 


ACACACGGTG 


TACCGAATCA 


TGAAAACTCT 


CATCCACATC 


2040 




AATCATCAAA 


TGGCCGTTTT 


ACTCTAGTTC 


ATAACGGTGT 


TATTGAAAAC 


TATGAAGAGT 


2100 


30 


TAAAAGGTGA 


ATACTTACAA 


GGTGTATCAT 


TCATTTCAGA 


AACAGATACA 


GAAGTTATCG 


2160 




TTCAATTAGT 


TGAATACTTT 


TCAAATCAAG 


GACTTTCAAC 


TGAAGAAGCA 


TTTACAAAAG 


2220 


35 


TTGTGTCATT 


ATTACATGGT 


TCATATGCAT 


TAGGTTTATT 


AGATGCTGAA 


GACAAAGACA 


2280 


CAATCTATGT 


TGCTAAAAAT 


AAATCACCAT 


TATTATTAGG 


TGTTGGTGAA 


GGTTTCAATG 


2340 




TTATCGCATC 


AGACGCACTT 


GCAATGTTAC 


AAGTGACAAG 


CGAATATAAA 


GAAATCCATG 


2400 


40 


ACCATGAAAT 


CGTTATTGTT 


AAAAAAGATG 


AAGTTATTAT 


TAAAGATGCA 


GATGGAAACG 


2460 




TTGTAGAACG 


TGATTCATAT 


ATTGCTGAAA 


TTGATGCATC 


AGATGCTGAA 


AAAGGTGTTT 


2520 




ATGCACACTA 


CATGTTAAAA 


GAAATTCATG 


AACAACCAGC 


AGTAATGCGT 


CGTATTATTC 


2580 


45 


AAGAATATCA AGATGCAGAA GGTAACTTGA AAATTGATCA AGACATCATC 


AATGATGTTA 


2640 




AAGAAGCAGA 


CCGCATTTAC 


GTTATTGCAG 


CAGGTACAAG 


CTACCATGCA 


GGTTTAGTAG 


2700 




GTAAAGAATT 


TTTAGAAAAA 


TGGGCTGGCG 


TACCAACTGA 


AGTACACGTT 


GCATCAGAGT 


2760 


SO 


TTGTCTACAA 


CATGCCATTA 


TTATCTGAAA 


AACCATTGTT 


CGTTTATATT 


TCTCAATCAG 


2820 




GTGAAACTGC 


AGATAGCCGC 


GCCGTATTAG 


TTGAAACTAA 


TAAATTAGGT 


CATAAATCAT 


2880 
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TCCTGAAATC? 


GGAGTTGCAT 

X Xw^*«^X 


CTACAAAAGP 


ATATACTGCA 


CAAATTGGAG 






TATTATCAAT 


CTTGTCTCAA 


ATCGTTGCAA 


AAGAGCaTGG 


TCGTGAAGCA 


GATATTGATT 


3060 


5 




ATTAGCAAAA 


GTAACAACAG 


CAATAGAAGC 


AATTGTTGAC 


GATGCACCAA 


3 120 






AATTGGTAC!A 


GATTTCTTAG 


AAACAACACG 


CAATGCATTC 


TTTATCGGAC 


3180 






PT ATA AfTiT A 


AGTTTAGAAG 


GTGGG*T*rAAA 


ACTTAAAGAA 


ATTTCTTACA 


3240 


10 


•nT'a AnC AGA 


AwVj XXX X\J\. X 


GGTGGAGAAC 


TTAAACATGG 


TACAATTGCC 


TTAATCGAAG 


3300 






aVVj X X V7 X >\kjVj X 


TTAGGAACAC 


AAGAGAAAGT 


TAATTTATCA 


ATTCGTGGTA 


3360 


IS 


a ^r"r^ A A 21 A 


G/^T A G*r A G 0 A 
X AvjIAVjV-iA 


OGTGGTGCLAC 


ATC CATGT AT 


TATTTCTATG 


QAGGGTCTTG 


3420 




wVjAWxUI lAX 


\,9X^.A1 X^^l^ 


ATGTACATY^A 


ATTGTTAACG 
X X w X X ^%r%wvj 


P CATTAGTAT 

V« Vai^^ X X J^*a X#^ X 


3480 




CAGTGGTTGC- 


Al XAuAAl lA 


A 1 X X UiA X Ak« X 


ATGGAGPATT 


ACACAGAGAT 


TTAGATGTTG 

X X «»VJ*» X W X X w 


3540 


20 




XAAWV^X 1\9^X 


A A ATP AGT^TA 
AAA X ^hA^9 X 1 a 


PTGTGGAATA 


f\ X X \ai^% V» X X X X 


TTAGAATCAA 


3600 




i catgtatta 


AAA X 1 AAAGT 


AX/\X\3V3V.>%^^ 


/ " 1 ■ 1 " 1 T AG ATT 
«M X X X X Awr^X X 


AA'T*PG A'^^ rs 


AAGGGTnPTT 


3660 




TTTTAGGTCG 


Ai. X X XXX 


TACTTCATCT 


TAATTTGGCA 


GAAATGCGTa 


AAAATGAAGT 


3720 


2S 


GTTTTATTTA 


1 1 X AAA 1 Avj 1 


CTGACAATTA 


AGGGTGTTAT 


GTTAATATGA 

W X X#W\X^%X SiX^ 


TTTTATGAGA 

X X X X X VA^^VJi^^ 


3780 




AGTATG<5At»T 


A/^^^ A ATa A A^^ 
Avj V.-AA X AAA^ 


GGGTGACCTC 


GCATGTTAAT 


TCAATTAQAT 

X ^^J^m^ X X X 


CAAATTGGGC 


3840 




GAA^LvjAAvjC^ 


H/'I/^RIV A A Ar*A 
AvjuAAAAAtJA 


ATTTTAAAAA AGATTTCTTG 


GCAAATTGCT 


AAAGGTGATA 


3900 


30 


AATCXjATAT^t 


AxAX\jVivX l\j 


AATGGTGCTG 


GCAAGACAAC 


ACTTCTAAAT 

^^^^ X X w X M^r%t^ X 


ATTTTAAATG 


3960 




^Aj 1 AloAu^v. 


1 VjLpAAV.*A X V* X 


GGAACTGTTA 


ACCTTTTCGG 


TAAAATGCCA 


GGCAAGGTAG 


4020 






An Ar^ A r^TYTT A 


CGACAACATA 


TAGGTTTTGT 


ATCTCATAGT 


TTACTGGAAA 


4080 


35 


Au 111 v^v\utA 


nnr2*TY2 A A AG A 


GTAATCGATG 


TGGTGATAAG 


CGGTGCCTTT 


AAATCAATTG 


4140 




^ 1\9 X Al V«>A 


AGATAT*rGAT 


GATGAGATAC 


GTAATGAAGC 


ACATCAATTA 


CTTAAATTAG 


4200 


40 


*nY2f^ A ATYITT* 
1 l\JUvw\l w X ^ 


TGr^AAAGGG 


CAACAATATA 


TTGGTTATTT 


ATCTACCGGT 


GAAAAACAAC 


4260 


uno X Van l\An 1 


TY^r'A^^GAGfTP 


TTAATGGGGC 


AACCCCAGGT 


TTTAATTTTA 


GATGAGCCAG 


4320 




V,.Aw^ X Olv XXX 


A G A P^TT^T A 
AVxAV* 1 X X Al 1 


GCACGAGAAT 


CGTTGTTAAG 


TATACTTGAC 


TCATTGTCAG 


4380 


45 


AJ. X V-AlAl ^,.1. 


A ACGT^PTYZ PG 


ATGATTTATG 


TGACGCACTT 


TATTGAAGAA 


ATAACTGCTA 


4440 




ACTTTTCCAA 


AATTTTACTG 


CTAAAAGATG 


GCCAAAGTAT 


TCAACAAGGC 


GCTGTAGAAC 


4500 




ACATATTAAC 


TTCTGAAAAC 


ATGTCACX3AT 


TTTTCCAGAA 


AAATGTAGCA 


GTTCAAAGAT 


4560 


50 


GGAATAATCG 


ATTITCTATG 


GCAATGTTAG 


AGTAAATAIT 


TTGCAAATAA 


TAAGTAATAA 


4620 




TGACAAAATT 


TAATTAAGAT 


AAAATGGACA 


GTGGAGGGCA 


ATATGGATAA 


CX3TAAAAGCA 


4680 
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AAAGATGTCA TTAATCAATT GAGAGAGAAA GGATATAAAG TATTTTTGGC AACAGGACGT 
TCGCATTCTG AAAATACATC AACTTG 
^ (2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 84 6 base pairs 

(B) TYPE: nucleic acid 

10 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



75 



30 



35 



40 



4800 
4826 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

GAATAAAAAG TAAAATTACT CGCCTTTGTT ACCTTTTACT TTATCAATaA AATCAGTTGC 60 

TTTTTCTTTT GCATTTTCAA CGAATTCTTT CGCTTTACCA GAAGCTTTAT CTTCTTTACC 120 

20 TTCGTTTTCT AAATTTTTAT TATCAGTAAC ATTACCTACT GTTTCTTTAA CATTACCTTT 180 

TGCTTGTTCA AATTTACTTT CGTCTGCCAT AATAAATGCC TCCTCGGAAT AATTAAATGT 240 

TATATATAAT ACTTACCCAC TGAAAAATTA TCTAAACATT TTACTTAAAT AATTTTTGAT 300 

2S ATTGATTTGA CGTCATTTTA TAACTAGCGA AATAGATTCA TCATTAACTT GAGGGAGTGG 360 

GACTGAAATA ATAAAGAATC ACTAATGATT TATGATGTAT TAGTCACTAG CCATGTGAAA 420 

TTAAAAATAA GAATAAATGA GTAGCACGCA TGCATATAGG ATTTTACTTT ATCCGTAATA 480 

GCATCTCATT CCTAAATATC ATATAAATAC CTGTTTAAAT TAAAAAGCCC AGCAACATCA 54 0 

CGTTACTGAG CCATTAATAT GATTTATTTA GCAGGAATAA CTGCACCATT GTATTTTTCA 600 

TTAATGAAGT CTTGAATATC TTTAGATTGT AATACTTCAA TTAATGCTTT GATTTTCTTA 660 

TCATCTTGAT GTCCTTCTTT AACAGCAATT AAGTTTGCAT AAGGATTATC TTTCGCACTT 720 

TCTACAGCAA TAGAATCTTT TTTAGGATTT AGTTTTTGTT CGATTGCAAA GTTCGAATTA 780 

ATGATAACAG CATCAGCGTC TTCATTTTGA TAAATTTTAG GTAAGAATTC TCCTGATTGT 840 

TTATTATTAA ACTTAATATC TTTTTTATTC TCTGTAATAT CACTAAACTT AGCATCTTCA 900 

ATTTTTACGC CTTTTTTGAT TTTAATTAAA CCTGCATCAA CGAAGAATTT TAAGAAACGT 960 

4S CCTTGTTCAG CTGGATTATT AGACACATAG ACTGTTGCAC CTTTTGGTAA TTCTTTTAAA 1020 

CTTTTATACT TTTTAGAGTA TACAGCCATA GGTTCTAAGT GAACATCACC GGCACTTACG 1080 

ATTTTGTAAC CTTTATCCTT TTTCTCTGTG TTTAAATATG GTGTATGTTG GAAATAGTTT 114 0 

SO GCGTCAATTT CACCTTTGTC TAGTAATTTA TTAGGTGTAG TGTAATCGTT AATTGTTTT A 1200 

ATATCTAGTT CATAACCTTT TTTCTCTAAT AATGGTTTTG CTTTTTCTAA AATTTCAGCA 1260 
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TTACCGCAAG CTGCTAATAC AACTGCAAAT GTTAATACTA AAATAAGACC AAATAATTTT 1380 

TTCATAAAAT GAAACCCCCA ATTTATCGTT TATCAAGTTT ATTTGTAAGC CAATCCCCAA 144 0 

TGAATTGGAT TATAAATACA ATAATTAAAA TAAAAACTGT TGATACTAAA ATGACATCAT 1500 

TTTGATTTCG AGTGAAACCT GTTAAGTATG CTAAATTTCC TAAACCACCG GCACCAATTA 156 0 

CACCTGCAAC TGCTGTTGAA CCAACTAAAG CGATTGCTGT AACTGTAATG CCAGACACTA 162 0 

GCGCTGGCAT AGCTTCAGGT AAAAGGACTT TACGAATTAC TGTCCAAGTA TTAGCGCCCA 1680 

TTGACCAAGC CGCTTCGATG ACACCTTTAT CAATTTCTTT AAAAGCAATT TCTACGAGCC 174 0 

TTGCATAAAA CGGTGctGCG CCAATGATCA AGGCTGGTAA CGCACCTGTC GGACCACTTA 1800 

TCGTTCCAAG TATCAAACTT GTAAATGGAA TTAATAATAA AATTAAAATA ATAAATGGTA 1860 

TCGCTCTAAA TAAGTTAACA ATGAAAGAAA CGATAGAATA AAATAACCTT GCACCGATAG 1920 

ACTTACCTTT AGCAGACAAG AATAATAACA CACCTAAAAT AAGACCAAGT ATAAATGCAA 198 0 

ATATAGTTGA GACGACTGTC ATGTATAGTG TTTCGACTAT TOCAGTCCAA ACTTCTGGCC 204 0 

ACTGAATATT AGGCATTGTA ATCATTTCAT TTATAATTTC ACTAAATGAT TTACCCATGT. 2100 

25 CTTAACACCT CCATTTTAAC TTGTCGCTCA ATTAACTCTT TTTCGAATTT TCCGAAATCT 2160 

ACACTTGAAA TATATGGAAT ATGCAGAACT AAAAAGCCGA CTGTTCCATT TTTTGTATTT 2220 

TTAATATTTG CTTCTAAAAT ATTAATTTTA ATATCATAGG CAGTTGATAG ACTCGATACA 228 0 

^0 ATAGGCTCGG TTGTTGTTGA ACCAGCGAAA ACTAATCTAA CX3ATATATGC ATCTTTTTCT 234 0 

AATGGCTCTA ATTCTGTTAA AGATGTTTCG AAATCATCAT TTAAATCGTC TTTCACAAAT 24 00 

CG' rrrX GTCA CAGTGTGTTG CGGATTTTCA AAAACCTGTG TCACCGGTCC TTGTTCTATC 246 0 

ACTTTACCAC TTTCCATAAC TGCAACTTCA TCACAAATAC GACGAATOAC ATGCATTTCA 2520 

TGCGTAATTA GTACA A TT G T TAAATTTTGT TGTTCTCTAA TTTTTAGTAG TAGATCTAAA 2580 

ATTTCATCTG TTGTTTGCGG ATCAAGTGCA CTTGTTGCCT CATCACAAAG CAAGACCGTT 2640 

GGATCATTaG TAACGCTCGT GCAATCCCAA CACGTTGCTT TTGTCCACCT GATAACTCTG 2700 

ATGGATAAGC CTTTTCTCTA CCTTTTAAAC CGACGAGTTC GACAAGTTCT AATGCTTTTT 2760 

GCTTAGCTCT CCTTCTAGGG ACACCTGCAA TTTCAAGCGG AAACATAATA TTTTTTAACA 2820 

CAGTCCTTGA CCATAACAAA TTAAAATGTT GGAAGATCAT ACTTACTTTT TGTCTTTTTG 2880 

CTCTTAATCC ATTTTTGGAC AATTGACCTA TATGGTCTCC ATCTATAATA ACTTCACCTG 294 0 

so ATGTAGGCGC TTCTAAATGA TTAAACATTC GAATCAAAGT ACTTTTTCCT GCTCCAGAAA 3000 

AACCAATGAC GCCATAAATC GATCCTGCTC GAATCGATAA ATTAACGTGA TCTACAGCAA 3060 
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TTCCTCCCTG TGTTGCTTAA TAAAATAAAA AATGCTTTCT CAATATCGAT AGAAAAATTC " 3ib( 

AGAAAGCAAT AGTAGTATTG TTTCTCTCAT CTTCAAAAGT TAAAACTITA TGTGAATTCG 324( 

CACCATTTCT ATATAAGACG GTTGCCGGGC TTCGTAGGGC ACATCCCTCC ACCACTCTCG 330C 

ATAAGAGTTT ACGCATCATT TAATTTGTAT TAATCCTAAC ACCTTAGTAA AATTTCGTCA 336c 

ATAACTATTT TAAATTTTCT AACAAATCAG TCACCGATTT AAATGCATAA ATTCGTTTTA 342c 

CTTCTTTATC TTTATTCATC AACAATAAAA TCGGCGTAGA CATGATTTGC ATATCTTTAC 34 80 

AAAACTGAGG ATAAAAGTTT AAATCTATTT TCAATAATGG TAACTGCAAT ATTTCATTAG 3 54 0 

CAATGTCTAA CATTCTTTCT GaAACCTTAC AAGTACCACA CGTTGGTGTA TAACCAAAGA 3600 

TTAAATGTTT GTCTTCCTCA TAAAATGTAG TTACATCTTT GATGTCTAAT GAATTATTCA 3660 

TTTACTAAAA CTAACCTTTC ATTATTTATA TTCGGTAAAA GAGGTGTTTC TTTCTTACAA 3720 

20 GTAAAGCCAT GTTTTGAAAG TACATGCGCC AAATATTGTT TGGGGCAATT CGCAACTTGA 3780 

CAGTAAGTTT TATCAATAAA TATATGTTCA CTTTCACTCA AATAACGTTT AAACCAATTT 3 84 0 

CTAATTCGAT CTCCTTCGTC ATCAGAATCG GCTAATACAA AAACTTGTTT ATCATACAGT 3900 
GATTCTATCA TATCATCAAG CTTATCTATA CTCATTGTTC CATGAGTACA AATAATATTG 
ACTGGTTCTG CAATAACCTG TTGCACCCTT TTTTTATCAG ATTTTCCTTC AACAATTATC 
ACTTTATTTA CAATAGCCAT CATCATCACC CTTTAAAATC AATAAACATC TGTCACTGTA 
TCATTTCACA AAATTGGTAT GAATAAAACA TAAATCACAA AAAATTTAAA CTAGCTTAAT 

ATAATAATTA CAAACTCAAT GTTTGACTAG CTGGAACATT TAACATAAGC AGACAAAGGC 4200 

TAAGTCAAAA ATCAACATCC TAAAATCTAC AATGTTATAT TAACAATAGT TAACCAAAAG 4260 

AAAATACACC TATAACAAAC TTTTCAATTA TAGCGGGGCC CCAACACAGA AGCTGATGGT 4320 

AAGTGAGCTT ACAATAATGT GCAAGTTGGC GGGGCCCCAA CATAAAGAAA TACTTTTTCT 4380 

TTAGAAATTA GTATTTCTTA TGCATGAGTT TTACTCATGT ATTCCTATTT TTAAATACAC 4440 

ATTAGCTGTG GCTTATGAAA ACAGGCTGGG ACATAAATCA ATGTTCTATG CTCTACGAAg 4 500 
TTATATTGGC AGTAGTTGAC TGAACGAAAA TGCGCTTGTA ACAAGCTTTT TTCAATTCTA 
^5 GTCAGGGGCC CCAACACAGA GAATTTCGAA AAGAAATTCT ACAGGCAATG CAAGTTGGGG 

ATGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGCA AGTTGGGGTG 46 80 

GGACGACGAA ATAAATTTTG CGAAAATATC ATTTATGTCC CACTCCCTAG ATTGATCTAT 4 74 0 
SO AGATACTACA CTTATTAAAG TAATATATTT TTATGATTCT CTTAGCTGCA ATCCCATGAA 



25 



30 



35 



40 



3960 
4020 
4080 
4140 



4560 
4620 



4800 



TACATGTAAT CATCAAACTT CATAGCCTCA AGGTCAGTAG ATTTCA 4 84 6 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : dotoble 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

10 





AACAAAGACA 


CAATCGAACA 


TGAACCATCA 


GTAAAAGCTG 


AAGATATATC 


AAAAAAGGAG 


60 




GATACACCAA 


AAGAAGTAGC 


TGATGTTGCT 


GAAGTTCAGC 


CGAAATCGTC 


AGTCACTCAT 


120 


IS 


AACGCAGAGA 


CACCTAAGGT 


TAGAAAAGCT 


CGTTCTGTTG 


ATGAAGGCTC 


TTTTGATATT 


180 




ACAAGAG ATT 


CTAAAAATGT 


AGTTGAATCT 


ACCCCAATTA 


CAATTCAAGG 


TAAAGAACAT 


240 




TTTGAAGGTT 


ACGGAAGTGT 


TGATATACAA 


AAAAAACCAA 


CAGATTTAGG 


GGTATCAGAG 


300 


20 


GTAACCAGGT 


TTAATGTTGG 


TAATGAAAGT 


AATGGTTTGA 


TAGGAGCTTT 


ACAATTAAAA 


360 




AATAAAATAG 


ATTTTAGTAA 


GGATTTCAAT 


TTTAAAGTTA 


GAGTGGCAAA 


TAACCATCAA 


420 




TCAAATACCA 


CAGGTGCTGA 


TGGTTGGGGG 


TTCTTATTTA 


GTAAAGGAAA 


TGCAGAAGAA 


4 80 


25 


TATTTAACTA 


ATGGTGGAAT 


CCTTGGGGAT 


AAAGGTCTGG 


• 

TAAATTCAGG 


CGGATTTAAA 


540 




ATTGATACTG 


GATACATTTA 


TACAAGTTCC 


ATGGACAAAA 


CTGAAAAGCA 


AGCTGGACAA 


€00 


30 


GGTTATAGAG 


GATACGGAGC 


TTTTGTGAAA 


AATGACAGTT 


CTGGTAATTC 


ACAAATGGTT 


660 




GGAGAAAATA 


TTGATAAATC 


AAAAACTAAT 


TTTTTAAACT 


ATGCGGACAA 


TTCAACTAAT 


720 




ACATCAGATG 


GAAAGTTTCA 


TGGGCAACGT 


TTAAATGATG 


TCATCTTAAC 


TTATGTTG CT 


780 


35 


TCAACTGGTA 


AAATGAGAGC 


AGAATATGCT 


GGTAAAACTT 


GGGAGACTTC 


AATAACAGAT 


840 




TTAGGTTTAT 


CTAAAAATCA 


GGCATATAAT 


TTCTTAATTA 


CATCTAGTCA 


AAGATGGGGC 


900 




ctt;Atcaag 


GGATAAATGC 


AAATGGCTGG 


ATGAGAACTG 


ACTTGAAAGG 


TTCAGAGTTT 


960 


40 


ACTTTTACAC 


CAGAAGCGCC 


AAAAACAATA 


ACAGAATTAG 


AAAAAAAAGT 


TGAAGAGATT 


1020 




ccattcaaga 


AAGAACGTAA 


ATTTAATCCG 


GATTTAGCAC 


CAGGGACAGA 


AAAAGTAACA 


1080 


45 


agagaaggac 


AAAAAGGTGA 


GAAGACAATA 


ACGACrCCAA 


CACTAAAAAA 


TCCATTAACT 


1140 


GGAGwAATTA 


TTAGTAAAGG TGAAyCgAAA 


GAAGAAATCA 


CAAAAGATCC 


GATTAATGAA 


1200 




TTAACAGAAT 


ACGGACCAGA 


AACGATAACA 


CCAGGTCATC 


GAGACGAATT 


TGATCCGAAG 


1260 


SO 


TTACCAACAG 


GAGAGAAAGA 


GGAAGTTCCA 


GGTAAACCAG 


GAATTAAGAA 


TCCAGAAACA 


1320 




GGAGAyGTAG 


TTAGACCACC 


GGTCGATAGC 


GTAACAAAAT 


ATGGACCTGT 


AAAAGGAGAC 


1380 




TCGATTGTAG 


AAAAAGAAGA 


rATTCCATTC 


rAGAAAGAAC 


GTAAATTTAA 


TCCTGATTTA 


1440 
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CCAACACTAA AAAATCCATT AACTGGAGAA ATTATTAGTA AAGGTGAATC GAAAGAAGAA 
ATCACAAAAG ATCCGATTAA TGAATTAACA GAATACGGAC CAGAAACGAT AACACCAGGT 
CATCGAGACG AATTTGATCC GAAGTTACCA ACAGGAGAGA AAGAGGAAGT TCCAGGTAAA 
CCAGGAATTa AGAATCCAGA AACAGGAGAT GTAGTTAGAC CACCGGTCGA TaGCGTAACA 
AAATATGGgA CCTGTaAAAG GAGACTCgAT TgTaGGAAAA AGarGAATTc CaTTCaGGAA 
AGAcGTaATT TaTCCTGTTT AGCACCCGGG GCAGAAAAGT TAC 
(2) INFORMATION FOR SEQ ID NO: 278: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



180 
240 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

TAAACAGCGC GTGTACTTGT GATTCCCCCT TCTTCTATTT TACCCACCCG GGAAATAATA 60 

CTTTTCGCGA TTCCTTACTT GaACAAGCAA TATTTTATCa GCTGTTTCTT CAACTAAACA 120 

GACACATTTA ATCATCTTTG ACACCCCAAC TTTGTGAAAT CAATTTTTCA AATTATACTG 

TACAATTATG TTATCATATA TGAGTAGTTA TAGCGCAAAA CGTTAGCAAT TCAGCGCACC 

CAACTTTTCA TATAAACAGA AGATACTAGG GGGAATTATT ATTATGGCAA AACgTTCCAA 3 00 

ATCACAACGT TTATCAAGTT TACTAAATGT CGCAGGTTTC ATAGTCGACG GCTACAATGG 360 

CTATAAATAT CATGCTAAAA ATAAAAAATT AGTATATCTT TCATTAGGTT TAAGCACTGT 420 

AGGAACCGTG TTAGACTTTT ACATTTCAAT TAAGTCACCA CGTAAGTTCA AAAAAGCAGT 480 

GGO^TTGTT ACTTTAATAA CAAACGGTGC TAGATTATTT ACAAGCATTC GCAAAGTAAA 54 0 

40 ACATGAATAC TAATTCAGAA AAGGATTGGT CGAACATAGA ACATOAAGTT CATTCGACCA 600 

ATCCTTTTTC ATATACAAAA ATTCCTATTA CAACAATCAC GACTTTGATA GCCCCGCCAT 660 

AAAATTTAGG ATTCAATCCA ACTTTTCAGC TTGTGaAATXS TAATAGGaAT TCATTATATA 720 

4S ^^^^^ 

TTTATATACG TAAGACTTTA GTGAATATAT CTATAATTAT TTACTTGGTA AGCTGGTACC 780 

GTTCTGTAAG TTAAAATACC TGGTGCAGCT GAATAGTTCA TTTCTGAAAC TAAAATACTA 84 0 

CCATCGTTAT TTACACGTTC TACAAACATA ACATGACCAT AGTAACCTAC ATCTGTTTGA 900 

GCGATAGAAC CTACAGTAGG TCTATTGTCG ATAGTGTAAC CATCTGCTGC CGCTGCGTTA 960 

TCCCAGTTAT TAGCATTCCA CCAATAAGTA CTAATACCTT TACCAATTTC AGCACGACGA 1020 
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GTATTGTAAC CTCTATTTGT CGTTGTTGCA GATCCTGAGT TCGTAGATGC aTTACCAGTT ' 114 0 

ACTTTCAATT TTTGACCCGG ATATATAAAG AAATTATTTA AGCCATTTAA GCTCATAATT 1200 

TTTTGATAAG TTGTACCATA TTTTGATGCA ATTAATGACA ATGAGTCACC TGCTTGTACT 1260 

GTATAGTATG ATCCGCCACC TGAGTTCX3TT GATGGACGGC TACTATTGCT CGCAGCGTTA 132 0 

CTTGAGCTAG CAGTACCTGA TACTTTTAAT TTTTGACCTG GATAAATAAA GAAATTATTT 13 6 0 

AAACCATTAA GTCGCATAAT GTTTTGGTAA GTTGTACCAT ' ATTTTGATGC GATTAATGAT 144 0 

AATGAGTCGC CTGCTTGTAC TGTGTAGTAT GATCCGCCAC CTGAGTTCGT TGATGGACGG ISOO 

CTACTATTAC TCGTAGAATT ACTTGAGCCA GATACTTTTA GTACTTGGTT TGGGAAAATT 1560 

AGATTAGATG TTAAATTGTT TAATGACTTT AATTTAGCAA TCGAAATCCC ATACTTATTT 1620 

GAAATTGCCC ACACTGATTC ACCCGGTTTT ACTGTGTGAG TTGTAGCCGC ATTTGCTTGA 1680 

GTTGCCGCAA CAGCGCTAAT CGCGCTTGTC CCAATAATAG CTGCAATTAC TTTTTTTTGC 174 0 

ACTTTAAAAT CCTCCTCTTG CTTAACTTTC CTAACATTCT TTTATCCGAA TTTATGAATA 1800 

CTACATCATT ATACGATTTT ATTATGTATA ATAGGTTGAT GTTTGATGAC ATTATGaTTA 186 0 

25 CAAAAAAATC ATATACTGTA TCATCAAATT TTATAATTAT CCCTTAAAAT TATTACAACT 1920 

TATTAGATTT TACAATATCT AAATTATTAC AATTTCATAA TATTTCACTA TAAAATGATT 1980 

ACAATCCCTT TCTCTATTGG AAATAATTTT ATTCTCCAAC AATAACGCCC TACAAACATA 2 04 0 

AGCATGAACT TTGCTTGTAG GACGCAATAT AATTTATTTT GCTAATGACG TTTCTATTGC 2100 

CTTAATCTCA TCTTTAGATA AATTAACAGG TTTCTCTCCA TCTTTGACAT CTTCCTGCAA 2160 

CGCTTTTTGA GCTTCTTTTG AATGATACAA TTCTACGATT TTAGCATATG TTTTGTTATC 2220 

CAAGTCTTTG TCATTAACTG CAACAATATT AATATATGGC TTTACTGCAT CTGAATTTGA 2280 

TTTTTCTAAA AATATCGGAT CATTTTTAGG ATCTTTACCC GCTTTAGTTG CTACACCX3TT 234 0 

ATTAXTAACT GCAATATCGA CATCAGATAA AGCACGTGCA GTTTGTTGTG CATCTACTGC 2400 

AGTAATTTTT AAATGTTTTG GATTTGACGT TATATCTTTC ACCGTGCCTG CTAATCCGAA 2460 

ATCTTTTTTC AGTTTTATTA AACCAGCTGC TTCTAATAGT TTAAGTGCAC GTGCTTGGTT 2520 

"ts TGACACATCA TTTGGAATGA CAACTTTAGC ACCATCTTTA ACCTTTTTGA CATCTTTAAT 2580 

TTTATCTGAG TAAATGCCCA ACXK3TGCTAA AACTGTTGTA CTTAATGCTG AAATCTTTGT 264 0 

TCCTTTATGC GCCTTTTTAT ATTGATCTAA AAATGCAAAA TGTTGGAATG CATTCATATC 2700 

SO 

AATATCACCA TCATTTAATG CTTTATTCGG TAAATTGTAA TCTGAGAAGT GCTTAATCTC 2760 

CACATCAATA TCATCTTTTT TAGCTAATTC TTTAACCTTC TCCCAAGCCT TAGTGTCATT 2820 

55 
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AAGTGCTACG ATAACTAACC CAATCAATCT TTTCATTCTA TCAATTCCTT TCAAaItCTT 
CACTATATAT CATTAATGTC TACGTATGAA TCTAGCTAGA ACATTCCCTA GCGTTTGAAT 
CACTTGGACA ATAATGACTA ATACAATAAC GGTAATAATA ATGACCGTCG TATCAAATCT 
TTGATAACCA TACACTAAAG CTAAGTCTCC TATACCACCA CCGCCAACAG CTCCTGCCAT 
CGCCGTACTT CCAATAAGTC CAATAATCGC AGTGGTAATT GCTAATACTA ACGAACCTAA 
AGCTTCAGGA ATTAAAAAAT ATCTAATGAT TTGTAGTGGT GAAGCGCCCA TCGnTTTCGC 
CGCTTCAATA ATCCCCTCGT CTACTTCCAA TAATGAGTTT TCAACAAGTC TTGCAATGTA 
AGGTGCCACA TATACTGTTA AAGGCACGAT GGCAGCAGTC GTACCAATTG AAGTACCTAC 
TACTAATTTT GTGAATGGCA CAATCGCAAT TAACAAAATA ATAAATCGTA GTGACCITAA 
AATATTGATT AAAGGATTTA AAACTTGATG TATCACTATA TTGGGCCATA TGCCTTCTTT 
20 TCGAGTAATT ACCAATAAGA CACCTAATGG AATACCAATC ACTGCTCCTA AAAATAAAGC 

AATAGATACC ATATATAGCG TTTCGTACAA TGCTTGTAAT AACTGTGCAC TGTCTAAATC 
AGAACCAAAC ATATGTTAAT GcACCTCCTC AAATTGAATA TTTTTCTCTT TGAAATATTG 
25 ATTTATTGCC GTGTCTTCAA ATTGTTGATC CATATTAAAT CGAAGCCACA TATAACATAC 

■ GGTGTTACCT TGTATTTCTG ACATAGATGA AAATAAAATT TTAACCTCTC TGCCACAAAT 
TTGAATCAAG TCATTTATAA TCGGTTGTGT CACCTGAGTT TCCTCGACGA AGATTTTATA 

30 

ATCirrrAAAA tcgccaactt gttcgtcatt caatcgacga atcaatgatg tacttggctc 

AGTCTGTATA ACTGTAGACA CAAAATTTTG AGCAATCGTC GTTTTAGGAT GACTAAACAC 



40 



4S 



SO 



2940 
3000 
3060 
3120 
3180 
3240 
3300 



;5 ~*www,v.v.«x ww^ov^vjiv. (jiAtJUAATTG AAGTACCTAC 3 360 

3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 

CTCTTTAAcA GTTCCTGTTT CAACCACTTT CCCCTTTTCC ATTACAGCAA CACGATTACA 4020 
AATGTCTTTA ATAACGCGCA TTTCATGTGT AATCATCATA ATTGTAATGC CAAAGGTTTG 
ATTG£CATTC TTTAATAACG TCAATATCGA AGCAGTCGTT GCTGGATCCA ATGCGCTTGT 
TGCTTCATCG CATAGGAGTA TTTTCGGATT AGTAACAAGC GCTCTTGCAA TAGCCACCCT 
TTGCTTCTGC CCACCAGATA ATTCATCAGG AAATTGGTCT riTTTATCAC TCAATCCTAC 
AAATTCAAGC ATTTCCGTTA CTCGTTGCTT AATTTCTGTT TTGCTTTTCT TACTTAAAAT 
GAGTGGCATT GCTACATTTT TAAATACGGT AGCTGAATTT AATAAATTGA AATGCTGAAA 
TATCATACCG ATATCTTTCT TAATATCCCT CATCATTTTA TCGCTATAAT TCGTAATATC 
ATGTCCATCT ACAATCACTT GTCCATTCGA GGCAGCTTCA AGATGATTCA CGAGTCTTAC 
CAACGTACTT TTTCCTGCAC CACTATATCC AATCACACCA AAAATATCAT TCCGATTGAC 



4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 



4560 



CGTAAATGAT ACGTCCTTCA AAGCATCTAT TTTTrGCTTC TTTTTATTAA AGGTCTTACT 4 620 
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TAAGAGCATT ATATGTAAAA TTGCATATAT CGTcAATACA ATTTGCCX3AA TTTTCTAAAA " 474 0 

AATTAAAAAA TAAGTAATTC ATGTGACAAT GACGAATTGT GAGACTACTA TGACATTTAT 4 800 

CAAATTAAAT CCATAAAAAT GTCCACCAAT CCTCCACAAC GCAATTACTA AATATTAACA 4 860 

TCGCACAAAA AAGCACTAGC ATATTCAAGA ACAACAAACG TTGAACTCAA AATATATGCC 4 920 

AGTGCTGCTA TTATTTATAA AGTATCTAGT GCTTGTTTTA AATCATCGAC TAAATCTTCA 4 980 

GTATCTTCAA TACCTACAGA AATTCTTACA AGTCCGTCTG TAATACCTTC TrTAGCTCGA 504 0 

ATATCTGCTG GAATGGATGC ATGTGTCATC AATGCAGGTA CTGAAATTAA ACTTTCCACT 5100 

GCACCTAAAC TTTCAGCTAA TGTGTAATAC GATGTTGCTT TAATCAATTG TTTGGCACTT 5160 

TCTGTATTTT TCACTTCAAA TGCAATCACA CCTGTATGGC CATCCGCTTG AGCCATATGG 5220 

ACATCATGAT TTAAATGACT TTCAATACTT GGATGGAACA CTTGTTGCAC AGCTGGATGT 5280 

20 GCTTGTAACA TTTTAATAAT TTCAATAACG CTGCGATTAA TTTGTTCCAT ACGTAAACCT 5340 

AATGTTTTAA TACCCCTCAC AAGTAAATAG CTATCTTGAG GTCCTAAAAT GCCACCTGTT 5400 

GAATTTGAAA TAAATGCTAA ACGTTCTGpV AGCTTGTCAT CCGATGTTGC AACTAAACCA 54 60 

GCAACGACAT CACTATGTCC ACCTAAATAT TTCGTTGCAG AATGTAAGAC AATATCGATA 5520 

CCTAAATCTA ATGGATTCTG ATAATAAGGT GTCATAAATG TGTTATCAAC AACTGAAATC 5580 

AAACCGTGTT CTTTCGCAAT TTCAGCAGAC TTTTTAATGT CAGTAACACG TAATAATGGA 5640 

TTAGAAGGTG TTTCAATAAA CAACATCTTT GTTGTTGGGC GTATCGCTTG TACAATTGAA 5700 

TCTGTATGCG TTGTATCTAC AAAATCCACT TCAATGCCAA ATCGTGTAAA TACTTTTGTC 5760 

AATGCGCGAT AAGTACCGCC GTATACATCT GAATTTAAAA TAATATGATC TCCTTTGTCC 5820 

AACAGCATAA CAACTGCACT GATTGCTGCA ACACCTGAAC TAAATGCAAA GCCATGTTTG 5880 

CCAITTTCTA ATGTCGCAAT AACGCTTTCT ACAGAACTTC TTGTTGGATT CGCAGTACGA 5940 

40 GAATATTCAT ATCCTTGACG TAAATCACCA ATATCATCTT GTAAATATGT ACTTGTTTGA 6000 

TAAATTGGTG TTGTAACGGC ACCTGTATAA TCGTCTGTTG TGTGCCCACC ATGAATTAAT 6060 

TTAGTTTTCr TGTTCATTAT TATTCTCCTC ATAATTAAAT ATTTGCTTAG ACATATATCG 6120 

ATCACTACCA TCTGGAAATA CGACAACAAT CGTACCTTCA GATAATTGCG CTTTTAAATT 6180 

CAATGCACCT TGTAATGCTG CACCTGaAGA ACTGCCTACT AACAACCCTT CATTTATAGC 624 0 

CAAACTTTTG ACATTTCGAA AGGCATCTTG ATCTTTAATC GTAAATATCC CATCTACAAG 6 300 

SO 

ACGTCTCTCT AAAAATATCG GCCATTTCTC AGAACCGATA CCTTCAGTGT CATGTGCATG 6 360 

AGCTGGCCCT CCATTTAACA CGGACCCTTC TGGCTCAACG GCATAACATT GCACGTGATG 6420 
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ATAATCAATT TGCTGTAATG CTGAAGTCAA TTCGGGTCCC AATCTATGAA AATATGTATC 
CGGATTATGT TCGGATTCAA ATTGATTCAT ATAAACGGCA CGATATTTTT CAGCATAGGA 
ACGTGCAGCT AATTGTGCCC CATGCATACC TTCAGACTGA CTCGTCCTTG AAACTTCTGC 
ACCAAGCGCT ATCATAATAT TAATCTTTTC TTCTGAAAAA CCATACGGCG CAAAGATCTT 
ACATTTCAAA TGATGTCTAT TCGCTGCAAT AGCTAACCCT ATGCCTGTAT TACCAGCAGT 
CGCTTCAACA ATAGTTTGAC CTGCACGCAC ACGCCCTTCT TGAATTGCCT TCTCTACTAA 
ATATTTCCCX3 AGTCTGTCTT TAACACTGCC TCCAGGATTC CATTGTTCAA GCTTGGCATA 
AATTTTAACT TTATCATCAC TATAATGTTC TAACAGTACT AATGGTGTAT TGCCaATTAA 
ATCATAAGTA ATCATAGATG CACCCTCATC TGACATGCCG ATCAAATGAA TGAAACCTTT 
CTTCATGTCT CAATTTTAAT TCTTACTTTT CAGATAAGAA TTATAAACGA CATTTTGTTA 
TTTTGCAATT ATCTAAGTTT CGATTAATTC AGAACCAGTA CTAAATTTTC AATTCCAAAC 
AAAAAAACAC CTGAGCAACA CAAATACTTG TGTGTCAGAT GCTTCTATAT ATTAACTAAA 
TAATTGCACG ATAAAGACTA AAATAATAAC GACAGGCATC GCATACTTAA TTAAGTAATA 
CCAACCACTG AATAATCTAA ATCGATCTTT ACCAAAATAT TGTTGTAATA ATTTTTTATC 
TAATAATTGT CCTACGACAA GCGTAGTACC TAATGCGCCT AATGGCATCA ATACATTCGA 
AACGATGAAA TCCATATTAT CAAAAATCGT TCCCGCACCG AATCTTACAT CTTTTAAGAT 
ACCAAAAGAT AAGGTTGCTG GAATACTAAT GATAAATACT AAAATACTAC CGATCACTGC 
GACTTTTTTA CGTTTTGTAT TGTCATTCTT CGTGAAGTTA GAAACATTTA ATTCTAATAA 
AGAAATAGAT GACGTTAAAG CCGCAAATAA GAACAGCACT AAGAATCCCA AATAGAATAA 
TGTGCCTAGA TGCATTTGAC TAAAGACCAT TGGCAGTACT TTAAATAATA ATCCAGGCCC 
ttctSgtggt TCATAGCCAA AACTATGTAA AGCCGGAAAT ATAGCTAGAC ctgccaatac. 
AGATACAAAG ATATTCATAA CAACGATAGA AATAGCTGAT GACTTAATCG TCATGTCTTT 
AGAGGCATAA CTCGCATAAG TAATCATACC TGTAGTTCCT AATGATAACG TAAAGAATGA 
TTGACCTAGC GCAAACAAGA TGCCATCAGC AGTAATCTCT GATACTCTTG GTTGTAAAAT 
AAATTTCACA CCTTCTAAGA CGCCATCTAA TGTTAAAGAC TTAATCACAA TGACGATTAA 
AAAGACAAAC AGCAATGGCA TCATAACTTT CGATGCCTTT TCTAATCCTT TTTCAACACC 
TAACATGACA ATAATCATCG TAGCGAATAT GAATATACCT TGCCCTAGAA CGGTTAACCA 
AGGATTTGAT ATTACCGCTT CAAAATTCAT TTCTTGGAGA TGATTGATCC GTTGAAATAT 
AACTAATTGC CATAATACTT GTCCGATGTA AATGACAATC CAACCACCGA TAACACTATA 
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TTTACCAGTT AATTTACTAT ATATTTGTGT TGTATATGTC CGTCCCATTT TCCCAACAGT 834 0 

GAATTCCATA ATGAGTAATG GCAACCCAAC AAAAATGGTG AATATTAAGA ACATAGCTAG 84 00 

AAAGGCACCG CCGCCATAAA TCCCTGCCAT ATATGGGAAT TTCCACATGG CACCAAGACC 84 60 

GATTGCAGAA CcCGCACTAG CTAAAATAAA TCCAGTTGAT GACTTCCATT GTGATTGTTG 8520 

TCTTTTCATC ATTCAC 8536 



W 

(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 328 base pairs 
,5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

GCTTTGGCCA TTTTATGTGG CGATTGAGAC AATCtGTkGT TGTCTTATTT GATGTTGTAT 60 

TTCAACTGGT AATTCTAGTT GCGATTGAAA TAATGGCAAC TTTTCCCAAT CATTAACAAA 120 

25 

TAATTCAATA CCTGCTATGT CTAATACTTT AGCACGTGCA TCATCAACAA GACGGCGTTC 180 

CAATTGATTT GCTTCTTCTT TAATACCTGG TGACGTACTT TCTAATATCA AATTAGATAT 240 

AGGGATGTGA CCATTAATTG CATAATATAA TGCAACACGC CCACCCATTG AATATCCAAA 300 

30 

CAATGTTATT GATTTATCTT TATATTTATC TAAAATTCGG TCTAACAACG TCGTAATATA 360 

ATCAAAATTC CACGTTTCAT CCATTGAAGA CTGATCTTCG CCATGGCCTG GTAAGTCTAT 420 

35 AGTGATGACA TGATAGTTAT CAGTAAATTT TTCGATGTGA TTATGATAAG TACGGCTGTC 4 80 

GCTAAGAAAT CCATGCAGAA ATACTAAAAC TTGATTGGTC TCAACGTTTG CTTCATAAAA 540 

TTTOTAATGT GTCATGAATC ATTTCACTCA ATTTCTGGTA TAAAATTTGA TGCTGTTTAA 600 

40 AGTTATCTTC GCGATTCGTT ATCAATTCAT AAATCGTCGA AGTTTCAGAT AACAATGTGG 660 

C A TTTTTAAA TTCTGAAACA CTGTTAAAAC GTTTAAAATC GAATTGATAT AACTTAGCTG 720 

TATACTCGAA ATCCAATCCC GTCGGTGTGC CAAACAACCG TTCAAAATAG TCAGTTGCAC 780 

45 

TTTCTTTTTG TGGTAAATAT GAAAAAATAC CGCCACCATC GTTGTTCAAT AATACAATAT 84 0 

TCATCTGAAT ATTATTTAAT TTTGACATTA ATAGTCCATT CATATCATGA TAAAATGATA 900 

AATCACCTAT CAATAATGTT ATTCGTTTAT GCACAGCCAT ACCCAGTGCA GTTGAAACGA 960 

SO 

TACCATCAAT ACCATTCGCA CCACGATTCG CATAGACATC TATATTTTTA TTCAATAACA 1020 

AGTTATCTAC ATCTCTGATA GGCATACTAT TACTAATAAA TAATGCATCT TTTTCAGATG 1080 
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TAATTTCmT ACGCCCTTTT TTCTCTAAGC ATTGCCATTT TTCTAACCAA CTTACGCGAT 
TAACTGTCGT GTCTTCCATT AATGACCTAA AGAAATCATT CGCAGAAATC TCATATGAAA 
TATCTGGCGC TATCGGAAAG ACATCAATCT TATCATTGTT TTGCACTAAT ATTTGAAATG 
CATCAGTTTT CTTTAACCAT TGATTTAACT TTTTAGAAAT CACTGGTTTC CCAACACGAA 
TTACGAAATC CACATTTAAG TCTAAGCCGC TTCTAAACAG CAAATCATAT GTACAGATAA 
CATTCGGATG ATCAAATTTT CTTAAATGAC TTAAAGGATC AGCTAAAATA GGCAAATCAT 
ATATCGTTGA ATACGTTAGT ATTTGaTCAA CTTCTTGGTG CTGCATATCC CCTACAATAA 
TTAAACCTTT TTTCTTATTT AAAATGTGTC TTAATGCCGA TGCATCTATA CTTTTTTGAT 
AGTGCGGTAA AATCTTCATC TCAGAAGTTA ACAATTCTGT TGCATTCAAA TCAGGTGTTA 
ACGGATCTCT AAATGGCAAG TTAAAATGAA TTGGCCCTTT ATGTGGTCCA TATAAATATT 
GACTAGCAAT TTGCATTTGA TAGTAAATTG CATCAATGGT CTCTTTACTA TCATCCGCAA 
TAGGCATATC GAACTCATAA CTTACATAAT TATTAAACAT ATTTACTTGA TTAATCGCTT 
GTGGTGCGCC TACACTTCTT AATTCATGCG GACGGTCACT TGTTAAAACG ATTAAAGGAA 
TTCTACTAAT TTGGCTTTCA GCAATTGCAG GCGTATAATT CGCTGCTGCT GTACCTGACG 
TACATAATAT AGCGACAGGT CTTTCACTGC CTTTAATTAA CCCAACTGCA AAAAACGCTG 
CACTTCGCTC ATCGGGGTGT ATCCATGTTT TAATATTTGG ATGTGCTTCA AATGCAAGTG 
CAAGTGGCGT TGAGCGTGAT CCCGGACTGA TAACTACTTC CCTTACGCCG TACGCATATA 
ACTCAGATGC AAATGTAAAA ACTTGCTTCG TTAAAGCTGC TTTATGATTT CCCATTCATA 
TCGACTCCTA ATGCATTCAT CATAGGTGTG AACTTAAGGT TCGTTTCTGC CAATTCACTA 
TCTGGATCAG AATCTTTAAC AATGCCACAC CCAGCAAATA AAGTTGCTTG TGCTTTCTTA 
ATAACCATCG AACGAATTGC AACAATAAAT TCACAATCAT CGTATATATC TATATAGCCA 
ACCGGTGCAC CATATAATCC TCGCGTACCA AATTCTTTCT GCTCAATAAA ATCCATTGCA 
AATTCTTTTG GATAGCCACC TAAAGCAGGT GTTGGATGTA AATTATCAAT TAAACTAATA 
TACGAATCAT CCTTCAGTGG CGCCTTTATT TCAGTGTACA AGTGATATAA ATGATCATTT 
TTTAGAATTT TAGGCGTCTT ATCATAATGT AATTCAGTGA TATAAGGTTT AATATCATGT 
AAAATACTGT CAACAACAAA TCGATGTTCG ATTAAGTTTT TATTATCTTT TAAAAATGCT 
TCAACATTTT TTGTATCTTC GTCCTCATCT TGTGAACXnT TAATTGTACC TGCTACAGCT 
TTAGTCGATA GTATTTTATT ATTGACCTTT ATTAATT G TT CAGGTGTTTG TGAAAAGAAT 
ATAGAATCTT GTGATTCTAA CAAGAATATA TAACTGnTT TTTCTTTAGA ATATGCTTGC 
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ACAATTTTTT CTTCATTATT AATAGA1TCT ATAGCTTCTA CTACAAGTTG ACGCCAGTCA 3000 

TCTTTATAAA TATCTTCATT TCTAGTAATT TCCCCAATTT GCTCGTCCAC ATCTATGTCC 3060 

^ GATATATTGT TGAACAAATC CATTAAATCG TTCAATGCCT CAACAGTAAA ACTTTCCCTT 3120 

TTAACTGTAT AAGTTAAAAA TGTCCCATTA TTATCAGTTG AAATTAAAAC TTCAGGTAAT 318 0 

ACAAAATGAT TTAGTCCAAA CTCTCGCCAT TCATCATCTG ATTTATGACT TGAAAATTGG 324 0 

10 

AACCCTCCAA CAACTCX3AAG ATGATGTTTC TCAGATTGCG GATGTATAAA TGTGATGTTA 33 00 

TGTTTTAATT TTTCCCAGTC TTTAAAAATA GATTGTTTAT TTTTAGAATT ATTTTTGAAT 33 6 0 

AATTGAATTG CTTTGTAGCC AAAATATGAC GTTCGATTAT CATTCAAACG CATATAAAAG 3420 

15 

CGATCTCCTG CCTCATTGTC AGTGAGATGA AATAATGTGC TCGGGTCTAG TGACTGTGAT 34 8 0 

AATTTCACTT CAACTGAAAC CCATTCCTTT GAGCTGCCAT ATATCTCTTT GACAATATCG 354 0 

20 TCCTCTAATA CGCCCGTAGC CATCCATTTC ACTTCTTTCT TCGTCTTTTT TCACTCATTA 3600 

TTATATTGTA TCATTTTTGG ATAATTGTGT TACAAGAATT GCTTTkAACTT ATCTTGCAAT 366 0 

TTTTCACGTC AATTGACCTT TATGCTACTT TCTATTAAAA TATCTTTGTT ATAAAAAATA 372 0 

TGATTTAAAG AGGTTTTGTA TTCAATGAGT AATCAATATC AGCAATATTC TACAGTTAAG 3780 

AAATATTGGC ATTTAATGCG TCCTCATACA TTAACTGCTT CCGTAGTACC CGTTTTAGTT 3 84 0 

GGTACAGCAG CATCTAAAAT ATATTTTCTT GGTAGCGAAG ATCATATTAA AATCAGCCTA 3 900 

30 

TTCATTGCCA TGTTACTAGC ATGCTTACTT ATTCAAGCAG CAACTAATAT GTTTAATGAA 3 960 

TACTATGATT ATAAAAAAGG CCTCGATGAT CATGAATCTG TAGGCATTGG TGGTGCCATT 4 020 

GTTCGCAACG GTATGAGCCC AGAGCTTGTG CTACGATTAG CCATTGCATT TTACATCTTA 4080 

35 

GCAGCAATAT TAGGTTTGTT TTTAGCTGCT AACTCTTCAT TTTGGTTATT ACCAGTTGGA 4140 

TTAGfATGTA TGGCTGTTGG TTACCTATAT ACAGGTGGCC CTTTCCCTAT TTCATGGACG 4200 

40 CCTTTCGGTG AATTATTCTC AGGCGTATTT ATGGGTATGT TTATTATCGT TATTGCATTC 4 260 

TTTATTCAAA CTGGCAATAT TCAAAGTTAT GTAATTTGGT TAAGTGTACC TATAGTAATC 4 320 

ACTATCGG 4328 

''^ (2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 50 base pairs 
<B) TYPE: nucleic acid 
so (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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GTTCAATACA GAAAAAATAA 


ATTTAGATGT 


TGAAGCATCC 


TACAATTAAT ACAGATCCAT 


60 




TTCAATATAT TTAAACTAAA 


ATCTCGGGAT 


TTCTAAATTT 


TGAAATTTCG AGGTTTTnAT 


120 


5 


ATTTTTATTT AAAATAGCAC 


ATTTATACTT 


TATAATAGTA 


AAGATGAACA TATAAGGAGG 


180 




CCAAATCATG GCAAAACATC 


CATTCGAACA 


ATTTAATCTA 


GAATCTAGTT TAATTGACGC 


240 


10 


TGTGAAAGAC CTTAATTTTG 


AAAAACCAAC 


TGAAATTCAG 


AATCGAATTA TTCCAAGAAT 


300 




ACTAAAGAGA ACAAATTTAA 


TTGGTCAATC 


TCAAACGGGT 


ACAGGGAAAT CTCATGCATT 


360 




TTTATTACCA TTAATGCAGT 


TAATTGATAG 


TGAAATAAAA 


GAACCACAAG CAATCGTAGT 


420 


IS 


TGCACCAACA AGAGAACTTG 


CACAACAACT 


ATACGATGCA 


GCGAACCATT TAAGCCAATT 


480 




TAAAGCTGGT GTTTCAGTTA 


AAGnTTTAT 


TGGTGGTACA 


GATATAGAGA AAGATAGACA 


540 




ACGTTGTAAT GCACAACCAC 


AATTGATTAT AGGCACCCCT ACTAGAATTA ATGACTTAGC 


600 


20 


TAAAACGGGA CATTTACATG 


TGCACTTAGC 


ATCATATTTA 


GTTATTGATG AAGCGGATCT 


660 




TATGATTGAC TTAGGATTAA 


TTGAAGATGT 


AGATTACATT 


GCTGCAAGAT TGGAAGATAA 


720 




TGCAAATATT GCGGTGTTTA 


GTGCTACAAT 


CCCACAACAG 


TTACAACCAT TTTTAAATAA 


780 


25 


ATATTTAAGT CATCCAGAAT 


ATGTAGCTGT 


CGACAGTAAA 


AAACAAAATA AAAAGAACAT 


840 




CGAATTCTAT TTAATACCTA 


CTAAAGGTGC 


AGCTAAAGTT 


GAAAAGACTT TAAATTTAAT 


900 




TGATATACTA AATCCATACT 


TATGTATTAT 


TTTCTGTAAT AGTAGAGATA ATGCAAATGA 


960 


TTTAGCACGT TCACTAAATG 


AAGCTGGTAT 


TAAAGTTGGT 


ATGATTCATG GTGGCTTAAC 


1020 




GCCaCGTgAA CGTAAACAAC AAATGarACG 


TATACGTAAT 


TTaGAATTCC aATACGTTAT 


1080 


35 


TGCCaGCGAT TTAGCATCTC 


GTGGTATTGA 


TATTGAAGGT 


GTTAGTCrTG TCATCaATTT 


1140 




TGATGTGCCA AATGATATTG 


ACTTCTTTAC 


GCATAGAGTC 


GGACGAACTG GTCGTGGGAA 


1200 




TTATrrAGGT GTAGCAATTA 


CGCTTTATAG 


TCCTGATGAA 


GAACACAATA TTTCATTAAT 


1260 


40 


AGAAGATCGC GGTTTTGTAT 


TCAATACTGT 


TGATATTAAA 


GATGGTGAGT TAAAAGAAGT 


1320 




TAAAGCGCAC AATCAGCGTC AAGCAAGAAT 


GCGCAAAGAT 


GACCATTTAA CTAATCAAGT 


1380 




GAAGAACAAA GTTCGAAGTA 


AAATTAAAAA 


CAAAGTTAAA 


CCAGGTTATA AGAAGAAATT 


1440 


45 


TAAACAAGAA 








1450 




(2) INFORMATION FOR SEQ ID NO: 281: 







(i) SEQUENCE CHARACTERISTICS: 
SO (A) LENGTH: 1139 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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75 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

AGTCAGGTAT ATCATGCCaT yCTGAATTGG TCGATATTAA TATCAGTGGT GTTAAAGAAC 60 

GAATTGTATA CCAATAGACG CTTTATATTG TAAAATAGTA TTAAATGCaG AATAGAGAGG 120 

AGATTTAATG CGATATGACA AATTATAAAG TTGTCGTTTT AGATATGGAT GACACATTGC 180 

TAAATTCAGA TAATGTGATA TCAGAAGAAA CTGCAAATTA TTTAACAGCA ATTCAAGATG 24 0 

AAGGTTATTA TGTTGTTCTA GCATCTGGTA GACCTACTGA AGGTATGATT CCAACTGCTA 300 

GAGATTTAAA ATTACCTGAA CATCATAGCT ATATTATTAG TTATAACGGT AGTAAAACGA 360 

TTAACATGAC TAATGAAGAA GTAGAAGTAA GTAAATCGAT TGGTAAGCAA GATTTCGATG 4 20 

AAATTGTAGA TTATTGTCGA GATAGAGGCT TTTTCGTTCT TACATATCAT GATGGTCAAA 4 80 

TTATTTaCGA CAGCGAACAT GAGTATATGA ATATTGAAGC AGAATTAACA GGTTTACCGA 54 0 

20 TGAAACGTGT TGATGATATC AAAGCGTATA TTCAAGGCGA TGTACCCAAG GTCATGGGTG 600 

TAGATTATGT AGCGAATATT ACAGAAGCTA GAATTGATTT GAATGGTGTG TTCAATGATA 660 

ATGTAGATGC TACGACAAGT AAGCCATTCT TCTTAGAATT TATGGCCAAA GACGTTTCAA 720' 

AAGGTAATGC AATTAAAGCG TTATGTCACA AATTGGGATA TTCGGTGGrfT CAAGTCATTG 780 

CTTTTGGTGA TAGTATGAAT GATAAATCAA TGTTTGAAGT CGCAGGTCTA GCTATTGCTA 84 0 

TGGGGaATGC ATCAGATGAA CTTAAGCAAT ATGCAAATGA AGTTACGTTG GATCATAATG 900 

AAAATGGTAT TCCACATGCG CTCAAAAAAT TGTTATAAAT TTTAAAATAA GCCTTAACAC 960 

ATGATATTTG AATAAGATAT CTTGTGGTTA AGGCTTTTTA TTTTTGTGAA AATGACTTCA 1020 

GTTATACTAT GGAGGATTTG AAATACATAT TTTAGATTAG TAATGATATC AAACGAATAG 1080 

AGTAAATGTA TATTTttTGA ATAAATCAAG TATTAACTAG TCACGGAAGG nAGATAAAT 1139 
(2) INFORMATION FOR SEQ ID NO: 282: 

40 ' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



30 



35 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 282: 

TCTAAAAATG CTGTGAAATT CTTTTATAAA TATCTAAAAG GAATTAATGT TGATAACATT 60 

GCTGTGATAG GAAGTAAGAC AGCGCAATAT TGTGAATCAC TTGGCATTCG AGTTGATTTT 120 

AtGCCAAACG ACTTTTCTCA AGAAGGATTT TTAAAATCAT TTAATCAAAC TAACCAAAAA 180 
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AATGAAGTTG TTAAAATAGA TTTATATACT TCAGTGCCTA ACAAACAAAA TATACAAGAT 
GTTAAAGAAA TGATAGAACA TCAACAAATC GATGCATTAA CATTTTCAAG TTCGTCGGCA 
GTACX3TTATT ATTTTAATGA AGGATTTGTA CCAAAATTCA AGTCGTATTT TGCTATTGGA 
GAACAAACAG CACGGACCAT TAAATCATAT CAACAACCAlQ TAACAATTGC AGAAATTCAA 
ACACTCGAAT CACTAATTGA AAAGATTTTA GAAAGTAGGG GCTAAAAATG AAATTTGATA 
GACATAGAAG ATTGAQATCA TCAGCGACAA TGAGAGATAT GGTTAGAGAG AATCATGTAA 
GAAAAGAAGA TTTAATATAT CCAATTTTTG TAGTTGAAAA AGACGATGTG AAAAAAGAAA 

TTAAGTCATT GCCAGGTGTA TACCAAATCA GTTTGAATTT ACTTGAAAGT GAATTAAAAG 720 

AAGCTTATGA CTTAGGCATA CGTGCCATTA TGTTTTTCGG TGTTCCAAAC TCAAAaGATG 780 

ATATAGGTAC TGGTGCATAC ATTCACGATG GTGTTATTCA ACAGGCAACA CGTATTGCTA 840 

20 AAAAAATGTA TGATGACTTA TTAATTGTTG CAGACACTTG TTTATGTGAA TATACTGATC 900 

ATGGTCATTG TGGCGTGATT GATGACCATA CACATGACGT TGACAATGAT AAATCATTGC 960 

CACTACTTGT TAAAACAGCA AITTCTCAAG TGGAAGCTGG TGCTGATATT ATTGCGCCAA 1020 

GTAATATGAT GGATGGTTTT GTTGCTGAAA TTCGTCGTGG ATTAGATGAA GCCGGCTATT 1080 

ACAATATTCC TATAATGAGT TATGGTGTCA AGTATGCATC AAGTTTCTTT GGACCTTTTA 1140 

GAGATGCAGC AGATTCAGCG CCATCATTTG GGGATAGAAA AACGTATCAG ATGGACCCTG 1200 

CTAACCGTTT GGAAGCACTT CGTGAATTAG AAAGTGATCT TAAAGAAGGG TGCGACATGA 1260 

TGATTGTTAA ACCTGCTCTA AGTTATTTAG ATATAGTTCG AGATGTTAAA AATCATACGA 1320 

ATGTTCCAGT TGTTGCATAT AATGTGAGTG GAGAATATAG TATGACTAAA GCAGCGGCAC 13 80 

AAAATGGTTG GATAGATGAA GAACGTGTCG TTATGGAACA AATGGTTTCA AT6AAACGTG 1440 

CAGGTCCTGA TATGATTATT ACGTATTTTG CAAAGGACAT TTGTCGCTAT TTAGATAAAT 1500 

AAGGTTTTAT ATTTATGATT TTCCATAAAC TGTAGGAGGA ATTTACTTTA TGAGATATAC 1560 

GAAATCAGAA GAAGCAATGa AGGTTGCTGA AACTTTAATG CCTGGTGGTG TAAATAGTCC 1620 

AGTACGCGCA TTTAAATCAG TAGATACACC AGCAATTTTT ATGGATCACG GTAAAGGTTC 1680 

45 AAAAATTTAT GATATCX3ATG GTAACGAGTA TATCGACTAT GTACTAAGTT GGGGACCACT 174 0 

TATTTTAGGA CATAGAGACC CTCAAGTTAT TAGTCATTTA CATGAAGCAA TTGATAAAGG 1800 

TACAAGTTTT GGTGCATCAA CATTACTTGA AAATAAATTG GCGCAgcTCG TTATTGACCG 1860 

AGTACCTTCA ATAGAAAAAG TGCGTATGGT GTCATCTGGT ACAGAAGCTA CATTGGATAC 1920 

TTTAAGATTA GCACGTGGTT ATACTGGCAG AAATAAAATT GTGAAATTTG AAGGTTGCTA 1980 

55 
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10 



IS 



20 



2S 



30 



40 



45 



GCCX3GATTCT CCTGGTGTGC CTGAAGGTAT TGCTAAAAAT ACAATTACAG TTCCATACAA 2100 

TGATTTAGAT GCACTTAAAA TCGCTTTCGA AAAATTTGGa AACGATATTG CTGGTGTAAT 2160 

CGTAGAACCT GTTGCTGGTA ATATGGGTGT CGTACCGCCG ATTGAAGGTT TTTTACAGGG 2220 

ATTAAGAGAT ATTACGACTG AATACGGCGC ATTGCTAATT TTCGATGAAG TAATGACTGG 2280 

TTTCAGAGTC GGTTATCATT GTGCACAAGG TTACTTTGGT GTGACACCAG ATTTAACTTG 2340 

CTTAGGAAAA GTTATCGGTG GAGGACTACC TGTAGGTGCA TTTGGTGGTA AAAAAGAAAT 2400 

CATGGATCAT ATAGCACCAT TAGGAAATAT TTATCAAGCG GGTACGTTAT CAGGAAATCC 24 60 

TCTTGCAATG ACAAGTGGTT ATGAAACGTT AAGCCAATTA ACGCCAOAGA CATATGAGTA 2520 

TTTTAATATG TTAGGCGATA TACTTGAAGA CGGTTTAAAA CGTGTATTTG CTAAACACAA 2580 

TGTACCAATA ACTGTAAATA GAGCAGGTTC AATGATTGGT TATTTCTTAA ATGAAGGACC 2640 

TGTAACTAAT TTTGAACAAG CGAATAAAAG TGATTTGAAA TTATTTGCAG AAATGTATCG 27 00 
AGAAATGGCA AAAGAAGGTG TGTTTTT A CC ACCATCTCAA TTTGAAGGTA CATTCTTATC ' 2760 

TACGGCACAC ACGAAAGAAG ATATTGAAAA AACGATTCAA GCATTTGATA CGGCTTTAAG 2820 

TCGTATTGTA AAATAAATAT ACGGACAAAT TGAGAGCCTG AACTTTGTTC AGGCTCnTTT 2 880 

TAAATGTATA TAAGGCATGG GCGGCGACTT GATAGTGAAA GTCCACTACT A 2931 
(2) INFORMATION FOR SEQ ID NO: 283: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
35 (D) TOPOLOGY: linear 



r(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

AATTATGAAT GCATTACCAG TATTATTACA AAAGAACAAT TAAAAATGTT TGTTTATGAT 60 

TATGATACGC ATCTCATTAA AAATGTAaTG GTTGCAGCAG ACGTGTTAAA GGCAAATGAT 120 

ATTCAAGGAC ATGAACCATT AATCGTTAAC CTTCAAACGA TTGATGAAAC ATTACATCGT 180 

TTACCTATGC ATAATAGAAA AGACATGATG GTTAATGGCG GTGTACTTAT GGCACATTTA 240 

AATGCCAAAA GTGGTCCGTG GTTAAAAGAT GTGCTAAGAC AAATTGAGAT AGCGATTGTA 3 00 

5^ ACAGGTAAAG TAAGCAACGA AGAAACTGAA ATTTTGAAAT GGGTGGATAA TCATGTCAAA 360 

ATATAGTCAA GATGTACTTC AATTACTCTA TAAAAATAAA CCGAATTATA TATCTGGACA 420 

AAGCATTGCG GAATCACTTA ATATTTCACG CACTGCAGTA AAAAAAGTGA TTGACCAATT 4 80 
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CCCAGATATT TGGTATCAAG GTATAATAGA CCAATATACA AAAAGTTCTG erTTOTTTGA 
TTTTAGTGAA GTATACGATT CAATAGATTC TACTICAACTT GCTGCGAAAA AGTCACTTGT 
TGGAAATCAA TCTTCATTTT TTATCTTGAG TGATGAACAA ACGAAAGGTC GTGGGCGATT 
TAATAGACAT TGGAGTTCTT CAAAAGGGCA AGGACTTTGG ATGTCTGTCG TGTTAAGACC 
TAACGTTGCA TTCTCAATGA TATCTAAATT TAATTTATTT ATTGCATTAG GGATAAGAGA 
TGCGATTCAA CATTTTAGTC AAGATGAAGT CAAAGTGAAA TGGCCGAATG ATATATTTAT 
TGATAATGGT AAAGTGTGTG GTTTCTTAAC TGAAATGGTT GCTAATAATC ATGGTATAGA 960 
AGCAATAATA TGTGGTATAG GTATTAATTT GACGCAACAA CTAGAAAACT TTGATGAAAG 1020 
TATTAGACAT AGAGCAACAA GTATACAATT ACATGATAAA AATAAATTAG ATAGATATCA 1080 
ATTTTTAGAG ATATTACTTC AAGAAATTGA AAAAAGATAT AATCAATTTT TAACGTTACC 114 0 

TTTTTCTGAA ATTCGTGAAG AATATACTGC AGCTTCTAAT ATTrGGAATA GAACGTTGCt 1200 
ATTTACAGAA AATGATAAAC AGTTTAAAgG ACAAGCAATT GATTTAGATT ACGATGGCTA 1260 
TCTAATTGTT AGAGATGAAG CGGGTGAATC ACACCGTTTA ATTAGTGGtAG ATATAGATTT 1320 
TTAACACTAA AGCAAGGAGA GATAGCTATG GGTATGGCAA CCTATGCCGT TGTGGATTTG 138 0 

GnAACAACAG GCAACCAATT AGATTTTGAC GATATCATTC A 
30 (2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2202 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1421 



-(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 284: 

CCAAGTTGCC TAAAATGATT AAGCAAGGTT TATACCCTAT GCnAACGATT GAACAAGAAT 60 

CTGGAGCCAT CCGACTGCCA ACGATTTCTA GAGTGAnGCG TTCATTACAA TGGGGTAATG 120 

ATGCTTATAC AATGATTTTA GATCGTATGA ATATTGAAAC AAATGAATAA TAAATGAACG 180 

ATAAACAATG GTTATCTATC TGCACTAATA AGGTAGATAA TCATTGTTTT TTCACGAAAA 240 

AATTTACAGA GTAAAAGAAC TTAAATTTCA TATTAAGTCT TTAGAACTCG ACACTTAAAA 3 00 

ATGCTATAAT CATATGTATG TTAAAAAAGG AGTTTCGGAA AATGTATGAC ATTAAAAAAT 3 60 

GGCGCCATAT TTTTAAATTA GACCCAGCTA AACATATTTC AGATGATGAT TTAGATGCGA 420 
TTTGTATGTC TCAAACAGAT GCAATTATGA TTGGTGGAaC TCATGACGTT ACTGAAGATA 
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CAAACATCGA AAGTGTAATG CCTGGTTTTG ATTtTTATTT TGTACCTACA GTACTGAACA 600 

GTACAGATGT TGTATTTCAC AATGGTACAT TATTAGAAGC GCTTAAAACA TATGGACATA 660 

GTATAGATTT TGAGGAAGTA ATATTTGAAG GGTATGTCGT GTGCAATGCT GATAGCAAAG 720 

TGGCAAAACA TACCAAAGCA AATACAGATT TAACAACAGA AGATTTAGAA GCATATGCCC 7 80 

AAATGGTCAA TCATATGTAT CGATTACCGG TTATGTATAT AGAGTATAGT GGCATTTATG 84 0 

GCGACGTATC AAAGGTTCAA GCTGTCTCAG AACATCTAAC AGAAACGCAA CTTTTTTATG 900 

GTGGCGGTAT TTCCTCAGAA CAACAAGCGA CAGAGATGGC AGCTATTGCA GATACAATTA 96 0 

IS TCGTCGGTGA TATTATTTAT AAAGATATTA AAAAAGCTTT AAAAACAGTA AAAATAAAGG 1020 

AGTCTAGTAA ATGAATGCGT TATTAAATCA TATGAATACA GAGCAAAGTG AAGCTGTAAA 1080 

GACAACAGAA GG ACCATTGT TAATTATGGC AGGTGCTGGT TCAGGGAAGA CACGTGT TT T 114 0 

AACACATAGA ATTGCTTATT TATTAGACGA AAAAGATGTC TCACCATACA ATGTTTTGGC 1200 

TATTACTTTT ACAAATAAAG CTGCAAGAGA AATGAAAGAA CGTGTTCAAA AATTAGTAGG 1260 

TGATCAAGCA GAAGTTATTT GGATGTCAAC ATTCCACTCA ATGTGTGTTC GTATTTTACG 132 0 

TCGTGATGCA GATCGAATTG GTATAGAACG CAATTTTACG ATAATTGATC CTACAGACCA 13 80 

AAAATCTGTT ATTAAAGACG TCTTAAAAAA TGAAAATATT GATAGTAAAA AGTTTGAACC 14 40 

TCGTATGTTT ATCGGTGCGA TCAGTAATTT GAAAAATGAA CTTAAAACAC CTGCAGATGC 15 00 

TCAAAAAGAA GCCACAGATT aTCACTCgcA AwTGGTaGCA ACgGTTTaTA GTgGATATCA 156 0 

ACGCCAATTG TCACGTAATG AAGCGTTAGA TTTTGATGAC CTTATTATGA CAACGATTAA 16 2 0 

^5 CTTATTTGAG CGTGTACCAG AAGTTCTAGA ATATTATCAG AACAAATTCC AATATATTCA 16 80 

TGTAGATGAG TATCAAGATA CTAATAAAGC ACAATACACA TTAGTTAAAT TATTAGCAAG 174 0 

TAACTTTAAA AACTTATGTG TTGTAGGTGA CTCAGATCAG TCAATTTATG GTTGGCGTGG 18 00 

TGCTGATATT CAAAATATCT TATCATTTGA AAAAGACTAT CCAGAAGCGA ATACAATCTT 1860 

TTTAGAGCAA AATTATCGTT CGACX^AAAAC GATTTTAAAT GCGGCTAACG AAGTGrTTAA 1920 

AAATAATTCT GAACGTAAGC CAAAAGGACT GTGGACTGCA AATACGAATG GTGAGAAAAT 1980 

TCATTACTAT GAAGCAATGA CGGAACtGAT GAAGCGGAAT TTGTAATACG AGAAATTATG 2040 

AAGCATCAAC GTAATGGTAA GAAATATCAA GATATGGCAA TTTTATATAG AACGAATGCA 2100 

SO CAATCACGTG TACTTGAGGA AACATTCATG AAATCTAATA TGCCATACAC AATGGTTGGT 2160 

GGCCAAAAGT TCTATGACCG TAAAGnAATC CAAAGATTTA TT 2202 
(2) INFORMATION FOR SEQ ID NO: 285: 
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(A) LENGTH: 785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 
AGTGGTGCAA AGATAGGCAT TGATaATACC GCTAAGCCAG CAAGATGATG GCACGATAAA 
ACCTAAACAG AAGAAmATAA ATAGTAATAC GATGATAAAT AATGGTCCAC TCATATGTTG 
AACTAAAGAT GATGAAAAGT GTAAGATTGT ATCTGAAATC ATACCTTCAT TCAACACTAA 
ATTAATACCT OGAGCTAAAC CAATAATTAA AGATACACCT ACTAAACTTG ATGCACCATT 
GACAAATGCA TCTACAGTTC CTTTTTCTCC CAATCCAGAT TTACCTGTCC CAGCAATAAA 
CATTATTATA ATTGTAAATA TTAAAAATGC TGAAGCCATA ACTGGGAACC ACCAACCTTG 
CGTCATAACT CCCCATACCA TAATTGGAAA TGGTAGTACA AATAATGTAA GGATTATCTT 
CTTACGCAAA GTAAAATGGG CACTATCGTC ATCTTTTAAT ACAGACCATT GCTGTTCAAA 
AGCATCTTTG TCTTCATAAG AATATGACGC TTTAGGATCG TTTTTAATTT TTTTACAGTA 
CCAATATAAA TAACTAATAA CAAAAATCGC ACCGACAATA CAAGCACCTA TTCTCCAATA 
CAAGCCATCC GTAAAAGTTG TACCAGCGGC ATTAGAGGCA ATTACAACCG AGAACGGGTT 
AATAGTTGAA AATGTACTAC CQACAGAGCT GGcAAGGAAT ATGGCACCAA CTGGAAACGA 
TAGAATCGTA TCCTAACGCT AATAAATATA GGGACTAAAA TCGGATAAAA TGCTACAGCT 
TCTTC 

(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

" (C) STRANDEDNESS: doxible 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 
CTAACGnGAT AAGGTTGCAA nTTTATCTGA ACATCTGATG ACTGTAATTT TGTtAATGAT 
AAAATATTTG TCACTAATAG ATATAAATAC TGACTTTCTT GAAAACTATG TACAAGTAAT 
TGTTCCTTTT CTATGATAGA CATATCTTTA CTATGTGATA CTAAAATATC TAAATkTCCC 
ATAATTGTTG TTAACGGTGT ACGTATGTCA TGCGAAATTG ATCTTAAAAA ATTTCAATGT 
GTCAGTTGAC GTTCAGCCTG TAACATGGAT TCTCTCGTTT GTTTAAGTAA CGTCACATTT 
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ATCACTTGAG AACTTTGGTA ATCAATGGCT AGAATGCCTT TAATCGGAGA TGTGCCAATT ' 420 

GGTATCAACC ATTTATTAAT GCCTGGAAAT GTATCTGTTG TTGCACCAGC TTGTCTTTCA 4 80 

TTTTTAATTA CCCAGCTTAA TGCTTGTTCA TGCTGTTGAG TCGTATTATC GATATGGTTT 540 

TGCAATGGTA TTGTTTTAAT TACTTTCGAT TGATTGATAA CGTATATAGT AATTGATTGT 600 

TGCAATAATT GATTAATTTG GTATCCAGCA TTTATTAGTA AGTTTTCAAC TGTATAAGTT 660 

TGTTTAATCG AATCATTAAA TTGAAATAAT AAATCTGTAC GATAAAGTTG CTTTTTAGTA 720 

ATGGaGTAwT GGAATTTAAT TTGTnTTAAT AAAGCACTCG TTAAAATACT TGTnAAAATG 780 

CTAACGATAA ATGTAATAGG ATAGTCAAAG CG 812 
(2) INFORMATION FOR SEQ ID NO: 287: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1732 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

ATnnATTATT ATTACTGCTA TTTTAAATTT TAAAAAATGC TTTTGATTAT ATTCAACAnT 60 

TTGTATAAAA TTAAATTTGC TTTTGATTAA AGCATGAAAA TTGTAATCAA ACCATAAATT 120 

GTCGTATGAT GTAGTTAGAA TTTTAAAATG CAGGAGGTCa AGTATATGAC TGAAATaACA 180 

TTCAAAGGTG GACCAATCCA CTTAAAAGGT CAACAAATTA ATGAAGGTGA TTTTGCACCT 240 

GATTTTACAG TGTTAGATAA TGACTTAAAT CAAGTAACAT TAGCAGATTA TGCTGGTAAA 300 

AAGAAATTAA TTAgTGTGGT ACCATCAATT GATACAGGTG TTTGTGATCA GCAGACTCGC 360 

AAATTCAACT CTGATGCTTC TAAAGAAGAG GGGATTGTGC TTACAATTTC AGCAGACTTA 420 

40 CCATTCGCAC AAAAAAGATG GTGCGCTTCA GCAGGTTTAG ACAATGTCAT TACATTAAGT 480 

GACCACCGTG ACTTATCATT TGGTGAAAAC TATGGCGTTG TTATGGAAGA ACTTCGCTTA S4 0 

TTAGCTCGTG CAGTATTTGT ATTAGATGCA GATAATAAAG TTGTTTATAA AGAAATCGTT 600 

AGTGAAGGTA CTGATTTCCC AGATTTTGAT GCTGCTTTAG CTGCATACAA AAATATTTAA 660 

TCATTAAAGA GATAAATCTT AAAATGTATA CATCGTGTCC ATCGTTGTCA ACAGCATTAA 720 

AATAGAATTG TTTTCTATGA TTGCTAAGAC CTATGGGCAC TTTTTATTGG AGAGGGACGA 780 

ATATGGCAGA ACAACAAACA ATTATGGAAC GCTTGTTTCA TACATTAGAT GAAAAAGCTA 84 0 

AAACATTAAA TAATGAAAAT GGCCaAAGTT TTATTGAAAA TCTTGGGCTA GCAATGGAAC 900 
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CATTCCAATT TGCATATTTA AGTTTAATGC aGGAAGAAAA GATAcAAGCA AATCATCAAA 
TTACACCAGA TTCAATTGGA TTGATACTAG GATTTTTAGT TGAGCGTTTT ATCAACAACC 
AAGAAGAATT ACATATTGTT GATATTGCAA GTGGTCCCGG TCATTTAAGT GCTACTGTAA 
AAGAAGTGTT ACCTGraAtT GcGGTTATGc ATcATTTaAT TGaAGTTGAC CCAGTTTTAT 
CACGTGTTAG TGTACATTTA GCAAACTTCT TAGAAATTCC TTTCGATGTG TATCCTCAAG 
ATGCCATCAT GCCACTACCA TTAGAAGAAG CAGATATCGT TATTGGTGAT TTTCCAGTAG 
GCTATTATCC AATTGATGAA AGAAGTAAGG AGTTTAAGCT AGGTTTTGAA GAAGGACATA 
GTTATTCACA TTATTTATTA ATAGAACAAG CAATAAATGC ATTAAAAGAT GCTGGATATG 
CCTTTCTAGT GGTACCAAGT AATATTTTTA CAGGTGAACA TGTAAAACAG CTTCAAAAAT 
ATATTGCAAC AGAGACAGAG ATGCAAGCAT TTTTAAATTT ACCACCAACT TrATTTAAAA 
ATGAAAAAGC GCGAAAATCT ATATTAATTT TACAAAAGAA AAAATCGGGT GaAACAAAGC 
CAGTTGAAGT ATTATTGGCA AATATTCCTg ATTTCcAAAA TTCCTTCACC AATTTCCAAG 
GATTTATGGA CAGAGTTAAA ATCCAGTGGG ATGGGACCAC CAAATCGTCC TA 
(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 
AAAAGAACTA GCTAAACGCA AGCAAGAAGC TATTAGTAGA ATTAAAGACT TTTCAAATGA 
AAAAATAAAT AGTATTCGAA ATAGTGAAAT TGGCACAGCT GATGAAAAAC AAGCAGCAAT 
GAATCAAATT AACGAAATTG TGCTTGAAAC AATTAGAGAT ATTAATAATG CGCATACATT 
ACAGCAAGTT GAGGCTGCAT TGAACAATGG TATTGCTCGA ATTTCAGCAG TACAAATTGT 
AACATCTGAT CGTGCTAAAC AATCGTCAAG TACTGGAAAT GAATCTAATA GCCATTTAAC 
AATTGGTTAT GGAACTGCAA ATCATCCATT TAACAGTTCG ACTATTGGAC ATAAAAAGAA 
ACTTGATGAA GATGATGACA TTGATCCACT TCATATGCGT CACTTTAGTA ATAATTTCGG 
TAATGTTATT AAAAACGCTA TTGGTGTGGT GGGTATCTCT GGTTTACTAO CTAGTTTCTG 
GTTCTTCATT GCCAAACGTC GTCGTAAAGA AGATGAAGAG GAAGAATTAG AAATAAOAGA 
TAATAATAAA GATTCAATAA AAGAGACTTT AGACGATACA AAACATTTAC CACTTTTATT 
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AAATAATGGC GAGTCACTCG ATAAAGTTAA ACATACGCCG TTCTTCTTAC CAAAACGTCG * 720 

TCGTAAAGAA GATGAAGAAG ATGTGGAAGT TACAAATGAA AACACAGATG AAAAAGTGTT 780 

GAAAGATAAC GAACATTCAC CACTCTTATT CX5CAAAACGA CGCAAAGATA AAGAGGAAGA B40 

TGTTGAAACA ACAACTAGTA TTGAATCTAA AGATGAGGAC GTTCCTTTAT TATTGGCTAA 900 

AAAGAAAAAT CAAAAAGATA ACCAATCCAA AGACAAAAAG TCAGCATCAA AAAATACTTC 960 

TAAAAAGGTA GCAGCTAAAA AGAAGAAAAA GAAAGCTAAG AAAAATAAAA AATAATTTGT 1020 

TTCTTTGATA AATAGaGGAG CACCGATTGA CATCACATCA GTCGGTGCTC CTTTTATTTA 1080 

TTCTTTTTAA TTAATTTATA CAATGCCTGT TGAGCGTGTT GATTCGCTTC TTTGTTTTGT 1140 

TCTCTCGGTA TCCATTTAAC AAATAATAAA TCAAAATCTT TTTCAAATAT TTCTATTTGA 1200 

TCAAAATAAG GTTTGAAATT TGCGTTTTTC ACATAACCAG CTTCAATGCT ATCTGCAATT 1260 

20 AGCTTTGAGT CTGTATATAA TAGTGCGTTT TGAACATTTA ATTCACGTGC ATGTTCTAGT 1320 

GCATAAATAC ATGCAGCCCA TTCTGCAGTG TGGTTATCCA TTTCGCCTAA CTCATGTGTA 1380 

TATGTATAAT GCTGCTCATC TTCTTTGATT ACAATGGCAC ATGTACTTAT GCCTGGATTT 144 0 

25 cCTTTCGTCG CAGCATCAAA ATTTATTTTC GCCATAATAA ACCTACrTTC TATTCAATAC 1500 

TTAGTTAAAG TTACTATTAC TGTAATACAA AATATGTTGG GTAATCCATT AAAAAACACG 1560 

CATCACTTAA ATAAGTAACA CGTGTTTAAA ATACTCGCTG ATTCAAAGAT GATTTTCTAA 1620 

TACGTAtACT GTaATATACT TCCTAAAAAA ATCATCTTCA GGCTGGGACA TAAATCAATG 1680 

TTCTATGCTC TACGATGTTA TATTGGCAGT AGTTGACTGA ACGAAAATGC GCTTGTAACA 1740 

AGCTTTTTTC AATTCTAGTC AGGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA 1800 

GGCAATGCGA GTTGGGGTGT GGGTCCCAAC ACAGAAGATG ACGAAAAGTC AGCTTACAAT 1860 

AATCTGCaAG TTTGGGATGG GCCCCAACAA AGAGAAATTG GATTCCCAAT TTCTACAGAC 1920 

40 AATCCAAGTT GGGGTGGGAC GACGAAATAA ATTTTGCGAA AATATTATTT CTGTCCCACT 1980 

CCCTTAAAAC TTATTCTTTT GTGTAGTAAG TGCGTTAATA GCCTTGATCT AACTTATCAA 2040 

TCTTACCTTT ACGATAAAAT GATTTAGCAA TATATCCAAA TGGTACATTG AAAACTGTTG 2100 

AAGCTAATTT TAATACX3TAC GTTGTAATAA ATATTTCAAa TACAAmTGTa CCAGGTAAAC 2160 

TTCCXSATAAA TGCGATAGCT ACAAATAAAG CTGTATCAAT TATTGrGCTT AAAAATGTAC 2220 

TACCATATGg CACGGATGAA AAACGTTTTa TCmGaACTAA ATACTTTTTT AATTAGTGAA 2280 

SO 

AAGATAAATA CATCAATATG TTGACCAATA ATATATGCOA CGATTGAGCC TAAAGCAATG 2340 

CGTGGCACAA CATCAAAGAT TGCGTGTAAT GCTTTTTGTG CCATATCTTC TGGTGCAGGA 2400 
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CAAACTGCTC TTTTTGCAAC TCTACX3CCCA TAAATATCGT TTAATATATC TCTTGCTAAA 
TAAATAGAAG CAAACATGAC ATTACCTAAA GTTGCTGAAA TACCAAAGAT TTCTACAGTT 
TTAATCACTT GTATGTTGGC AATGATTGTG CCAATTGCAA CCCATGCAAT TAAACCTTGT 
TTACCAAAAA AGCGATACAT AAGTACCATA AGCACGAACG TTGCAATAAA CGTAACTAGT 
CCTAAAATTT CATTATACAT ATTAAAATGT CCTCCTAAAT TTTGATCATG CGGGTGTTTA 
GAAACCGCTC AATAAATAA 

(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 
ACTGATGTGC GTTCATCAAA AACAATATAA TCAAATTCAT TTTCATCAAA TTGCTTAAAA 
TTATCATCTC TAGATAATGT TTGAATGGTT GCAAATAAAT ATTTGGCATC GACATCTCTA 
TGTTTTCCTG TCAACAATCC AAAATCACTA TCATTTTTTA TAGGTAATAC TTTTTTAAAT 
TCTTCCTTAG CTCTATTTAA AATCCCCTCA TTATGAACAA TAAATAAAAA TTTATTAGGG 
TTTACTTCTC TAACATCTAA TGCACATAAA ATCGTTTTAC CTGTACCAGT TGCAGATATT 
AITAACGCCT TATCTTTGGC TTTATCCCTA ATAGCTTTTA ATGACCTTAA TGCTTCTGCT 
TGCATTAAAT TGGGTACAAT TTCCACTGAT TTTTTCACCT TATCAGCTAG CAGCATTTGA 
GTTTGTTCAA CCTCCGCTAA TTTTTCTAAG GAACGGTACT CAAATGATTC TTTATATGAA 
TTAATCCATT GCTCAGTCAG TGGGGTACTC TTTTGCCATA ACAAGTCAAA TTCACTTTTT 
ACACTATCAA CTAAATCGCC ATTTTTCATA GTAGACAGTA AAACATTATG CTCATAATTA 
ACCTTTAACG CATTAGATGT TAAATTAGAG CTTCCTATTA CCATAGAACT ATAATCCTTA 
TGCTCAAAAA TATATCCTTT GGCATGGAAT CCAGCAATAT CAGTTAATCT TACCTCTACA 
TTTTTTAATT TAAGTAATTC TCCATACATT TTAGGACTAT TAAACCCTAA GTAATTAGAT 
GTTAATATTT TCCCTTTAAC ACGCTTATTG CTTAAATCTA ATAGTTGAGC CTTTAAGCTG 
GcTAAACCGC TTTCTGTTAT AAAAGCCACA GAAAAATAAA ACGTTTCACA TTTTTGAAGT 
TCATCTATAA TTGTTGAAAG AACTTTTTCA TTTTTATTAT TTACTAAAAG CTTCGGTGTA 
TAATTCCCTT TATGAGAAAT ATGTTTGTCT ATAAACCCTT TATGTAAAGA TTCATTGAAA 



1114 



EP0 786 519 A2 

CGCTGGGGCC CAATTTAATT 
TTCAGTTAGA GTTGGTAACT 
^ TCCAAAATCA TATTCATGTT 
CATTTCTTCT CTAATTTCTC 
ACnGCCAGGA AATTCCCACA 

10 

AATTTTGTTA TCAGAAAAAA 
CACCCTTCAA TTTAAAATCA 
AAATCAATAT TAAAAATTAA 

IS 

ATACACATCT ATTATATAAT 
TTTGGGGCGT ATTTCTTTAT 
20 TATTAAGTCT AACTGAGATT 
TAGAACGAGc TAATGATAGT 
AGCGAGAAGG TTTTGAAAGT 
GTCTTGGTGA AGGTCGACTT 
GTTATTTAAA TGCCCAAACA 
TTTCcAACTG AATTCGCCC 

30 

(2) INFORMATION FOR SEQ ID NO : 2 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1933 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 



45 



SO 



GATGACi-rrr 


CCCCCTCATA 


ATCTTCATGG 


TCCAGGCGTC 


CATTAATGCG 


TCAAAGGATG 


60 


GCACATTTTA 


CCTGGAACAA 


ATGATTCATA 


TGGTTCATAA 


AAATCACGCG 


TCGTAATATA 


120 


ATCTTCTAAA 


TCAAATGCAT 


AGAAAATCAT 


TGGCTTTTTA 


AATACTGCAT 


ATTCATATAT 


180 


TAAAGATGAA 


TAGTCACTAA 


TTAATAAATC 


TGTTATGAAC 


AGTATATCAT 


TAACTTCTCT 


240 


AAAGTCAGAA 


ACGTCAACAA 


AATATTGTTT 


ATGTTTGTCT 


GCAATATTAA 


GTCTATTTTT 


300 


CACAAATGGA 


TGCATTTTAA 


ATAATACAAC 


CGCGTTATTT 


TTTTCGCAAT 


ATCTTGCTAA 


360 


ACGTTCAAAA 


TCAATTTTGA 


AAAATGGGTA 


ATGTGCTGTA 


CCATGACCAC 


TACCTCTAAA 


420 



55 



TATCAAGTTC GTTTATTGAC AACCATTCAA TACTCTTATG 1140 

CTTTGTTTAA AGTACATTTG TATGTTGTTA ACCTAACAAT 1200 

CTGTAGTTAT AACTTTGTCT CCAACAATTA AATCACATTT 1260 

TAATCAAAGC GTCTTTTTCA GTTTCATTCT TTTCAACCTT 1320 

TTAAAGGCAG ACTCATTTTT TCACTTCTCT GTGCACAAAG 1380 

TAATAGCTCC TACTACATTG ATTACTTTTT TCATAAGACT 1440 

TCTTAATTGT TATTCTATCA AAAATTACAA AACTATATAT 1500 

TATTTTACAT TCACATGAAC GCTCTACTCC ATGCATTTTC 1560 

ACTTGTGAAA AGTATTGTCT TGGGGCTGTG TTTTTTTACT 1620 

AATTCATTAC ATAAATGTAA GGGCTTTAGT TTTCATGTTT 1680 

TTGAAAGGAT GTTTAGCAAC AATGGATAAA GAATTATGGA 1740 

TTAGTTAAAC ATTTTTATGa GCAGCAATCT GACATTGAAC 1800 

AAATTAACAT TTGGTACTGC GGGTATACGC GQAAAATTCG 1860 

AATAAGTTTA CTATTGAAAA ATTGGCATTA GGTTTAGCGC 1920 

AACAGTCCAA CAATAGTCAT TCATTATGAT ATTAGACATC 1980 
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TTGTTTGATC TGTGTCGCAT AAGCTTCATC 


AAATAGTACA 


TCAGTACXJTG 


GAACACCTGT 


540 




AGGCACTACA TTTTTCTCTT 


TAATACCAAA 


TGCTTCAGCG 


TAGAATGGAA 


TATCGGTTTC 


600 


5 


AGATGATACA TAAGCTTTTG 


TATAGCTACG 


ATGATTTAAT 


GAATCAATAA 


ATGGTCCACC 


660 




Cl"i-riTACCA GTACGACTAA 


AGCCAACTGT 


TTTAAAGGCA 


CCAACGGCAT 


GCCATACTTG 


720 


10 


AATAACTTCT TGAGAACGTC 


TAAAACGCAC 


TGTATAAATC 


AATGGGTGAA 


AGTCATCAAC 


780 


AAAGATGTAG TCTGCCTTCC 


CAAGTAAATA 


TGGCAATCTA 


AACTTGTCGA 


TGATGCCACG 


840 




TCTATCTGTA ATATTCGCTT 


TAAAAACAGT 


GTGAATATCA 


TACTTTTTAT 


CTAAATTTTG 


900 


IS 


ACGTAACATT TCGTTATAGA 


TGTATTCAAA 


GTTTCCAGAC 


ATCGTTGGTC 


TAGAGTCTGA 


960 




TGTGAACAAC ACCGTATTCC 


CTTrrrrcAA 


GTGGAAAAAT 


TTCGTCGTAT 


TAAATATCGC 


1020 




TTTAAAAATA AATTGTCTTG 


TATTAAATGA 


TTGTTTGCGG 


AAATACTTAC 


GTAATTCTTT 


1080 


20 


ATATTTACGA ACGATATAAA 


TACTTTTAAC 


TTCCGGAGTC 


GTTACAACAA 


CATCAAGGAC 


1140 




AAATTCATTA ACATCGCTAG 


AAATTTCAGG 


TGTAACAGTA 


TAAACCGTTT 


TCTTCX3AAAT 


1200 




GCCGCCTTTT CTAAATTCTT 


TTAGGTAAGT 


CTGCAATAAG 


AAATTGATTT 


TACCATTT'iX5 


1260 


2S 


TGTTTCTAAT TCGTTGTATT 


CTTCTTCTTG 


TTCTGGCTTT 


AGATTTTGAT 


ATGCATCATT 


1320 




AATCACATCT GGGTTTAACT 


GTGCAATATA 


ATCAAGTTCT 


TGCTCATTCA 


CTAATAAGTA 


1380 


30 


CTTATCTTCA GGTAAGTAAT 


AACCATTATC 


TAAGATAGCT 


ACATTGAAAC 


GACAAACGAA 


1440 


TTGATTCCCA TCTATTTTGA 


CATCATTCGC 


CTTCATTGTA 


CGTGTCTCAG 


TTAAATTTCT 


1500 




TAATACAAAA TTACTATCTT 


CTAAATCTAG 


GTTTTCACTA 


TGTCCTTCAA 


CGAATAACTG 


1560 


3S 


AACACGTTCC CAATAGATTT 


TATCTATATA 


TATCTTACTT 


TTAACCAACG 


TTAATTCATC 


1620 




CTTTTCTATT TACATAATCC 


ATTTTAATAC 


TGTTTTACCC 


CAAGATGTAG 


ACAGGTCTGC 


1680 




TTCAAAAGCT TCTGTAAGAT 


CATTAATTGT 


TGCAATTTCA 


AATTCTTGAC 


CTTTTAACAA 


1740 


40 


CGCTAATTTA nCTACAATAT 


CTGGGTATTG 


AATGTATAAG 


TCTACAACAT 


CTTGGAAATC 


1800 




TTTTGAACCA CTTCGACTAC TACCAATCAA CX3TTAGTCCT 


TTTTCCAATA 


CTAGACXSTGT 


1860 




ATTAACTTCT ACTGGGAACT 


CACTTACACC 


TAACAGTnCA 


ATGCTTCCTT 


CTGGTGAAAT 


1920 


45 


GTAATCGATC ATT 










1933 




(2) INFORMATION FOR SEQ ID NO: 291: 









(i) SEQUENCE CHARACTERISTICS: 
SO (A) LENGTH: 204 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

nGTnCGGnCA GATATATTGG TGGT C TTT A G TAAGTGTATC AAATTCATCA GATGTCAAGG 6 0 

GCATGTTATC ACCTCCTTAG GTTGATAACA aCATTATACa CGaAAGGAGC ATAAaCAAaT 12 0 

GAACACAaGA TCAGAAGGAT TGCGTATAGG CGTCCCACAA GTTTCTAGCA AAGCTGATGC 180 

T T CTTCATCC TATTTAACGG AAAAGGAACG TAACTTAGGA GCGGAAATAT TAGAACTTAT 240 

TAAAAAAAGT GATTACAGCT ACTTAGAAAT AAACAAAGTT TTCTATGCAT TAGATAGAGA 300 

ACTTCAATAC AGGGCGAATA ATAACAAACT TTAaCATTTA TCTAAAGGAG TGATAGAGAT 360 

GCCAAAAATC ATAATACCAC CAACACCAGA AAACACATAT CGAGGCGAAG AAAAATTTGT 420 

GAAAAAGTTA TACGCAACAC CTACACAAAT CCATCAATTG TTTGGAGTAT GTAGAAGTAC 4 80 

AGTATACAAC TGGTTGAAAT ATTACCGTGA AGATAATTTA GGTGTAGAAA ATTTATACAT 540 

20 TGATTATTCA GCAACGGGAA CATTGATTAA TATTTCTAAA TTAGAAGAGT ATTTGATCAG 600 

AAAGCATAAA AAATGGTATT AGGAGGATTA TCAAATGAGC GACACATATA AAAGCTACCT 660 

ATTAGCAGTG TTGTGCTTCA CGGTCTTAGC GATTGTACTC ATGCCATTGC TGTACTTCAC 720 

TACAGCATGG TCAATTGCAG GATTCGCAAG TATAGTGACA TTCATATTTT ATAAGGAATA 780 

CTTTTATGAA GAATAAAAAA ACTGCTACTT GTTGGAGCAA GTAACAGTGC AAGATGAGCA 84 0 

ATTGTCTTAA ATAATTATAT AAGGAGTTAT TAATATGACC TTACAACAAA AAATACTATC 900 

ACATTTTGCA ACATATGACA ATTTCAATTC TGATGATGTT GTTGAAACTT TTGGGATATC 96 0 

TAAAACACAT GCAAAATCCA CACTTTCAAA ACTTAAGAAA AAAGGAAAGA TTGCAATGGA 1020 

AAGTTGGGGT GTCTGGCGTG TTATTGAATC GCAATTGCAT TTAAGTGTAG TCGAACGTAA 1080 

AAAAGAAATT TTAGAAGAAC AATTTGAATT GTTAGCAAGA TTAAATGAAC AAAGTGATGA 114 0 

CCCTAGAGAA ATAGAAGAAC GTATCAAGTT AATGATTCGT CTAGCTAACC AATTTTAAGG 1200 

40 AGGAGTTAAT CAATGGCAGT ATTAGAAGGT ATTTTTGAAG AATTAAAACT ATTAAATAAG 1260 

AACTTACGTG TGTTAAACAC TGAACTATCA ACTGTAGATT CATCAATTGT ACAAGAGAAA 1320 

GTTAAAGAAG CACCAATGCC AAAAGAAGAA ACAGCTCAAC TGGAATCAAT TOAAGAAGTT 1380 

AAGGAAACTT CTGCTGATTT GACTAAAGAT TATGTTTTAT CAGTAGGAAA AGAGTTCCTT 1440 

AAAAAAGCAG ACACTTCTGA TAAGAAAGAA TTTAGAAATA AACTTAACGA ACTTGGTGCG 1500 

GATAAGCTAT CTACTATCAA AGAAGAGCAT TATGAAAAAA TTGTTGATTT TATGAATGCG 1560 

SO 

AGAATAAATG CATGAAGCTA GATCACTCAA ATAGAGCTCA TGCAAAGCTT AGTGCAAGTG 1620 

GAGCAAAACA ATGGCTAAAC TGTCCACCGA GTATTAAGGC AAGTGAAGGT ATTGCAGATA 1680 
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GTCTTAAATA TGAAGGCCTA ACACAGTTTG AGTTTAATAA AGCTTTTCAA AATTATAAGC 
GAAATCAATA TTACAGTGAA GAGTTGCGCG AATATGTTGa AGAGTACGTA GCTAATGTAG 
AAGAAAAGTA TAACGAGGCT TTGaGTAGAG ATGACGATGT AATAGCTTTA TTTGAAACAA 
AATTGGATyT AGGTAAATAC GTCCCTGAAT CTTTTGGTAy TGGTGATGTC AtTATATTTT 
CAGGTGGTGT ACTTGAAATT ATTGACCTTA AATACGGTAA AGGCATTGAA GTTTCAGCTA 
TAGATAATC 

(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 
ATGATGTTTC TATATTCGTA TTAGGAAAAC CTGTTGGTAT TACAACAAAC GCCCTAAAAT 
TACAATACCG CTGCTCCTAT ACCAATTGCA ACAACAGTTT TAACTGAAAT ATCTTGTTTT 
TTCATCTTCA TTACTCCTtt ACATAAAAAA TTCATTATAT TGATGGTGCT TTAGATAAAT 
GAATCGTCCA ATCATTTCCA GTACCAATAT GATATAAATC TGAAAATGAG TCTTGATTGA 
CTGCTACACC AATATTTACT AGCGAGTTAA CATACACAAG AGGTTCACCC ACATTAACAT 
CTGCAAACGA TCGCGCAAAT TTAATAATAT TTTGATAGAC TTTCTTATCT TGATGATAAA 
TTGTTACCAC CAAATTATTA CCATGAACAA TTTCCAAGGA TTTTAAGAAT GCCAATGGAA 
TATTTGTCCA TAATGACCCA AATCTGATAT CTAAAATATC AATGCTTCCT GTAACAGAAT 
cctcSttttt TGTCACTTCT CTTATTTCTA ATGCCTCAAT ACTATCAACA TTAATTGCCT 
GAcdsAGACG TTCAAACGCT ATCTTATTTG CAGCTAATCT CGCACCATTG TATGCATAAA 
CATCTCTACC ATGAAAAGTA TGACTTTCTT CCGAATGAGG CAATCGGCTT TTCACTTCAT 
CAATTTCGAT AACTTTTTTA ATACCTTCGT AATGTTTAAT ATGACTTAAA GAACCATTAT 
CAGGTGTAAT AATGTAATGA CCTGAATATG TTAAGCAAGC AATGTCCGCC TATCACTACC 
TACACCCGGG TCTACCACTG ATACAAAraC TGTGCCTTTA GGCCAGTATT TTACAGTTTG 
ATATAAACGA TATGACGCTA CCCAAATGTC ATACGGTGGT ATATCATGCG TTAAGTTTTC 
AACACGTATA TCATCATTAC AGTATATGCA ACTCCATACA TT 
(2) INFORMATION FOR SEQ ID NO: 293: 
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15 



20 



25 



(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : double 
(D) TOPOLCX5Y: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

10 TGTAATAAAA TTTTATGnAA CATGCTGnGA TGCTACCATG AACCTTCTTC ATTTCTTTAT 60 

GTGAGATTGT GaAATTAATC AAATAATAAT ACGGTGGATA CTTTCCTAAT TTACGATATT 120 

CCATTTCCTG ACGATAAAAT GTTAAATAAT CATTTTTTTG AACATCCAAT ATTGAATAAT 180 

GATCTGGATT ATACGTTTGA ATGATGACTT GACCTGCCTT TTCATGACGA CCAGCTCTAC 24 0 

CAGCCACTTG CGTTAATAGT TGATAAGTAC GTTCGCTCGC CCGAAAATCA GGTAAATTTA 300 

ACATTGTATC TGCATTCAGC ACACCAACTA AAGTAATATT TGGATAATCT AATCCTTTCG 360 

CAATCATCTG AGTACCTAGT AAAATGTCAC CGTTACCTTT TTCGAATTCA GTCAATAACT 420 

TTTCATGTGC ACCTTTCTTT GAGGTTGTAT CTACATCCAT CCTAATTATG CGCGCATCTT 4 80 

CAAATTCTTG TTGCAATAGT TCTTCAACTT TCTGAGTACC AGTACCTACT TGTCGAATGT 54 0 

GTTCACTCTC ACAATTTGGA CATTGATTCG GTGGCGTCTC TTGGTAACCA CAATAGTGAC 600 

ATTTTAATAA GTCTGTCGTT TTATGATACG TTAATGAAAT ATCACAGTTT GGACATTGCG 660 

30 GTACATATCC ACAATCCCGA CATAACATAA ACGATGCATA ACCACGTCGA TTTAAAAATA 720 

AAACAACTTG TTCCTGTCGA TCTAATCTTA ATTGTATGGC TTCACGTAAA TCTTTTGAAA 780 

ACATTGACCG ATTACCTTCA CTCAATTCTT CACGCATGTC TACTATATCA ATTTCAGGTA 84 0 

^5 AAGCTTGTTG GTTCACTCTG TTTGGTAATG ATAGCAAATG ATAAACGCCT TTTTCAGCTC 900 

GTGCATAACT TTCAAGACAT GGTGTTGCAC TTCCTAAAAT GACTGGACAG TGATGATATT 960 

CAcWCGCCA TTGGGCAATT TCTCTAGCGT GATATCTCGG ATAATCTTCT TGTTTATATG 102 0 

TAGATTCATG TTCTTCATCA ATGATGATTA ACCCTAAATT TTTGAAAGGT GCGAACACAC 1080 

TTGACCTTGC ACCAACACTT ACTCTCGCAC GACCATCCCT AATTTTTTGC CATTCATCAT 1140 

AACGTTCCCC ATTAGATAAG CCAGAATGTA ATACAGCAAC GTCATCACCA AATCGACGTT 1200 

TGAAGCGTAA AACCATTTGC GGTGTTAGAG CGATTTCAGG AACTAACATC ATCGCCTGTT 1260 

TTCCTTGG 126 8 
50 (2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 629 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 294: 
TACCACCAAA TAATATATTA GCTGGCATTT TAATAACATT TAnATTTCTC ATGATATCAT 
CAATAAAATG TTGAAACTTC GTAATTTCAC CTTGATAATC ATCAATTGCT GCTAATTGCG 
CACTCGATGC TTGCTGATCT AAATTTAAAA TATTCGACAT GCGTTGACTA TAATAAACTA 
AATGTTCTAT TAAGCCATCG TCACTCTTTT CCTTTGGTGC TGACATGACA GCGATACGTT 
TCAAAGGATA GTGTTGCGCC AATTTTAATG TCATTAATCC ACCTAAAGAC ACACCCGTTG 
CACTGATAGA TTCATAACCT TCATTGACTA AAAATTGGTA AGCTTTCTCA ACTTCTTCCC 
ACCAATCATC TACATTATAT GTCATGAAAT CTTTCAACAA TAAACCATGA CCTGGATAAT 
TCGGTGCATA ACAACTAAAT CCTTGGTCAT TTAACTCAGC TGCAAGATCC TTCACATCCC 
GATTTGTACC TGTAAATGAA TGTAATAATA ATATCGCATG TCCATTTGTG CCTTTTAAAT 
ACGtGGACTC GGTGTTTTAA TTCTCATTTT TCtaTATAcC TCCACTAtGT CTAAAGakGT 
TkGCTAAACG CGTTGtCGTC GATGATTAA 
<2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 
TATGAAAGTA ATGAATGGTA ATATTATTAA ATTTGATGGA AAAGTAGATA TTGATAATGC 
AGA-^TATC GGTTTTTTAA TTGAGCATCC TAAATTATAT GATAATAAAT CAGGATTGTA 
TAACTTGAAA TTATTTGCAC AAGTATTAGG TAAGGGTTTT GATAAAOCAT ACACAGACAA 
AATTATAGAT GCATTTGGTA TGAGACCTTA TATTAAAAAG AAAGTTAAGA AATATTCAAT 
GGGGATGAAG CAAAAGTTAG CAATTGCAGT ATCTTTArTG AATAAACCTA AATTTTTAAT 
CTTGGATGAG CCTACAAATG GkATGGATCC AGATGGCTCa ATTQATGTGC TGACTACAAT 
TAAGTCTTTA GkAAATGaAC TTGATATGAG AATTCtAATA TCAAGTCATA AGTTAGAAGA 
TATTGAATTA ATTTGTGATA GAGCTGTATT TTTAAQAGAC GGnCATTTTG TTCAAGATGT 
AAACATGGAG GAAGGTGTTG CATCTGACAC AACGATAGTT ACTGTTGATC ATAAAGACXT 
TOATAGAACT GAAAAATATC TTGCAGAGCA TTTCCAATTA CAAAATGTCG ACAAAGCAGA 



1120 



10 



IS 



25 



EP0 786 519 A2 

ATTAGATATT TATCCGAAAT ATATTGAAAC ACGTAAAAGT TCATTGCGTG ATACGTACTT 720 

CAATATAAAT CAAAGAGGTG ATAAATAATG AGAATTTTAA ATTTAGTTAA GTATGATTTT 780 

TATAGTATAT TTAAaArTCC TTTAACATAT TTAGCGaTAC TAGTCGTATC TAGTTTGATT 840 

GCAACTCAAA GTATACTTAT GGCAAATTCG ATGGATAACC CGAAACATAT TATTGTCTAT 900 

GGATCTGTAT TTGCTGCAGC AAAATGGTTA TTGTTAATAA TTGGATTAAT GTTTGTTGTT 96 0 

AAGACAATTA CGCGTGATTT TTCACAAGGT ACAaTTCAAC TATATATGAG TAAAGTTAAA 102 0 

ACACGCGTTG GAtACATTAT TTCGAAAACA ATTTCAATTa TTTTAATTTC AATATTATTT 108 0 

GCATTAATTC ATTATGTGAT TTTGATTGTT GTGCAGGCAT CTAGTAATGG AAAAAATTTG 114 0 

GCGTTTTCTA AATATGTAGA TAATTTATGG TTCTTCCTAA TCTTTTTACT ATTCTTTGGC 1200 

TTGTTTTTAT TCTTAATCAC ACTTGCATCA CAAAAAACAG CAATGATATT TTCATTAGGT 1260 

20 GTATTTTTAG TACTCATTGT ACCGTTTATT AAACCTTTTA TTACATTTAT CCCAAGATAC 1320 

GGTGAAAAAG TTTTAGATGC TTTTGATTAT ATCCCTTTTG CTTACTTAAC TGATAAAATG 1380 

ATTAGCTCTA ACTTTGATTT TAGCAATTGG CAATGGGTAA TTTCATTAGG TTCTATAGTG 144 0 

ATATTCTTCA TTTTGAATAT CTTATATGTC GCTAAAAAAG ACATyTAATA AAAATAATTT 1500 

TGAGGTTGGG AATTTTAAAT TTTCCCAACC TCAAAGTTTG TCTTATTGTA AATTTATTTA 1560 

TTTTCTAATT TATTTAGGAT GGAATTATAA ACTGCTTTCC AAAATGAAGC GTCAGTTTTA 1620 

TAGCGGTTTG ATATAACTAA GTGTGTTTCT TTTTCTAAAT CTGCATAGTC TGGATGATCT 16 80 

TTGCTCGGTA ATTTATCAGC ACX3AACATCA GTTACAAATT TTTGGACTTC ATTTGCTCTT 174 0 

GGTCCCCAAA CTGTTTCTTG TTCGAATTGA TCATTCAAGA ATACGAAGAT AGGAATTGCA 1800 

CGTGATTTAC CATTTGTTAA ATATTGATCG ATCAGTTTTG TATCATCATC TCTATGGAAC 18 6 0 

ACGCGTACTT CTAAATTTAA TGCTTCACTG ATGTGTTTTA GAATTGGGAG ATTCATCATT 1920 

40 GCATCTCCAC ACCAGTCTTC ACTAATTACT AATACTTTAG AATAATTCAT CTCTTTTATT 1980 

TTTTTGATGC GTGAATCATC TTCTGGTAAC TCAAATGATT GATAGATACT GAGAACGGTA 204 0 

TCTTGATTTG TCTTCATTCC ATCAATGTAT TCATTTAAGG GTTGGCTATT TTTGAAATAA 2100 

GTTTCTAAAT TTGTCATTGT AAAAACCTCC TTTAGCATTT ACAACATTAT ACCAATTTAT 2160 

AGTAATAAAA GGTAATGAAA TAAATTAATT GCAAATTCTT TGTTAATTTT TX3TTAAGGAT 2220 

GAAACGGGAA GCACCTTATG CTATATTTAA ATAAGTACAA AGAAAGGGTG ACATCAGTGC 22 80 

SO 

GTATTCAAAA TOGCTGGGTT GTGTTTATAT TATTTTTAAT CTGTTCTTTT GGTGTATTAA 2340 

TTGGTTTATA TCAATATCGT CATACGAAAA CTGTGGATTT GTCTAATCTT GAAATAAACG 2400 
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TTGATCGTTT TAAATTTTAT AACAGTAAAG CTCACCCTGA TCTTACCGTT AAAGTGAGAG 
AAAAGGATAA CATCGTTAAG GGGATAATAT TAGTAAGAGA TCAAAAGATA CATACTAATT 
TTGAtGGGGG AATTGGTTCG CCGATAAATA ACGCgATTGA AAATCTTGGa TTCgGATATA 
AAAGrACaAA AGTTGGcAAT GrtTkCtCAT CgGTAAAGTA TATTGATAGA GATAACCATT 
TAAAATTAAA CTTACTTTAT CAAGATTTAG AAATTAAACG TATTGAATTT TTTAGTAAAT 
AGCTTTAGGT CTTAAAGTTw TAAAAAACGA ATGAaTAATT TTATTGGGAT GAGTGAC 
(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2520 
2580 
2640 
2700 
2760 
2817 



60 
120 
180 
240 
300 
360 
420 
480 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 
TCTGTTAAAA TGATTTTTCT TTTAnAAAGG CCGnAAATCA ATGTTCGATT nTTATTTGCA 
TTATGGTCTC GATATTGGTA GAATATCAAA TGGTTAAATO AGAAAAACTT GGAGGTGCTC 
ACATGTCATC AATCGTAGTA GTTGGGACAC AATGGGGAGA CGAAGGAAAA GGAAAAATAA 
CGGATTTCTT GGcAGAACAG TCAGATGTTA TCGCGOGTTT TTCAGGTGGT AATAATGCAG 
GCCATACCAT TCAATTTGGC GGAGAAACAT ATAAATTACA TTTAGTACCA TCTGGTATCT 
TTTACAAAGA CAAATTAGCG GTAATCGGTA ACGGAGTCGT TGTTGATCCA GTTGCACTAT 
TGAAAGAATT AGACGGATTA AATGAACGTG GCATTCCTAC AAGTAATTTA CGTATATCTA 
ATCGTGCGCA AGTGATTTTA CCATATCACT TAGCACAAGA TGAATATGAA GAACGTTTAC 
gTGGtOACAA TAAGATTGGT ACAACTAAAA AAGGTATCGG TCCAGCATAT GTAGACAAAG '540 
^ TTCAACGTAT CGGTATTCGT ATGGCAGATT TACTTGAAAA AGAAACATTC GAAAGATTAT 600 

TAAAATCAAA CATTGAATAT AAACAAGCAT ATTTCAAAGQ TATGTTTAAC QAAACATGTC 660 
CATCArrTGA TGATATCTTT GAAGAATATT ATGCAGCAGG TCAACGTCTA AAAGAATTTG 720 
45 TAACAGACAC ATCAAAAATC TTAGACGATG CATTTGTAGC AGATGAAAAG GTACTTTTCG 780 

AAGGTGCGCA AGGTGTAATG TTAGATATCG ACCATGGTAC ATATCCATTC GTTACATCAA 84 0 

GTAATCCAAT TGCAGGTAAC GTTACTGTTG GTACAGGTGT AGGTCCTACA TTCGTTTCAA 900 
AGGTAATTGG TGTATGTAAA GCTTATACAT CACGTGTTGG TGATGGTCCA TTCCCTACTC 
AATTATTCGA TGAAGATGGA CATCATATTA GAGAGGTTCG TCGTCAATAC GGTACAACAA 
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1020 



SS 



1122 



EP 0 786 519 A2 





TAAGTGGTAT 


TACAGATTTA 


TCTATTAACT 


CAATCGATGT 


TTTAACAGGC 


CTAGACACAG 


1140 




TGAAAATCTG 


TACAGCTTAT 


GAATTAGACG 


GTAAAGAAAT 


TACTGAGTAC 


CCAGCAAACT 


1200 


5 


TAGATCAATT 


AAAACGTTGT 


AAACCAATCT 


TTGAAGAGTT 


ACCAGGTTGG 


ACAGAAGACG 


1260 




TAACAAATGT 


GCGTACTTTA 


GAAGAATTAC 


CTGAAAATGC 


ACGTAAATAT 


TTAGAGCGTA 


1320 


70 


TTTCAGAATT 


ATGTAATGTA 


CAAATTTCTA 


TCTTCTCAGT 


TGGTCCAGAT 


AGAGAACAAA 


1380 


CAAACCTATT 


AAAAGAATTG 


TGGTAGAACT 


TTATATAAGT 


CATACACAAT 


GATTATAAAT 


1440 




ACATGAGCCT 


TCTATCTTTA 


TTGGTAGGAG 


GCTTTTGTTA 


TGCTTGCTTC 


TGTATCGATT 


1500 


IS 


CGATTATTTA 


GATAAAAAAT 


ACTAACGTAA 


AGGCGATATT 


TGCTAGTCAT 


AATTTAGAAG 


1560 




rTTAgATGAt 


AtTtAACGAA AAtTAAgATG 


aziATACtTGA 


ATGGTAA 




1607 




(2) INFORr4ATION FOR SEQ ID NO: 297: 








20 
25 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 








TTAGAAGTAA 


GCACTTTAAT 


ATCTTTACCT 


ACCCATGTAC 


CAACACCTTC 


TTTAGGCTTT 


60 


30 


GGATTTTCAG 


CATGGTTATT 


TGATTTATTC 


ACCTGTTCAG 


ATCTATTTAC 


TTTATTACCT 


120 




TTATTCGGGT 


TCTCTAATAC 


ATCAAATTTT 


AATCTCGGTG 


AATAAAAAAG 


ATATATTAAA 


180 


35 


AATCCAACTA 


AAAATAATAC 


ACCGACGACT 


CTTATAATTA 


ACTTTTTCAT 


CAATCAACCA 


240 


CCTAAAAAGT 


ATTAATACTA 


TTGTAAAAAA 


CAACACATTA 


ATTAOCAAAT 


TTTCAACACT 


300 




GACATAACTG 


TGTCGTTTCG 


ATAAAACAAA 


ATCATCTTCA 


GGCATAATTT 


TAACGTCATA 


360 


40 


AAGATTTTCC 


CTACACTCTA 


TATCATAACC 


TATCTTTATG 


TTTTCAGGTT 


GAATTTCATT 


420 




ATTAAGATTA 


AAATATGTAT 


AAAAAAATGG 


ACAGTTAAGG 


TATCAAATGA 


ATACCATCAA 


480 




CTGTCCGACT 


ATTCTTCATC 


AAAAAACCTG 


ATAAAACAAA 


TTGCCTTATC 


AGATTAGTAT 


540 


45 


CATTTGTATA 


AGCATATTAA 


TGACCAAATG 


TTGCTTTAAT 


CAGTGATGTC 


GTTTCTCCAC 


600 




CTGGATATAA 


TACATATAAT 


AATAAGTAAA 


CTGCTACACC 


TGTAATTGCA 


GTACAGAACC 


660 




ATATAACTGA 


AGCGAATGGC 


CCGAATTTGC 


GGTGTACATT 


ATATTTATCT 


TTAAATGCAG 


720 


SO 


TAATAATTTG 


AACTAGGCCT 


AGAATACCAC 


CAATTGTTGC 


TAAATTAATG 


TGGAAAAATA 


780 




AGAAAATCGT 


ATAATATTTC 


TTAATTGATO 


CTGGTCCGCC 


AAATGCTGTA 


TTACCGATAA 


840 
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TAACATTTTT 
AAATTGCACT 
TTTTCATCCT 
CGAAAAAGAT 
GATCTGAATT 
TrGCTAACAC 
TAATCAACCA 
TTGGAATATT 
TTGGtTGCCA 
CAATTGCAAC 
TCCATGTTGT 
TAAGACCAAG 
TCATACCAAA 
TATTTTGTTT 
ATGCGCCACC 
ATAAGAAGGA 
AGTTACCTTG 
TAATATTTTG 
TAAGATTATG 
AAATTAACGA 
TAATCTATAT 
TAAGCTTCAA 
ATTAAGTTCT 
GTTGGATCCC 
GGCTATTAAT 
GCAAAAAGAA 
TTGGTGGAGC 
TATGTCATGG 
ATAGAGCCGT 



GTGTTTATTT 
AATGACAATA 
AACTTAATTA 
CACTAAATAA 
TTTCTTAAAT 
TaCGAATACG 
GATAAACATA 
TGCAAGTGCA 
ACmAAATACA 
CCATCCAATT 
ATGTCTTTTA 
TACGCCTGAT 
ACTTAATAGT 
ACTAGGCATA 
CATAATTAAA 
ATGATTTGTC 
AACAAGTCCC 
TGACmAAGTA 
ATATCTTAGA 
CAAGAGAAAA 
GTTAGAAAAT 
ATTTGAGTAA 
TTTTATGTCG 
ATCTTTTCAT 
TCTAATTCTC 
TTTAAAATGG 
CTTAGTTACC 
TGCGTTGATT 
TTCAGCTTTG 



ATTTCCCTTT 

CATGTCGTAC 

ATCTAGATCA 

TTTAGTGAAT 

GTTGTTAAAC 

ACACCTAAAT 

CTGACACGTG 

TATTCATCTT 

ACTAAAAACA 

AGTGGTGGTA 

GACCATATTG 

GGTATATTCA 

AATAAATTTT 

ATACGATCAA 

GTAGATCCAA 

ATTACAACTG 

ATTTTAATTA 

TGCTCTTTGC 

TTGCATAAAA 

TGTCAATTTT 

TTTAATAAGT 

ACGTGGATTA 

TCCACATACA 

ATCCTACAAT 

AGTGATGCGT 

TTAGGTGTCG 

AAAACCGGAT 

CCAGAATTCT 

TCTTTATTAA 



TCCAAATAAG 

TAATCGTTGG 

AAGTAAGTAA 

ATATAAACAT 

CTAATGCAAT 

TTATTAGTAA 

TACGTTTAAA 

TACGTTTAAT 

GCGCAATCGC 

CTGCTCCAGG 

AGTAATAAGA 

ATAAAAACAA 

GATCTGTAAT 

TATCTTGGTC 

ATAGCATTAA 

CTAGCCATGC 

TCTGTTGCAA 

TCATAATCCC 

AGACTAGGTT 

GTGACACAAA 

AGAATCATGC 

CATAATTATC 

ATACTTGTAA 

CAGGTCTATT 

TmTTATGA 

TAGCAACGTT 

CAGCTGATGG 

TTCCTATTGA 

TGGTCTTATG 



TCrCCAACCA 

TAAAATTGGA 

TGAAACAATC 

TTGTGTTGCC 

CCATCCTAAA 

TAAAGGTACT 

GCCCTTAACT 

AGCTAAGGcA 

TGTTAAACTA 

AAAACTCCCA 

CACATAACCT 

ACAAATTTCT 

TCTATTATTT 

GTAATAATTA 

TAAAATTTGA 

GCcCGCAAAA 

TTCTTTGAAG 

CCTCCITAAA 

TAATAAAATT 

TAACATTTAA 

ATCTAAAAGA 

CCAATAAAAA 

AATTAAATCA 

TATAGTATCA 

TGGGGTGTAT 

AATGATGACA 

TTGTGGTTCT 

TACGATTATT 

GTTAGTTATC 



ATGGCAATTA 

ACGCCCATAT 

ACAGCTAACA 

CATTTTGTTT 

TTTAATAAGG 

GGCAATAAAA 

GATGGTAACA 

TAAAAATGAA 

ATTTGTCCTT 

ATCACTGTGT 

ACAATCCCCA 

CCAACTAACA 

ACAGTTGGTC 

TTTAACGCAC 

GGTATTGATG 

GCCGGAATTA 

TTAACTCTGC 

TTTGTTCATA 

AAATTGTGAC 

TTTATTGCTA 

GATTAAT ATT 

AATCATTAOG 

TATTTCCTGC 

TCTCAAATCC 

AAATTGTTTG 

TTTGTACAAC 

TCTTGGCCAC 

GAC3TTAAGTC 

ACTGCATGGA 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
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TATTGCAAGC ATTAATCGGA GCTGCTGCTG TTATTTGGCA ACAAAACGAT TACGTTTTAG 2760 

CATTGCACTT TGGTATATCA TTAATCAGTT TCTCATCTGT ATTTTTAATA ACATTGATTA 2820 

5 

TTTTCTCTAT AGATCAAAAA TATGAAGCTG ACGAATTATA TATCAAAAAG CCATTAAGAC 2880 

GTTTAACATG GTTAATGGCA ATCATCATTT ATTGTGGTGT TTATACTGGT GCACTAGTGa 294 0 

GACATGCGGA TGCAAGTTTA GCATATGGTG GTTGGCCATT GCCATTACCA CGATCTTGTA 3000 

10 

CCACATTCAG GAACAAGATT GGGTTCAACT CACGCATCGT ATCAnGGTCn nTTAA 3 OSS 

(2) INFORMATION FOR SEQ ID NO: 2 98: 

,5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 
TTCTTATTTA AAGAAGTCAT TTTTAGAAAT TGTTGAGACT TTAAAAAATG ATCCGTATAA 60- 
AATAACACAA TCTTTTGAAA AATTAGAGCC TAAATATTTA GAGCGATACT CAAGAAGAAT 120 
TAACCATCAG CACAGGGTCG TCTATACCGT AGATGATCGA AATAAAGAAG TATTAATACT 180 
ATCGGCATGG TCACATTATG ATTAATGAAT ATTCAATATC TGAATAACTT TAATQATAAG 24 0 

30 

TTAATTAAAG AAACTAGTAT TTAAGTGTAG GGAAAATAGC GACGTTAATG CGTTGTTATC 300 
TCTACACTTT TTAATTTTAT AATAGCGCAA GACTAAACAG ATTGAAATTA GTAACAATAA 360 
AAGAATAACG TATTATAATA AGGAATTTTA AATTGTGACT TTTTCGGAAT ATTAAATTTT 420 

35 

AGAAATATGA GGTTTTTAAG CGGATTCCTC ACAAAATTTT AAAAATATTT AAGCCTGAAA 480 
ATGMAAAGC GGTAGGGAAC GTTTTTCTGA AaGTTAGTGA TACAATAGTT TTAAGTTGAA S40 

4^ ATACAGGAGG ATGAATAACA TGAATCAGTC AGTCAAATTA CTTAAACATT TAACAGATGT 600 

AAACGGCATT GCTGGTTATG AAATGCAAGT TAAAGAAGCA ATGCGTaACT ATATAGAGCC 660 
TGTCAGTGAT CaAATTATTG AAGATAACTT GGGTGGCATT TTTGGAAAGA AAAATGCTGA 720 

45 GAATGGTCAA TACTCAATTA TGGAnTTC 74 8 

(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


299: 








ACCTCCGAAT AATTGATTCC 


: ATTAAcrrrr 


' TTTGTGaAAA 


. AtTTAAACAG 


GCGAAGTCTT 


60 


5 


CAATAAGTGA AGATGTCCAA 


. ATTATAAAAA 


ATACATTCCA 


AAAAGAAAAG 


TTAGGTACAG 


120 




TAATTACtAC 


ycTGGCGCAA GTGGTGGTGT 


TACGTATAAA 


CCAATGATGA 


GTAAAGAAGA 


180 


10 


GGCGACTGAA 


. GTTGTTAATG 


AGGTCATTAC 


TCTATTAGAA 


GAGAAAGAAC 


GTTTGTTAcC 


240 




TGGCGGATAT 


TTAT'rrriAT 


CAGATTTGGT 


AGGTAATCCA 


TCGCTACTAA 


ACAAAGTTGG 


300 




TAAGTTAATT 


GCCAGTATTT 


ACATGGAAGA 


AAAATTAGAT 


GCTGTTGTTA 


CCATTGCGAC 


360 


IS 


AAAAGGTATT 


TCATTGGCAA 


ATGCGGTTGC 


TAATATTTTA 


AATTTACCAG 


TAGTAGTGAT 


420 




TAGAAAAGAC 


AACAAGGTGA 


CTGAAGGTTC 


TACAGTTTCA 


ATTAATTACG 


TTTCAGGATC 


480 




TTCAAGAAAA 


ATAGAAACAA 


TGGTACTTTC 


GAAGAGAACT 


TTAGCAGAAA 


ATTCAAATGT 


540 


20 


TTTAGTTGTC 


GATGATTTTA 


TGAGGGCTGG 


TGGCTCTATT 


AATGGTGTTA 


TGAATTTAAT 


600 




GAATGAGTTT 


AAAGCCCATG 


TAAAAGGGGT ATCAGTACTT 


GTAGAATCAA 


AAGAAGTTAA 


660 




ACAAAGATTG 


ATTGAAGATT 


ATACTTCCTT 


AGTGAAATTA 


TCTGATGTAG 


ATGAATATAA 


720 


25 


TCaAGAGTTT 


AACGTAGAAC 


CTGGCAACAG 


TTTATCTAAG 


TTTTCATAAA AGGAGTTTTA 


730 




GTATTATGAA 


AATCATTAAC 


ACAACAAGAT 


TACCGGAAGC 


ACTTGGACCA 


TATTCGCATG 


840 


30 


CAACAGTTGT 


GAATGGTATG 


GTTTATACTT 


CTGGTCAGAT 


TCCATTGAAT 


ATTGATGGAC 


900 


ATATCGTAAG 


CGCTGATGTT 


CAAGCACAGA 


CAAAACAAGT 


TTTAGAAAAT 


TTAAAGGTTG 


960 




TTTTGGAAGA 


AGCAGGATCT 


GATTTGAATT 


CTGTTGCGAA 


AGCGACCATT 


TTCATTAAAG 


1020 


35 


ATATGAATGA 


TTTCCAAAAA 


ATAAATGAAG 


TGTATGGTCA 


ATATTTTAAT 


GAACACAAGC 


1080 




CAGCGCGTAG 


TTGTGTAGAG 


GTTGCGCGTT 


TGCCAAAAGA 


TGTGAAAGTA 


GAAATTGAAT 


1140 




TACHVAGTAA 


AATTAAGGAA 


TTATAATTTT 


CGATTAATAT 


GTTTAATCAA 


GCTTCTAAAT 


1200 


40 


AAAACAGAGA 


GATATATACT 


ATAGGGGGGC 


TCACTACATG 


AAAGTGACAG 


ATGTAAGACT 


1260 




TAGAAAAATA 


CAAACAGATG 


GACGAATGAA AGCACTCGTT 


TCCATTACAT 


TAGATGAAGC 


1320 




TTTCGTAATT CATGATTTAC 


GTGTAATTGA AGGAAACTCT 


GGCTTGTTCG 


TTGCAATGCC 


1380 


45 


AAGTAAACGT 


ACACCAGATG 


GTGAATTCCG 


CGACATCGcg CATCCTATTA ATTCAGATAT 


1440 




GAGACAAGAA 


ATTCAAGATG 


CAGTGATGAA 


AGTATATGAT 


GAAACAGATG 


AAGTAGTACC 


1500 




AGATAAAAAC 


GCTACATCAG 


AAGATTCAGA 


AGAAGCTTAA 


TCAATTTTAT 


ATTTAGCGAT 


1560 


SO 


GTAATACATT 


TGCAATAAGT 


TGATTTGATA 


CTGTCGATAA 


AGCATAAAGC 


TTTGTCGGCA 


1620 




G'l-rrrrrrAG 


TTTGTATTAA 


"ixirrrrrrrA 


Tl-l-rrAATGA 


AAGGCTAATA 


AATATATACG 


1680 
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20 



25 



30 



35 



40 



45 



SO 



TGaTGCTCGT 
CAATGCCTTT 
AATTTAATGG 
CACAAGAATG 
CGAACATGTA 
ACATGGTGCT 
GGAACAACTC 
GGTACGACAA 
TTGATTGCGC 
CAACCATATG 
GAAGAGAAAG 
GCGTTTAATA 
GGTGAATATT 
GTCTATCGTA 
CAGGCTGAGA 
ATCATCGATC 
ATTGAACCAG 
CAGTACTCTG 
GTTAATGATG 
GGCGCGCAAT 
CTTAAAGATG 
CGTACTAATA 
AcTATCGTCG 
ATTGGTGATG 
AGTTTAGCTG 
ACGTATTTAA 
GATTTAATGA 
AAATAATTGG 
TTTCATTGAA 



ATTTTTGAAG 
TATATGTTGA 
GGGTTAATGT 
AAATCTAAAA 
TTGGAAAGTG 
GAAAGTGTAA 
GGTACTGCGC 
TCGTTGTATG 
ATCACGAGGA 
GATACGGAAG 
ATGCAACGCA 
ATAAAACGTT 
ACCTCCCTGa 
CCAATGATGT 
AGGCGATGCA 
CTGACAGCAC 
GCGTACGAAT 
AAATTAACAA 
CTAGCX5TAGG 
TAGGTGCAGA 
GTGCCAAGGT 
TTGGTTGCGG 
GCAAAGATTC 
ATGTATTGGT 
TGGCAAGAGC 
AATGGCTAGG 
TTCGCATAGT 
AGGACTATAA 
AGGAAACGAA 



TAAGAAAAAA GTTGTTTTTA AAATTACAAC GAATTAAAAA 1800 

AAGAGTATTG CaGATTAAAT TaTAATAATG ACGAaGgTAA 1860 

TCATGCGAAG ACACGCGATA ATTTTGGCAG CAGGTAAAGG 1920 

AGTATAAAGT GCTACACGAG GTTGCTGGGA AACCTATGGT 1980 

TGAAAGGCTC TGGTGTCGAT CAAGTTGTAA CCATCGTAGG 204 0 

AAGGACATTT AGGCGAGCGT TCTTTATACA GTTTTCAAGA 2100 

ATGCatGCAA ATGGCGAAAT CACACTTAGA AGACAAGGAA 2160 

TGGTGACACA CCGCTCATCA CAAAGGAAAC ATTAGTAACA 2220 

TGCTAATGCT CAAGCAACTG TATTATCTGC ATCGATTCAA 2280 

AATCGTTCGA AATGCGTCAG GTCGTTTAGA ACGCATAGTT 234 0 

AGCTGAAAAG GATATTAATG AAATTAGTTC AGGTATTTTT 24 00 

GTTTGAAAAA TTAACACAAG TGAAAAATGA TAATGCGCAA 2460 

TGTATTGTCG TTAATTTTAA ATGATGGCGG CATCX3TAGAA 2S20, 

TGAAGAAATC ATGGGTGTAA ATGATCGTGT AATGCTTAGT 2580 

ACGTCGTACG AATCATTATC ACATGCTAAA TGGTGTGACA 2640 

TTATATTGGT CCAGACGTTA CAATTGGTAG TGATACAGTC 2700 

TAATGGTCGT ACAGAAATTG GCX3AAGATGT TGTTATTGGT 2760 

TAGTACGATT GAAAATGGTG CATGTATTCA ACAGTCTGTT 2820 

AGCGAATACT AAGGTCGGAC CGTTTGCGCA ATTGAGACCA 2880 

TGTTAAGGTT GGAAATTTTG TAGAAATTAA AAAAGCAGAT 2940 

TTCACATTTA AGTTATATTG GCGATGCTGT AATTGGCGAA 3000 

AACGATTACA GTTAACTATG ATGGTGAAAA TAAATTTAAA 3060 

ATTTGTAGGT TGCAATGTTA ATTTAGTAGC ACCTGTAACA 3120 

GGCAGCTGGT TCCACAATCA CAGATGACGT ACCAAATGAC 3180 

AAGACAAACA ACAAAAGAAG GATATAGGAA ATAATCATTT 3240 

ATAAAAGGAT AATCCTATGT AATATTAATG TAATCTTTAT 3300 

AATGGAGTTA CATyTTATAT ATAATAGTAA TTGCXSTAAGT 3 360 

ATGTTAAATA ATGAATATAA GAATTCGTCA TTAAAGATTT 3420 

GCATTAGCGC AAGAAGTTGC TGACCAAGTA GGAATTGAAC 34 80 
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GTATTCGTGG TTGTGACGTA TTTATTATTC AACCAACATC 
TAATGGAATT ATTAATTATG ATTGATGCTT GTAAACGTGC 
TTGTAGTGCC ATATTATGGA TATGCAAGAC AAGATAGPJKA 
TCACTGCTAA ATTAGTTGCA AACTTAATCG AAAcAGctGG 
TAGACTTACA TGCACCACAA ATTCAAGGAT TCTTTGATAT 
GTGTGCCAAT TCTTGCTAAA CATTTCAAAG ATGATCCGAA 
TCGTTGTTCA CCAGACCATG GCGGsnTTAC ACGTGCACGT 
AACTCCAATT GCAATTATAG ATAAACGTCG TCCTAGACCA 
CATTGTTGGT GAGATTGAAG GACGTACGGC AATTATTATT 
AGGTACAATC ACTTTAGCTG CACAAGCATT AAAAGATAAA 
TTGTTGTACA CACCCTGTTT TATCAGGACC GGCTAAAGAA 
AAAAGAATTA ATCGTAACAA ACTCAATTCA TTTAGATGAA 

taaagaatta tctgttgctg gtttaatcgc acaagctatc 
atcagttagc gtattatttg actaatattt aaaaggcgtt 
tgtataatag tttcgttcgt gattatacga ataaataaac 
ttgatgggta agtgaggtgc tcgttttgag caaaaatgaa 
tcattaaagt caatcatccg tcaaggtaaa caaacacgtt 
aaatctggta aagtaccagc agtagtatac ggttacggta 
gttgatgaag tagaattcat caaagttatc cgtgaagtag 
ttaggcgttg gttctaaaac tatcaaagtt atggttgc 
(2) Information for seq id no: 300: 
(i) sequence characteristics: 

(A) LENGTH: 3181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 
AAAATGCATA TAAATACATA TTAAGGAGGA TTTTATGAAT TTTCTTAAAC CTGCAAAGCA 
TATTAAGCCT TTGCCAGAAA ATCAGATAGA TGATACCTAT AAACGATTAC GTCTCCAAGT 
ATTTCTTGGT ATTTTCATCG GTTACGCTGG GTACTATTTA TTACGTAAAA ACTTTTCGTT 



ATATCCTGTG 
TTCTGCAGCA 
AGCCCGTAGC 
CGCAACTCGT 
TCCAATTGAC 
TATTAACCCA 
AAATTAGCTG 
AATGTTGCTG 
GACGATATTA 
GGTGCTAAAG 
CGTATCGAAA 
GATCGCAAAC 
ATTCGTGTAT 
TGACGAACAT 
ACTTGCAAGC 
AGGTGGAAAT 
CAGATCTTAA 
CTAAAAACGT 
GTCGTAACGG 



AATCTACATT 

ACAATCAATA 

CGTGAGCCAA 

ATGATTGCGT 

CACTTAATGG 

GAAGAATGTG 

ACATTTTAAA 

AAGTGATGAA 

TTGATACAGC 

AAGTATATGC 

ATTCTGCTAT 

CATCTAACAC 

ACGAAAGAGA 

ATTCCAAACG 

AACGATGATG 

GAGAATGGCT 

ACAATTAAGA 

GTCAGTTAAA 

TGTTATCGAA 



3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4630 

4718 



60 
120 
180 
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TGCTGTTTCC ATCGCATATG GATTTAGTAA GTTCTTTATG GGTACTGTAA gTGATCGGAG 300 

CAATGCTCGG ATATTCTTAG TTCTTGGATT AGCACTCACT GCTATCGTCA ATTTGTTAAT 360 

GGGATTTGTA CCGTTCTTTA CATCAGGTAT CGGTATTATG TTTGTCCTAT TATTCTTAAA 420 

TGGATGGTTT CAAGGTATGG GCTGGCCACC TTCAGGCCGT GTTCTCGTTC ACTGGTTTAG 4 80 

TGTAAGTGAA CGCGGAAGTA AGACTGCCCT TTGGAACGTT GCGCATAATG TTGGTGGAGG 540 

TATTATGGCA CCTATTGCTG CTTGGGGTAT TACAACAACT GCATTTATCA ACTTTGGTTA 600 

TTTAAAAGGT TTCGAAGGTG TATTCATTTA CCCTGCACTC TTAGCACTTA TCATTGCCGC 66 0 

AATTTCATAC GTATTGATTA GAGACACACC TCAATCTCAA GGTTTACCTC CAATCGAAAT 720 

TTATAAAAAT GACTTTGCTA CAAGCGATAA GAAAACATTA GAAACAGAAT TAACTACAAA 780 

AGAAATTTTA TTTAAATATG TACTGAACAA TAAATGGGTA TGGGCAATTG CCTTTGCAAA 84 0 

TATATTTGTT TATTTCGTGC GTTATGGTGT ACTTGATTGG GCGCCAGTCT ACTTAAGTGA 900 

AGAAAAACAT TTCGACTTAA AAGCATCAGG TTGGGCATAC TTCTTATACG AATGGGCTGG 96 0 

AATTCCTGGT ACATTATTAT GTGGTTACAT TTCTGATAAA TTATTCAAAG GTCGTCGTGG 1020 

2S ACCTGCAGGT TTCTTCTTTA TGTTAGGTGT CACAGTATTT GTATTAATTT ATTGGTTAAA 1080 

TCCTCCAGGC AATGCTTGGT TAGACAATGT CTCATTAATT GCCATTGGTT TCTTAATATA 1140 

TGGACCAGTT ATGTTAATTG GTTTACAAGC ATTAGATTAT GTACCTAAAA AAGCAGCTGG 1200 

CACAgcAGCT GGATTAACAG GATTATTTGG TTATCTGTTT GGTGCT G TAA TGGCCAACAT 1260 

CGTCTTAGGT GCTGTAGTTG ATAAATTCGG ATGGGATGTC GGTTTTATTT TATTAACAGC 1320 

AATTAGTGTG TTTGCAATGT TGAGCTTTAT CCTCACTTGG AATAAAGTAG GACAAGAAAC 13 80 

CXSTTCATCAT TAAATGATAA AAAATAAAGT CATATGGTTA TCTTATCGAA AGATGATATA 1440 

TTCfTCTCTT ATAAGTTCAA CCATATGACT TTTTATTAGT ATTCAAAAAA ATATTTACAT ISOO 

TGCCACTTTT GTGTTTGCCC TGCTGTTTTA TTCAATTGAT TACACCACTT AGGATAAACT 1560 

CTAAAAGCCA TTTTCCCTTG ATACTTAGAT GAAGCTAAGA TACCTTTTTC TACCAATAAT 1620 

TCCCTAGGAA ATAGAAAGTA GCCATTTAAT TCATCATCGA TAACAGCAAC AATCAGGTAA 1680 

TCAGCAAACT CTTCAATTTT GTATGGCCGA TTATAATTAT CtTCGTCTTT TGTCCAACAT 174 0 

GTCACGAAAT ATCCCGATTT AGTCGGTGTT TTCTTAGCTA ACCTACTTTG ATATGTTTCC 1800 

TCTTTAAAGC TAAATGTTAA TGCyTCGTAA TCTTGATTAT ATTTTTCTTC AGTTAAATCT 1860 

SO TTAACTTCTG ATTCTTCACT AAAAATATTT TTCAACAGTA TTTTAGATTT ACACATATGC 1920 

CAATCTCACT TTATITTTTC ATAATCGTAT CATATATTTA TTTTTTTCGA AAAATACACT 1980 
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TGTATCAACT GTCAAACGAT CACGTAATAA ATAGACGATT AACATTGcTG CTAAAGCACA 
AAGTGACTCA GCAATTAATA ATGACCAAAT GACACCTGTT AGTCCAAACA AAGCATTCAT 
^ AATAAATAAT ACTGGAATX3A TAATTGCACC TTGTAAAATG GCCATAATTG TAGCACCACG 

ACCTTGCCCA GTCGCTTGAA GCATACCAGT AAACAAGAAA CCTATACCAT TTAATAATAA 
TGATGCCATT GTTACTTTCA AAATAAATGt CGCCATCTCA ACAATGGCTT GATCAGTAGT 

10 

AAATAGTCCG ACCATATGAT GTCCAATTCT AAATACAGCA CTCATACATA CAACAAAGAT 
AACGCCGATA GACATGATAA CTGCTTTGAT AACGTCTTTC ATACGGCCTT TATTTGCCAT 
IS AAAGTTATAT GCAATTAGTG GTACAACACC TTCACATAAT CCCATGATAA TAAGTTCTGG 

aAATTGCACA AGTCTAAATG AGATACCATA ACTTGCaATC GCGAAGTTTC CATAATGTCC 
TAAAAATAAA TTTAAAACTA ATCCTGTGAA TCCCATTAAG ATACTCATTA AAAATGCAGG 
20 AATACCGATT TTAAAGATTT CAGAAAGCAT TTCTTTATTA GGTTTCGCAA GTTTAATATT 

AACTGACACA ACGTCACTAT TTTTCATAAA ATAAATGATA AAGAACAGAG CAGCAGCAAC 
ATTACTGATT GCAGTACCCA AAGCTGcACC AACAACGTTT aAATCAAAAC CAAAAATTAA 
AATTGGATCT AAAATAATAT TTAAGCCTAC ACTAGCTAAC ATACCAATCA TAGAAACCAT 
TGGTGCCCCA ATTGCACGTG CAAATTGTTC TAATATGAAG AACAAAATTA CAAAAGGTGC 
ACTTAAAAAC ATTACTTTCA AATAATTACT TGTTAAAGCT AACGTTTCAC CTCTCGCCCC 
TAAAATTGCT GCGATTTGAT CACTGAATGG TAAAGTAACT AAAATCACGA TAAGTCCTAG 
TGCAATACCA CCATAAATAG AGAAACTACT TACAAATTTA CTCTTAmTAT AGTCTTTCGC 
ACCTAATAAA CGTGAAATAT AAGTTCCTGC ACCAACGCCA AATAAATTAC CTAACCCCAT 
T 

(2) Information for seq id no: 301 : 



25 



30 



35 



40 



45 



so 



{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 
- (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

TCATCACCTT CAAATAAATA GGCATGCGAT AATTTATTTG AATGATATGC ATTCGTCAAT 60 

TGTTGCTGTT CATCCATTCA TTACAAACTC CCTTTTGCTT TTATATAAAA AGGTTGCCAA 120 

AGAGCAACTA ATTACTGCAA CAATAGOGAC CATGATTTGT GATTATTTTT AGAACCACAA 180 
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TTATTAATAC 


ACTATTTTAA 


GTATCTCTAA 


TTGATGATTG 


TATTATAGAA 


TTAAAATTGA 


300 




1 \j1»AA 1 vjC at 


CAACTGGCAT 


AACAAATACA 


GTAGCACCGC 


CAACTTCAAC 


TTCAACTGGA 


360 


5 


1 ft 1 vaOAA 1 Vj i 


ACG AATC CG C 


ACTGCCTCCC 


ATAGGTGTAA 


TAGGTGAAAC 


CAACTGTTCT 


420 




L. lAl 1 ACl-AC 


ACGTTTGATT 


AATCACAGAC 


AATATTTCAT 


CTACACGGTC 


ATCATTGACA 


480 


10 


u uAuATAAG A 


ATGTTGTATT 


ACCCGCTCTT 


AAAAACCCAC 


CTGTTGTTGC 


CAATTTTGTT 


540 


G CTCT AAAGT 


TATTTTTAAC 


AAGTTGATCT 


GCAAGTTCCT 


GACTATCTTG 


ATCTTGTACG 


600 




AT CGCT ATAA 


TCATTTTCAT 


TTTATAACAC 


CTCTTCTAAC 


AATTATATCA 


TATCTTTTCT 


660 


IS 


AAATATTTGA 


TGATAGTTTG 


ATACGTGTCT 


TCAACAACAT 


TTTCAAGAGG 


TTGATCTGCA 


720 




TTAAOGCTTT 


TGAACCGTTG 


TGATTCATTA 


TGAATGATTT 


CTTGGTAACC 


TTCAATTACT 


780 




TTTTCGTGAA 


cTTTAAATCT 


TCTTGATCTA 


ATCTATTTTG 


ATCTCTTGAA 


TTTTTAATAA 


640 


20 


TACGTTCGCG 


ACCTACTTCA 


GCACTAACAT 


TTAAATAAAT 


CGTCAAGTCT 


GGaTATAATC 


900 




CATTTATTGC 


AAATTCGTTT 


AATGCTCTTA 


CTTCTTCAAC 


GCCAATCCCT 


CTAGCATAAC 


960 




CTTGATAAGC 


TAATGAACTA 


TCGATATAGC 


GATCACACAA 


CACAACCTTA 


CCTTCTTTTA 


1020 


25 


AAGCTGGTAT 


GACCTTTAAT 


ACAAGATGTT 


CTCTTCTAGA 


TGCAGCAAAT 


AACATTGcTT 


1080 




CaGTtCTAAT 


GTCCATATCA 


TTGCCcTTCT 


AATACAATTT 


TACGTATTTC 


TTCACCAGTA 


1140 




GGAACACCAC 


CTGGTTCTCT 


AGTCATAATG 


ACATCATAAT 


CTTTTACTAA 


TCTATGGTAA 


1200 


30 


ACTTCATTAA 


TTACAGTTGT 


TTTTCCAGAG 


CCTTCTGGGC 


CCTCAAAAGT 


TATAAAAGCT 


1260 




GACATTTTAT 


TCATCCTCAA 


CTAAAATTTT 


ATTA'iTrrrA 


ATTCCTTCAA 


CTATCATTCC 


1320 


35 


AGTTTCCAGA 


TATTCATTTA 


CCAATTCTAT 


CATATTTTCA 


GTTATTGTTT 


CTCCTTTGAA 


1380 


AATAATAGGA ATcCCTGGcG 


GATACGGGAC 


AATATGTCGC 


GCCAGAACTT 


TACCTTTgCC 


1440 




TTTTCAAATC ACACCAAGTA 


ACATATTCAA 


AGCGTTTTGG 


TTTATAGTTA 


CCTTCAGTyG 


1500 


40 


TTAAAAGTTG 


TGTTTGTTTA 


ACTTTAGAAA 


CTGAiTrrrr 


CGGTAAAATC 


ATATCTTCAA 


1560 




TTTTACGTAA 


AAGCGAATCA 




TATCATCATG 


ATGCCATAAC 


GGCAATATCG 


1620 




CTAATGCTTG 


ATAGTCGTCC 


GCTAATTCTA 


AATAGATATG 


TGCATTCATA 


AACCAATTTT 


1680 


45 


GAATATCATG 


ACCTGTAAAA 


CCTTCATATT 


TTATCAGCAA 


CTTCAACGGA 


TCATCAACTT 


1740 




GAAGCATTTC 


AAAACCCTTC 


TTCTCCAAAC 


ATTCGATTAA 


TTGCGCTCTC 


TTATCAAAAA 


1800 




ACACGGTACT 


ATCATATGTT 


TTATAGAACT 


CGGCAGCTGA 


CTCTAAACTA 


GCCATAATCA 


1860 


SO 


AATACGAAGG 


ACTAGATGTT 


TGGAAGTAGC 


TTAGATATTC 


TATAATAGTT 


TCrCTATAAG 


1920 




GTGCATTTTT 


ATGAATATAA 


AGTACCX5AGC 


CCATCGTTAA 


AGCTGGTAAC 


GTrrrATGAA 


1980 



>4SDOCtO: <EP 07865 19A2.I_> 



1131 





EP0 786 519 A2 






C^CGTGTGCT TCGTCAATGA GTACAGGAAT ATTTAATTGG TGCAAAGATT 


_ 

2100 




TGATAACCTC TTC^TAfATTa iaT<#^'TWTWT*/^*/^ 

i^^>*AMMv.^iC iH^iACATTA AATGTTTCAC CGTAATAGTT AGGATAAGTC AACACAGCAA 


2160 


5 


vjni^A^^AUU ^^n^TTATTC AAACGGCTTA AATTAACTTT ATTATAATGA TTCGTTAACG 


2220 




J " f " I 'Cllk'VCl f^f^TT^'I'/^iv T> »T»?\ K It TL m--i -I- 1- till . 

LrAi-HiLrATG CGTTTCAATA AAATGCCCTT CTTGTTGGCT AATATCGAGC GCATGTAACA 


2280 


10 


uM^AiiiAiL, lACATTTCTT GCCATTAAGA TATCGCCTTT TTTCTGTGAA AAAGACTGGA 


2340 




iv^ML^UAiAA TATTCCTGAA GTGGTGCCAT TCACTAAGAA ATAAGCATCA TAATCTGAAT 


2400 




Uiiit-iCCAC CTGCTTCATA CTTTCCAAAA TGACTTCTTC AGGATGATGT AAATCAnCTn 


2460 


IS 


AArccAGGTA TTTCAGTTTT ATCCATTGTC ATTGATAATT GAGATAAATG ACCGATAGTC 


2520 




AiAirTTTAT GACCCGGAAC ATGCAAAGAA ATCGCTTCTT CTTGATTTAA ACTTTCTAAT 


2580 




iiArTTAAAA TAGGTTGCTT CATGATATAC GCTTCCTTTA TTTACACTGT TTTGGAATTA 


2640 


20 


t^ITACTTTCA AAAGTATTAA TTATATAGTA ACACTTCTTT GACAAAAGTT AGTGTTACIT 


2700 




ATGCAATAGC TTGTCTATTG TATAATAATT AATTTCCTTT TTGTACTTCG ATTTAAAAGA 


2760 




iArxAGACAT AAAATCTAAA AACAGCAGTA AGATGATTTA TGATTAAAAA CTATCTTACT 


2820 


25 


ut-ruTTCACT TTTTATAATA CTTCTGAATG TCTTCACTTA TACTTCTAGT CACAGATTTA 


2880 




AAiAAiC-AAA AGTGCACATT ATTAAAATAT CAATTTCACA CTCAATGCGG CTCATCGCAT 


2940 


30 


TCATTTCTTG TCTAGCAACG TTCTACTCTA GCGGAACGTA AGTTAGCTAC CATCCTCGCT 


3000 


AAUAACCTTT CTTGACTTGT GACAATCGCT TGCTTCTTTC CTCTCCTTCG GCTCTCGCTT 


3060 




AeiCATTTAG CTCTACTAAA CTCGTTGCGC TCTTTTCTCG TTTCGTCAGA TTCAAACGTT 


3120 


35 


iiUAL-irt-GC CAAGCCATTT TTCTTTGTGT TTACTTTTTA TTTTGACGTT TTAGACATAA 


3180 


AAAAAUAGAC CTCACGGTCT CAACTTGCCT GGCAACGTTC TACTCTAGCG GAACGTAAGT 


3240 




i^CTACCAT CGTCGCTAAA GACCTTTCTT GACTTGTGAC AATCGCTTGC TTCTTTCCTC 


3300 


40 


luui-rtiGeCT CTCGCTTACT CATTTAGCTC TACTAAACTC GTTGCGCTCT TTTCTCGTTT 


3360 




t-ijit-AUAix:c AAACGTTTTC ACTTCGCCAA GCCATTTTTC TTTGTGTTTA CTTTTTATTT 


3420 




AOAUoimA ^rUATAAAAA AAAGAGACcT TGCGGTCTCA ATGCGGCTCA TCGCATCCAT 


3480 


45 


iiiiiGCCTG GCAACGTTCT ACTCTAGCGG AACGTAAGTT GGCTACCATC GwCGCTAAgA 


3540 




aCCTTTCTTG ACTTGTGACA ATCXSCTTGCT TCTTTCCTCT yCTTCGGCTC TCGCTTACTC 


3600 




ATTTAGCTCT ACTAAACTCG TTGCGCTCTT TTCTCGTTTC GTCAGATTCA AACGTriTCA 


3660 


50 


CTTCGCCAAG CCATTTTTCT TTGTGTTTGC TTTTTATTTT GACGTTTTAG ACATAAAAAA 


3720 




AAGAGACCTT GCGGTCTCAA TGCGGCTCAT CGCATCCATT TTTTGCCTGG CAACGTTCTA 


3780 
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CATGGGAACA GGTGTGACCT CCTTGCTATA GTCACCAGAC ATATGAATGT AATTTATACA 3 900 

TTCAAAACTA GATAGTAAGT gAAAAGTGGA TTTTGCTTCG CAAAACATTT ATTTTGGATT 3960 

AAGTCTTCGA TCGATTAGTG ATTCGTGCAG CTCCACATGT GCACCATGCT TGCCACCTCG 4 020 

GAACCTATT 4 029 



(2) INFORMATION FOR SEQ ID NO: 302: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

IS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 



20 


GATGCAATAG 


TTGAGTAGTT 


ATAAGTAGCT 


ACATACAACC 


GCTCAAATAT 


AGGTTCAAGA 


60 




ACATTTTTAA 


TGAAGAAAGC 


GATTTTCCCm 


CAGAGTGCAA 


ACGCTTGACT 


CAATAACGAA 


120 




ATGTCACCTA 


TGTATGGCGT 


GACTTATTAT 


ACAGGaGGTG 


CAAAGTATGT 


TTGCTATTAT 


180 


25 


TGAAACAGGT 


GGAAAACAAA 


TCAAAGTAGA 


AGAAGGTCAA 


GAAATCTTCG 


TTGAAAAATT 


240 




AGACGTAAAC 


GAAGGAGATA 


CTTTTACATT 


TGATAAAGTA 


TTATTTGTAG 


GTGGAGATTC 


300 




AGTTAAAGTT 


GGAGCGCCAA 


CAGTTGAAGG 


TGCAACAGTT 


ACTGCTACTG 


TTAATAAACA 


360 


30 


AGGTCGCGGT 


AAAAAAATCA 


CTGTATTCAC 


ATACAAACGT 


CGTAAAAATT 


CAAAACGTAA 


420 




AAAAGGCCAT 


CGTCAACCAT 


ACACTAAATT 


AACAATCGAT 


AAAATCAACG 


CGTAATTATT 


480 


35 


ATGATTACTG 


TTGATATTAC 


AGTTAATGAT 


GAAGGCAAAG 


TAACAGACGT 


TATTATGGAT 


540 


GGCCATGCTG 


aCCATGGTGa ATATGGTCAT GATATCGTTT GTGCTGGAGC TTCAgCTGTA 


600 




TTCFrrGGTA 


GTGTTAATGC 


GATTATAGGA 


TTGACATCTG 


AGAGACCAGA 


TATCAATTAT 


660 


40 


GACGACAATG 


GTGGTCATTT 


TCATATAAGA 


AGCGTTGATA 


CAAACAACGA 


TGAAGCGCAA 


720 




CTAATTCTTC 


AAACAATGCT 


TGTGTCTTTA 


CAAACTATTG 


AAGAAGAATA 


TAATGAGAAT 


780 




ATTAGATTAA 


ATTATAAGTG 


AGGTGCATTC 


CGATGTTAAA 


ATTAAACTTA 


CAATTCTTCG 


840 


45 


CATCTAAAAA 


AGGGGTAAGT 


TCTACAAAAA 


ACGGACGTGA 


CTCTGAATCA 


AAACGCTTAG 


900 




GTGCTAAACG 


TGCTGACGGT 


CAATTCGTAA 


CAGGTGGTTC 


AATTTTATAT 


CGCCAACGTG 


960 




GTACTAAAAT 


TTACCCTGGT 


GAAAATGTAG 


GTCGTGGTGG 


CGATGATACA 


TTATTCGCTA 


1020 


50 


AAATCGACGG 


CGTTGTTAAA 


TTCGAACGTA 


AAGGTCGCGA 


CAAAAAACAA 


GTTTCTGTAT 


1080 




ATGCAGTAGC 


TGAATAATTT 


TGTCTAGTTA 


ACACCAGAAG 


TGAATCTTCT 


GGTGTTtTTT 


1140 
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IS 



20 



25 



30 



35 



40 



45 



SO 



AGACGTTATA 
AAAGGAATAC 
TAAGTATTTT 
ATATATAAGG 
GGTGAGATAT 
TGGTATTACC 
CGGTGGTAAA 
TTTTAGATAT 
GCATGGTAAA 
TGTTGAAACA 
GAAGGGCGGT 
TGACTTCAGT 
ATTAGCTGAT 
CGTTTCAAAA 
AGGTGTTGTT 
TGAAGGTGCA 
AAAAGTTATT 
TTATAAAGTC 
AATCX3TAGTA 
AGAAGAAATT 
TGATCAATTA 
AGTTGAAGAA 
AGATAAATTT 
TGAAAGAATG 
TCAAATGCGT 
TATCGTTAGA 
CAATAAAGAT 
TAAAACATTG 
CGTTAAACAG 



CTAAATGTGC 
AGTTGCAGCG 
AAAGCAAAGA 
GGCTCTAATA 
ATGTTTGTCG 
GCATACAGAA 
GGTGCTTCAG 
CAACGTCATT 
AATGCXX3AAG 
GACGAAgTGT 
CGAGGTGGCC 
GAAAAAGGTG 
GTAGGATTAG 
GCTAAGCCTA 
TCAACGCCTG 
TCTGATGGCG 
GTTCACATGA 
ATTAATCAAG 
GCTAACAAGA 
GGCGAAGATG 
TTATATGCAA 
GAGGAGTCAG 
ACAATTTCAA 
TTTAAAATGA 
TCGATGGGTA 
ATTCTTGGCG 
TATAAAAAGT 
AAGATTAAAG 
TTTGATCTAT 



AcTGTATAAG 
TTTAAAGAAT 
TTTATAAAAG 
TGCTATAATT 
ATCAAGTCAA 
GAGAAAAATA 
TCGTATTTGA 
TTAAAGCAAG 
ATTTAGTATT 
TAGCAGATCT 
GAGGTAATTC 
AACCAGGTGA 
TAGGTTTCCC 
AAATTGGGGC 
ATCAACGTAG 
TTGGATTAGG 
TTGATATGAG 
AATTAGCTGC 
TGGATTTACC 
TGCCAGTTAT 
TAGCAGATAA 
TTGGCATTAA 
GAGATGATGA 
CTGACTTTAA 
TTGATGATGC 
GAGAATTTGA 
TTTATTTAAT 
ATGCCTTAAA 
CTAGAAGTGC 



AGCCCCTAAT 
AAACTGTACC 
TAACTGCATA 
ATTGGGAAAA 
AATATCTCTT 
TGTACCATTT 
AGTGGATGAA 
CAAAGGTGAA 
AAAAGTTCCA 
TGTTGAAGAT 
ACGTTTTGCA 
GGAATTAGAT 
TAGTGTGGGT 
ATATCATTTT 
TTTTGTTATG 
ACATCAATTT 
CGGTTCTGAA 
GTACGAGCAA 
TGAATCACAA 
TCCAGTTTCA 
ATTAGAAGAA 
CCGAGTATTA 
TGGTGCTTAT 
CAGTGATCCA 
GCTTAGAGAA 
ATTCGTTGAA 
TAGAGAAGAT 
AAGTGATCCG 
TTTTTATAAA 



CACTAAACTA 
ACAATTGGTG 
AGAGCCCCTA 
TGAAAATTAT 
AAAGCCGGTG 
GGTGGACCAG 
GGTTTAAGAA 
AATGGCCAAA 
CCTGGTACAA 
GGTCAAAGAG 
ACACCTAGAA 
GTATCTTTAG 
AAATCGACTT 
ACAACGATTA 
GCAGATTTAC 
TTAAGACATG 
GGTAGAGAAC 
CGTTTAGAAG 
GATAATTTAA 
ACAATAACGC 
TATAAAGATG 
TATAAACATA 
GTGGTAAGTG 
GCAGTACGTC 
CGTGGTTGTA 
TAGGAGCGAA 
GTCTTGCCTG 
ACATTGTCCA 
TATAGAGAAA 



TAAGGGGGAC 
CTGAGAAATA 
ATTATTTACA 
ATGTAAAAGA 
ATGGTGGTAA 
CTGGCGGTGA 
CGTTATTAGA 
GTAGTAATAT 
TTATTAAAAA 
CTGTAGTAGC 
ACCCTGCACC 
AATTGAAATT 
TATTATCTAT 
AACCAAATCT 
CAGGTTTAAT 
TAGAGAGAAC 
CTATTGAAGA 
ATAGACCTCA 
ACTTGTTTAA 
GTGATAATAT 
TTGACTTCAC 
CACCGTCACA 
GTAATGCTAT 
GATTTGCTCG 
AAAATGGTGA 
ACATGATGGA 
AATCCGTGGT 
TTTATGATGC 
CGATATTTCC 



1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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TGTTGGTATG TTGGCACGTG TACTAGATGT TATATCAAAG TTAGAACTAT CTGTATTAAC 3060 

GATTCATCAA AGTATTCCAA TGGAAGAAAA AiSCAACAATA ACATTATCAC TGAATGCTAA 3120 

ATCTAAAGAA ACTTCAGTAG AAGATGTTAT TGGCGCTTTG AGAAATTTAG ATTATGTATC 3180 

AAAAGTAGAA TTAATTAGTA TGAGTATGTA AGGATGTGCC TATAATGTAC GCGTATGTCA 3240 

AAGGTAAGTT AACACATTTA TATCCTACAC ACGTAGTTGT TGAAACTGCT GGTGTTGGTT 33 00 

ATGAAATTCA AACACCAAAT TCTTATCGTT TTCAAAAGCA TCTAGATCAT GAAGTTTTAA 3360 

TTCATACATC TTTAATTGTT CGTGAAGATG CACAATTATT GTATGGATTT AGTAGTGAAG 3420 

AAGAGAAAGA TATGTTCTTG AGTTTAATTA AAGTTACTGG TATTGGTCCG AAATCAGCTT 3430 

TAGCTATTTT AGCGACAAGT ACGCCTAATG AAGTAAAACG TGCCATTGAA AATGAAAATG 3540 

ATACGTATTT AACTAAATTC CCAGGAATTG GTAAGAAAAC GGCAAGACAG ATTGTCTTAG 3600 

20 ATTTAAAAGG TAAAGTGAAA ATTACTGAAG AAGATAGCGA TTCATTATTA CAAGTAGACG 3660 

CTACTTCGAC GGTGCAAGAT CAATTCGTGC AAGAAGCAAT GTTAGCGTTA GAAGCATTAG 3720 

GTTATTCTAA ACGAGAGCTT GCAAAAGTTG AGAAAACGTT AAATAAAAAT AAATATGACT 3780 

CAGTTGATGA AGCTGTTAAG GCAGGTCTTC AATTAGTTGT ATCTTAAOTT TAAATAGATT 384 0 

AATAGGGGAA GTGTTGTCAT GAATGAGCGT ATGGTTGATC AATCAATGCA TAGTGAAGAA 3900 

ACTGATTTCG AATTGTCGCT TAGACCTACG AGATTACGAC AATATATTGG TCAAAATTCA 3960 

ATAAAAAGTA ATTTAGAAGT ATTTATTAAA GCGGCTAAAC TTCGTCATGA ACCATTAGAT 4 020 

CATGTATTGC TTTTTGGCCC CCCTGGATTA GGTAAGACAA CATTATCTAA TATCATTGCC 4080 

AATGAAATGG AAGTTAATAT ACGTACAGTA TCAGGGCCTT CATTAGAAAG ACCTGGTGAT 4140 

TTGGCTGCAA TTTTATCAGG ACTTCAACCT GGAGATGTTT TGTTTATTGA TGAAATACAC 4200 

AGACTGAGTA GTGTTGTTGA AGAAGTGTTA TACCCTGCAA TGGAAGATTT CTTTTTAGAT 4260 

ATTATCATTG GTAAAGGCGA TGAGGCTAGA AGTATCCGTA TCGACTTACC TCCATTCACT 4320 

TTGGTAGGTG CAACAACGCG AGCTGGCAGC TTAACAGGTC CACTAAGGGA TCGATTTGGT 4380 

GTGCACTTAA GATTAGAATA TTATAACGAA TCAGATTTAA AAGAAATCAT TATTAGAACA 4440 

45 GCTGAGGTTT TAGGCACAGG TATTGATGAA GAAAGTGCCA TTGAACTTGC TAAACGTTCT 4500 

AGAGGGACTC CAAGAGTAGC AAATCGACTA TTGAAGCGGG TAAGAGACTT CCAGCAAGTG 4560 

AATGAAGATG AACAAATATA CATTGAAACA ACGAAGCACG CATTAGGTTT ACTTCAAGTT 4620 

GATCAACACG GACTAGATTA CATTGATCAT AAAATGATGA ACTGTATTAT TAAGCAGTAT 4680 

AATGGCGGAC CTGTTGGTTT AGATACGATT GCCGTAACAA TTGGTGAAGA AOGTATTACA 4740 



55 



30 



35 



40 
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GGCAGAAAAG 


CAACACCATT 


AGCTTATGAA 


CATTTTGCAA 


AGTCGAATRA 




A □ f r\ 
4 tio 0 


5 


TAACGTGAAT 


ATTGAAGAAT 


TTGACTATGA 


CTTACCAGAA 


TCATTAATTG 


(TTCLAAACGC 






TTTAAAAGAT 


CGTGATCATA 


GTCGTTTATT 


AGTCATGGAT 


AGAGAAACTG 


GTGAAATGAA 






ACATTTACAT 


TTCAAAGATA 


TCATTGAGTA 


TTTTAGACCT 


GGTGATACAT 


TAGTGCTTAA 




10 


CGATACGCGA 


GTAATGCCAG 


CTAGACTTTT 


TGGTTTAAAA 


GAAGAAACTG 


GTGCAAAAGT 


D 1 U U 




TGAAATGTTA 


ATGTtAACTC 


AAATTGAAGG 


TAATGATTGG 


GAAGTCTTAC 


TG A AArr* A o r* 


^ Xo U 




TAAGCGTATT 


AAAGTTGGTA 


ATAAATTGAA 


TTTTGGTAAT 


GGCAAAATTA 


T An f'Xf; A A Tn 

X MVJr W X W AaM X VJ 






CATAAAAGAA 


ATGGATCAAG 


GTGGACGCAT 


CATGCGTTTA 


CATTATGAAG 


O X A X X X X 






AGAAAGATTA 


GATGAATTAG 


GGGAAATGCC 


ACTGCCACCA 


TACATCAAAG 


AA PHTTT An A 


3 J 4 U 




TGATCCAGAT 


CGTTATCAAA 


CAGTTTACGC 


TAAAGAAAGT 






C /I A r\ 


20 


AGCAGGATTA 


CATTTTACTG 




AATTGAAATT 


AAAAATAAAG 


GTGTTAATAT 






CGCATTTGTT 


ACATTACATG 


TTGGGTTAGG 


TACGTTTAGA. 






c c o n 




GAATGACCAC 


GAAATGCATA 


GTGAATATTA 


TCAAATGACm 


CAAGAAAr*Af7 


/"P/^&' r"^'" f 'a*T**T^ 
V- X Vjrt X X X A X X 


c c Q n 


25 


AAATGATACT 


AAGcCAAAGG 


ACATCGCATT 


ATATCAGTTG 


GTACAACTPTt^ 


A A A OfiT a a 


3b4U 




CTTGAAACAA 


TTCX3ACGCGA 


TCATGATAAA 


TTTGTTGAAA 


CGAGTGGCTG 






30 


TTTATTTATC 


CAGGATTTGA 


TTTTAAAGCA 


ATTGATGGCC 


AGATTACTAA 


XXX X\>.^%x X Xn 




CCAAAATCAA 


CATTAGTTAT 


GCTAGTATCA 


GCGTTTAGTA 


GTCGTGAAAA 


X W X X X Ut^V^ X 


7 o ^ u 




GCTTATAAAA 


CGGCAGTAAA 


TTTAGAATAT 


AGATTCTTTA 


GTTTTGGCGA 


TGCAATGTTA 

X VJ^af^f^ X X XX^ 


^ o o w 


3S 


ATTATATAAA 


AAGAATGTGA 


GGATTTTGAA 


TATGCCTGCA 


GTAACATACG 


AACACATTAA 


^ v 




AACTTGTAAA 


CAATCAGGTG 


CGCGTTTAGG 


TATCGTGCAC 


ACACCACACG 


GTT^ATTmA 

W X X %a^%X X X 


w V W V 




AACACCTATG TTTATGCCAG 


TTGGTACTAA 


AGCAACCGTT 


AAAACAATGA 


GTCCAGAAGA 


6060 


40 


GTTAAGACAA ATTGAAGCAA 


AAATCA'!"!*!"!' 


GGGCAACACA 


TATCATTTGT 


GGTTACAAG<? 


6120 




CGGAAATGAT ATTATCAAAC 


ACGCTGGGGG 


ATTACATAAA 


TTCATGAATT 


GGGATGGTCC 


6180 




GATTCTTACA GATTCAGGCG 


GTTTCCAAGT 


GTTTAGTTTA 


AGTAATTTAC 


GTAAAATTAC 


6240 


45 


AGAAGAAGGC 


GTGGAATTTA 


GACATCATAC 


TAATGGGTCT 


AAATTATTTT 


TGAGTCCTGA 


6300 




GAAATCAATG 


CAAATTCAAA 


ATGATTTAGG 


ATCTGATATT 


ATGATGGCAT 


TTGATGAATG 


6360 




TCCACCGATG 


CCTGCTGAAT 


ATGATTATGT 


AAAAAAATCT 


ATTGAACGTA 


CAACACGTTG 


6420 


SO 


GGCGAAAAGA 


TGTCTAGATG 


CACACCAAAG 


ACCTGAAGAT 


CAAGCATTGT 


TCGGCATTAT 


6480 




ACAAGGTGGC 


GAATATGAAG 


ATTTAAGAGA 


ACAAAGTGCA 


AAGGATTTAG 


TAGAATTAGA 


6540 
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20 



25 



AATGGTTGAA CATACAGAGC AGTTTATGCC TAAAGATAAA CCAAGATATT TAATGGGTGT 6660 

AGGATCTCCa GATGCGTTAA TCGAATGTAG TATTCGCGGC ATGGATATGT TTGATTGTGT 6720 

CTTACCGACA CGTATTGCCA GAAATGGTAC TTGTATGACA TCGCAAGGTC GTTTAGTTAT 67 80 

TAAAAATGCA AAATTTGCAG ATGATTTAAG ACCGTTAGAT GAGAATTGTG ACTGTTATAC 684 0 

ATGTCAAAAC TATTCAAGAG CGTATATACG TCATTTAATC AAGGCAGAGG AAACTTTTGG 6900 

TATTCGTCTT ACTACTATTC ATAATTTACA TTTTCTGCTA AAATTAATGG AAGATATAAG 6 960 

ACAAGCCATT CGAGAAGATC GTCTTTTAGA TTTCAAAGAA GAATTCTTCG AGCAATATGG 7020 

ATTAAATGTT GAGAACCCAA AAAACTTTTA AGCAAGAGGA GCGTATAAAA TGCAATTTTC 7080 

ATTACTAATA TATATAGTCG TAATTTTTGC GGTTATGTAT TTCTTGATGA TCAGACCACA 7140 

ACAAAAACTG TGCGAAACA 7159 
(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; double 

(D) TOPOLOGY: linear 



30 



35 



40 



45 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

TCCATTTATA GAAATTTCTA AAGCAGAnAA TAAGATAGAA GATATCGGCC AAGGTGCAGA 60 

AATCATCAAA AGAACACAAG ACATTACTAG CAAACGATTA GCTATAACTC AAAACATTCA 120 

ATTTG A TTTT GTAAAAGATA AAAAATATAA CAAAGATGCC CTAGTTGTTA AGATGCAAGG 180 

CTTCATTAGC TCTAGAaCAA CATATTCAGA CTTAAAAAAA TATCCATATA TTAAAAGAAT 240 

GATATGGCCA TTTCAATATA ATATCAGTTT GArAACGAAA GACTCTAATG TTGATTTAAT 300 

TAATTATCTT CCTAAAAATA rAATTGATTC AGCAGATGTT AGTCAGAAAT TAGGCTATAA 360 

TATCGGCGGA AACTTCCAAT CAGCGCCATC AATCGGAGGC AGTGGCTCAT TCAACTACTC 4 20 

TAAAACAATT AGTTATAATC AAAAAAACTA TGTTACTGAA GTAGAAAGTC AGAACTCTAA 4 80 

AGGTGTTAAA TGGGGAGTGA AAGCAAATTC ATTCGTTACA CCGAATGGTC AAGTATCTGC 54 0 

ATATGATCAA TACTTATTTG CACAAGACCC AACTGGTCCA GCAGCACGAG ACTATTTCGT 60 0 

CCCAGATAAT CAACTACCTC CTTTAATTCA AAGTGGCTTT AATCCATCAT TTATTACAAC 66 0 

ATTGTCACAC GAAAGAGGTA AAGGTGATAA AAGCGAOTTT GAAATCACTT ACGGCAGAAA 720 

CATGGATGCT ACATATGCTT ACGTGACAAG ACATCGTTTA GCCGTTGATA GAAAACATGA 780 
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AAAAATTAAA AGCATCACAC 


CTAAGTAAAC 


AGTTCaATCA 


TCTTAAAAAA 


TCCTGGGACA 


900 




CTTCATACTT GTCTCAGGAT 


TTTTTmACAA 


ATTGAATCAG 


CCTCATAACA 


TTAAATTATT 


960 


5 


TTATCGTACA TTAAATTTAA 


TAATAACAAC 


TGATTTTrAT 


AAGAATAAAG 


TATCGAmCCA 


X020 




TAGTAGATAC ACAAATAATA 


CAAATGAAAC 


AATTTAACTT 


GAAAGCTTAa 


ATAAATATTA 


1080 


10 


TCAAGTTAAT AAACAATTAA 


TTTTTAGATG 


GATTCATCAA 


AAATCGTAAA 


AAAGCACAAT 


1140 


TTGTATTTTA CAAACATTAA 


TTAAAAAAGA 


AAGCAAGACA 


TTCGTGCAAT 


CGGTTACCTT 


1200 




AAATTGTTTA CAACTGTCAA 


CAATACCAAG 


GTTTTATTAA 


CTATATTTCT 


CACAAAATTA 


1260 


IS 


GCTTTTAGCA TTCCAAACAA 


AAAAGGTTAA 


ATCGAACGGA 


ATTATGGCAT 


TTTTAACTTA 


1320 




ATTGTAAAAA AAGTTGATAA 


TGGTCAATTG 


TTAATGAACA 


GTTAATTATA 


ATAACGCCCA 


1380 




AAATATATTA TTATTTAATT 


AAGTTAAATA 


AAATTATAGA 


AAGAAAGTGA 


AACTTATGCT 


1440 


20 


TAAAAATAAA ATATTAACTA 


CAACTTTATC 


TGTGAGCTTA 


CTTGCCCCTC 


TTGCCAATCC 


1500 




GTTATTAGAA AATGctAAAG 


CTGCTAACGA 


TACTGAAGAC 


ATCGGTAAAG 


GAAGCGATAT 


1560 


2S 


AGAAATTATC AAAAGGACAG 


AAGATAAAAC 


AAGTAATAAA 


TGGGGCGTGA 
• 


CTCAAAATAT 


1620 


TCAATTTGAT TTTGTAAAGG 


ATAAAAAATA 


TAACAAAGAT 


GCTTTGATAT 


TAAAGATGCA 


1680 




AGGATTCATT AGCTCTAGAA 


CAACATATTA 


CAACTATAAA 


AAAACTAATC 


ATGTTAAAGC 


1740 


30 


TATGCGATGG CCATTCCAAT 


aTaATATTGG 


TTTAAAAACA 


AATGATAAAT 


ATGTTTCTTT 


1800 




AATTAATTAT TTACCTAAAA 


ATAAAATTGA 


ATCTACAAAC 


GTGAGTCAGA 


CATTAGGATA 


1860 




CAATATCGGT GGTAATTTCC 


AATCAGCCCC 


ATCACTCGGT 


GGTAATGGAT 


CATTTAACTA 


1920 


35 


TTCTAAATCG ATTAGCTATA 


CACAACAAAA 


TTATGTAAGT 


GAAGTAGAAC 


AACAAAACTC 


1980 




AAAAAGTGTT TTATGGGGCG 


TCAAAGCGAA 


TTCATTCGCC 


ACTGAATCAG 


GTCAAAAATC 


2040 




AGCCTTTGAT AGCGATTTAT 


TTGTAGGCTA 


CAAACCTCAT 


AGTAAAGATC 


CTAGAGATTA 


2100 


40 


TTTCGTTCCA GACAGTGAGT 


TACCACCTCT 


TGTACAAAGT 


GGATTTAACC 


CTTCATTTAT 


2160 




CGCCACAGTA TCTCATGAAA AAGGTTCAAG 


CGATACAAGC 


GAATTTGAAA 


TTACTTACGG 


2220 


45 


AAGAAACATG GATGTCACTC ATGCCATTAA AAGATCAACG CATTATGGCA 


ACAGTTATTT 


2280 


AGACGGACAT AGAGTCCATA 


ATGCATTTGT 


AAATAGAAAC 


TATACTGTGA 


AATACGAGGT 


2340 




CAATTGGAAG ACTCATGAAA 


TCAAGGTGAA 


AGGACAGAAT 


TGATATGAAA 


ATGAATAAAT 


2400 


SO 


TAGTCAAATC ATCCGTTGCT 


ACATCTATGG 


CATTATTATT 


ACTTTCTGGT 


ACTGCTAATG 


2460 




CTGAAGGTAA AATAACACCA 


GTCAGCGTAA 


AAAAAGTCGA 


TGACAAAGTT 


ACTTTATACA 


2520 




AAACAACAGC CACAGCAGAT 


TCTGATAAAT 


TTAAAATTTC 


ACAGATTTTA 


ACATTTAATT 


2580 
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ACTCAGGCTT 


TGTGnAACCT 


AATCCTAATG 


ACTATGACTT 


TTCAAAATTA 


TATTGGGGAG 


2700 


CTAAATACAA 


TGTATCTATA 


AGCTCACAAT 


CTAATGATTC 


AGTAAACGTC 


GTTGATTATG 


2760 


CACCAAAAAA 


TCAAAATGAA 


GAGTTTCAAG 


TTCAAAATAC 


TTTAGGCTAT 


ACATTTGGTG 


2820 


GTGACATTAG 


TATCTCTAAT 


GGTTTATCTG 


GTGGACTTAA 


TGGAAATACA 


GCTTTTTCTG 


2880 


AAACAATTAA 


TTATAAACAA 


GAAAGTTACA 


GAACAACATT 


AAGTCGCAAC 


ACAAATTATA 


2940 


AAAATGTTGG 


CTGGGGAGTT 


GAAGCACATA 


AAATTATGAA 


TAATGGTTGG 


GGACCTTATG 


3000 


GAAGAGATAG 


CTTCCACCCA 


ACATATGGTA 


ATGAACTCTT 


CTTAGCTGGC 


AGACAAAGCA 


3060 


GTGCATACGC 


TGGCCAAAAC 


TTCATAGCGC 


AACACCAAAT 


GCCATTATTA 


TCTAGAAGTA 


3120 


ACTTCAATCC 


AGAATTTTTA 


AGCGTACTAT 


CACACAGAC 






3159 


(2) INFORMATION FOR SEQ ID NO: 3 04: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 






GCAAAAACTT 


TTCTCCAACT 


ATGTCAGACT 


ATAATGGCAT 


GAACAAAATT 


GATATGATGA 


60 


ATCAAATTAA 


AGTTGATACG 


ATGTTGCATG 


GTTATCACGC 


TGGATriTTA 


TTTGCATTAT 


120 


TGATTACAGT 


TGTTAGTTTC 


TTTTGTTCAT 


TTATGTTACA 


AGGTAAGAAA 


AAAGAAGTTG 


180 


ATTCTCGTCA 


ATAAATATAA 


GTTGCTAGCT 


ATATAAAGCT 


TTTTAGCAAA 


AGTTCAACAT 


240 


ATTGACTTAT 


CCGGCATTTC 


AGATTAAAAT 


ATTTTTATTC 


CGATTAGAAT 


AATAAGAATA 


300 


AGGAC3ATATA TTCTATGAAA 


AGACTTTfAT 


TTGTGATGAT 


AGCTTTOGTT 


TTCATATTGG 


360 


CTGCATGCGG 


AAACAATTCG 


TCGAAAGACA 


AGGAAGCTAG 


TAAAGATAGC 


AAGACAATTA 


420 


ATGTTGGGAC 


TGAGGGGACT 


TATGCACCAT 


TTAGTTTCCA 


CGATAAAGAT 


GGTAAATTAA 


480 


CTGGTTACGA 


TATTGATGTT 


ATTAAAGCAG 


TGGCTAAAGA 


AGAAGGTTTA 


AAACTTAAAT 


540 


TTAATGAAAC 


TTCTTGGGAT 


TCTATGTTTG 


CAGGTTTAGA 


CGCAGOGCGT 


TTTGATGTAA 


600 


TCGCGAACCA 


AGTAGGTATT 


AATCCTGATA 


GAGAAAAGAA 


ATATAAATTT 


TCTAAGCCTT 


660 


ACACATTCTC 


AAGTGCTGTT 


TTAGTTATTC 


GTGAAAATGA 


AAAAGATATT 


AAAGATTTTG 


720 


ATGATGTTAA 


AGGTAAGAAG 


TTAGCACAAA 


CATTCACATC 


TAATTATGGT 


AAATTAGCTA 


780 


AGGATAAAGG 


TGCTGATATT 


ACAAAAGTTG 


ATGGCTTTAA 


CCAATCAATG 


GATTTATTAT 


840 
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TGcTAAGATC 


AAAGCAATCA 


AAGGTAATGC 


TGAACAAAGT 


AGATCTGCAT 


960 




TTGCATTTTC 


TAAAAAAGCA 


GATGATGAAA 


CAGTTCAAAA 


ATTCAATGAT 


GGCTTGAAAA 


1020 


5 


AAATCGAGGA 


AAACGGTGAA 


TTAGCTAAAA 


TAGGTAAGAA 


ATGGTTTGGT 


CAAGATGTTT 


1080 




CTAAATCTAA 


ATAGCGAACA 


ACAACATGCG 


CTAGATGCTG 


CAAAACAAGC 


TTTCGGACCT 


1140 


70 


ATGCTAGAAG 


GTTTGGTCAA 


ATATTCAATT 


CCTATTACAT 


TAGTTACATT 


TGTTTTAGGA 


1200 




TTGATTATTG 


CATTATTTAC 


AGCATTAATG 


CGAATTTCAA 


CGAGTAAAAT 


TTTAAGAAGT 


1260 




ATTTCACGTG 


TCTATGTATC 


TATTATTCGA 


GGAACACCAA 


TGATAGTACA 


ACTATTTATC 


1320 


IS 


ATATTTTATG 


GTATTCCAGA 


ATTAGGTAGA 


TTATTAACAA 


ATGACGCTGA 


CAACCAATGG 


1380 




ACATTGGCAC 


CTGTAGTGGC 


TGCTATTATT 


GGTTTATCAT 


TAAATGTAGG 


TGCGTATGCT 


1440 




TCGGAAATTA 


TTCGTGGCGG 


TATTATTTCT 


ATACCGAAAG 


GACAAACAGA 


AGCTGCaTAT 


1500 


20 


TcCaTCGGTA 


TGACGTATGG- TCAAACGATA 


CAACGTATCA 


TTTTACCGCA 


GGCAATTCGA 


1560 




GTGTCGATTC 


CTGCACTAGG 


TAATACATTT 


TTAAGTTTAA 


TCAAAGATAC 


ATCATTATTA 


1620 


25 


GGATTTATTT 


TAGTGGCTGA 


AATGTTTAGA 


AAAGCTCAAG 


AAGTTGCGTC 


TACAACATAT 


1680 


GAATATTTAA 


CAATTTATGT 


GTTAGTTGCG 


CTAATGTACT 


GGGTGGTATG 


CTTTATTATT 


1740 




TCAATTATCC 


AAGGTATCTA 


TGAATCTTAT 


ATTGAAAGAG 


GGTATCGCTC 


ATGATTCAAT 


1800 


30 


TGAACAATAT 


CCATAAATCA 


TTTAATGATG 


TTGAAGTCAT 


CAAAGGTATT 


GATTTATCTG 


1860 




TTGAACAAGG 


TGAGGTTGTA 


ACCTTAATCG 


GTCGATCTGG 


TTCAGGTAAA 


ACAACATTGT 


1920 




TACGTATGAT 


TAATGCATTA 


GAAATTCCAA 


CTGAAGGTAC 


AGTTTATGTT 


AACX3GCAAAA 


1980 


35 


CATATACATC 


TAAAGATAAA 


AAATCACAAA 


TAGAAGTTCG 


TAAACAGTCT 


GGTATGGTAT 


2040 




TTCAAAGTTA 


TAACCTTTTT 


CCGCATAAGA 


CGGCATTAGA 


AAATGTAATG 


GAAGGTCTTA 


2100 




TCACAGTTAA AAAGTTGAAA AAGGATGAGG 


CACGTGGGAA 


ATCACTTGAG 


TTACTTGAGA 


2160 


40 


AAGTTGGTTT 


AACACATGTC 


AAAGATCAAC 


GTCCACATGC 


ATTATCAGGT 


GGTCAACAAC 


2220 




AACGTGTTGC 


TATTGTCAAG 


AGCACTAGCA 


ATGAACCCTA 


AAGTGATGTT 


GTTTGATGAA 


2280 


4S 


CCAACATCTG 


CACTTGATCC 


TGAACTTGTG 


AATGATGTTT 


TAAAGGTTAT 


TAAAGATTTG 


2340 


GCTAATGAAG 


GCATGACAAT 


GGTCATTGTG 


ACACATGAAA 


TGCGTTin'GC 


TAAAGAAGTA 


2400 




TCTAATAACA 


TTGTATTTAT 


TCmTGAAGGC 


ATGATCGGAG 


AACAAGGGGC 


TCCAGAAGAG 


2460 


SO 


ATGTTCAATC 


GTCCGAAAAC 


AGAAGAATTA 


AGACGTTTCT 


TAAATGTTAT 


AAATGAAGAA 


2520 




TAATCAAATA 


GAACCACGTA 


TCATGTTTTA 


GTATGGCGAT 


GAAGCCATAT 


ACATGATGCG 


2580 




TGGTTCTTTG 


TTATGTTGTC 


ATAATCTTGG 


AGCGATATTT 


TAACGACGTT 


TATGATTTAA 


2640 
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20 



30 



TTCTACATGT 


GCGTTAAAAC 


CTTTlTlXiAA 


TTGTTGGACG 


CCATAGTCTT 


CTGATGACTC 


2760 


TGAAAAGTCA 


CCGGTAATAC 


CATAAAAATT 


ATAGCGATCA 


ATATGATGCG 


CTTTAGCAAA 


2B20 


CTTAATCATT 


TCCCaCTGCA 


m m m^«^4#n wk ^^^^^^ 

AATGGTAGGC 


ACCCATATAA 


GCATTATATT 


TAGGGTTTGA 


2880 


ACCACTAGAT 


AAGTAATAAA 


CTTCATGCTC 


ATTGTAGATA 


AATAAAGCAG 


AAGCTAAGTT 


2940 


TAAGACTGCA 


CCATCTTGTT 


CAATTTGTTC 


TATTGTATTG 


TCGATTTTAC 


GCTTATTGCT 


3000 


ATTTAGCTGT 


TGTTCTAGCT 


GTGTGCGTTT 


CGTTTTATTT 


TTCTTTGAAT 


TAGGACTTTC 


3060 


TTCCAATGCT 


TCTTCAACAC 


CTGAGAGTTC 


AGCTGTTAAT 


TGTTGTTGCT 


TTAATTGTAA 


3120 


CGTTTTTAAA 


TACTCGTTTA 


AATCAATATA 


CGCCAACTTT 


AACATGGCGT 


GGTCATCGTA 


3180 


TAACTTTTGC 


ATTTCTTCAA 


AGTATGGTAA 


CTCACGGAAT 


TTGAAACCGT 


GCTTTTCCTC 


3240 


AGCCATATGG 


AATAAGTCGA 


AAAAAGTTTG 


CGTTTCATCA 


ATCGTTAAcG 


TTTTaGTTTT 


3300 


GACACCAAtA 


TCaTATGTtT 


tTtTAATATT 


ACGTCTCGtT 


TGATAATCCA 


TTTCTTTTAA 


3360 


AAGTTGGTCT 


TCAGTCTTAT 


CTTTTAAATC 


TAACACTGAC 


AGCCAACGGA 


TTTGGCTCAT 


3420 


TGAATCATAA 


CCTACAGGGA 


AACCTTGGTG 


TTTATAACCT 


AATTTATCCA 


TTGTTCTaAC 


3 4 80 


AAATGCTCGG 


TTATCATAAG 


ATTTAACAAT 


TTCACCGTCT 


GCATTGCGTA 


AATTTTCAAT 


3540 


TAAATATGGA 


TCTACAAGGA 


CATATAAACA 


ATTGTGTTTC 


TTTAAATATG 


ACGTTAATGC 


3600 


TTTAAAGAAA 


AATGCTACTA 


ATGATTGATT 


TGTATAATCC 


ATCACTGGCC 


CGCGATGTGT 


3660 


ATAAAAATAT 


TTGAAAAATT 


TAAGTGTGCG 


TGCTTCTGTC 


AATAAGCATC 


CTGCAATCAC 


3720 


TTGACCATTG 


TCATCTTTAA 


CCCCTACrAG 


ATGCACATCG 


CCTTTTAAAT 


CAACTCTATG 


3780 


ATTGTAATGA 


ATAGCTGATT 


GTGTGTAATG 


TGAAAAATGC 


T 




3821 



(2) INFORMATION FOR SEQ ID NO: 305: 

7 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



4S 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 
GCGACACCAT TGTTTGCAGC AACTGTTAAT TTACCAGTTG ATGGTTTCGA CGGTGTTGTT 60 
5^ GGTTTTGATG GTGTAGTAGG GGTAGTTGTA GATTGCGTAC CCCATGGCGC CACTTTACCC 120 

ATTTTTATTA AATATTTTTC ATTAATTAAG TCATATAATT GATCATAACT ATAATTATGA 180 
CTTCTTAAAT ATCCATGTGG ATcGGCATGG TCAGTACCAC CTAAATATTT ACTTACAGCG 24 0 
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TATTGTAATT GTGTAGCTGC ATAGTCAGCA TAGTTATTCA TTGAACGTGC AAATGAAGCA 
TAGTCGTGTG TGTGTACGAT TTCAACATTG ATGAATCTAG GGTTACCGAC TGCACCGACA 
CCCCAAGATA AGTAATCCGT TGGTGCTGTT TCGATTATAC GATCCCCATC AACAAATGCA 
TGTACGAATG CGTTTTGATA GTTATTTTTC ATATAACTAA TTTCACCATT TATCGTCGAA 
CGATCATTAG CTGTATCATG AACTACGATA CCTTCAGGAC GACCTACGCC GTTACGGTAT 
GCGTATTTAG GGAAGTAAGA TGTATAATCT TCTTCAATTT TAGGTGCTTT TAAGTTATTT 
TTACAAATGT AATCGTTAAT TGAAGAGTTT ACTTGTGGTT TATATTTTGG CAAACTCGTT 
TTTGGTGTTG CAGCAACTGA TCTTGGTTGT GCTGAAGCGC TAAAAGTAGT TACTTTAGGT 
GTCGCTTCAG TTTTAGCTTT AGGTGCTGAT GTAGTTGCAG CTTTAGGTGC TGCGGTTTTA 
TATTGCGTyT CAAGAGCTGC AGGTTTAGCA GCTGATTTAA TTAATTCTGG ATTAATTTGA 
TTTTCTGAAT TATCATCTTC ATCATCAACT AAACTATAAC CAGCATTTGT AACATTAGTG 
TTAGTTTTAG GTGCTGTAGT GCTTGTTGAC TTTGCAACAG GCTGCGTATT ATTTGTAGTC 
GCTGATTGAT TAGCACGAGT GTCACCATTT ACTTGTGCAG TATCAACTTT TTGACTTACT 
TGAGCATTGC CTGTTTTGTT ATTTGCTGTT TTTGGTTGGA CAATAGCAGG GTCTTGATAT 
ACTTGAGTGC CAGAAATGTT TTGCGTTGGA TTTTTTACCT CAGCTTTTGC TTGTTCAGTA 
GTTGCTTTAA CTTTATTACT ATCTAAAACG TTTTTATTAG TAGTTTGATC TTGTGTCGTC 
TCAGCTGCTT GAACTTGATG TGCAGTGACT GCTGAACCTA CAAGCGTTAA TGCAACCATT 
GAgGTAGTTT GTAATTGAAT TTTTTCGCCA TTCTATTTAT TACTCCyAAC ATTTATTAAT 
TATtACTAAC ATTATAGTAC CTGTnTTATA TACCTGTGCG TA 
(2) INFORMATION FOR SEQ ID NO: 306: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6076 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
:r (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 
ACGCGTTGGT TATTTCAATC ATAATATTAC TCTGCAAATA CACAGCCCTG TAACCGAGTA 
AATGGATAGA GATTTGAACA AATGAAAACA ATCAACTAAT GGAAAGGATA AAATATTATG 
CACAACAATA ATGAATTCAA CAAAAAACTG AAAGATTTTA TAGGTAGCGA TAAACGGATG 
GCTTTAGTAA AGGGTTATGT TAACGAGTAT AAATTAACAA CTGTTTTAAG AGCTTTAAAT 
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GTAGATAATC GAATTATTCC TAAAAATATA ACTCAAAACA CAATCTTCAA ATTGAGTAAT 360 

TTAACCTTAG AGGTTAACCT CTATGAAAGA AACACAATTT ATCACGGAGA CTTTGCTATT 420 

TACTATCCTG TAGAATCTGC TCTAATAATA GAAACAGATA CTCCTAGGCT AATTAATCAT 480 

ATAAATAACA ACCCAGTCCA AAAAATATTT ATAATAACAA CAAATGACTG GAGCTTTAAT 540 

ACACTTGaAT TAGAAAAAAT AGTAGACGAA ACTATAATTT ATGATTTAAA ACAAGAAGAT 600 

CCTAAACAAT ATGAAATTTT ATATAAAAAT AAACACGGAA AATTGCCTTA TTGATGAATT 660 

AAAACGCAAA CTTAGAACAC CTTACCCCTT TTCCX5TCCTT ATTGTAATTT TCCAACTACT 720 

CAAATTAAAA ACCACCCX3TG CAAACAGGTG GTTTTATTAT ATATAATTAT TCCCACTCAA 780 

TTGTGCTTGG TGGTTTTGAT GTAATGTCAT AGACTACGCG GTTGACGTGA TCTACTTCGT 84 0 

TTACGATACG ACTAGAAATC TTTTGTAAGA CTTCCCAATC GATGCGTGCG AAGTCACTTG 900 

20 TCATACCGTC GATAGATGTT ACTGCACGAA TACCTACTGT GTGATCATAC GTACGGTAGT 960 

CTCCCATAAC ACCTACTGAT TGAATGTTTG GTAACACTGT GAAGTATTGC CAAATTTCTC 1020 

TTTCAAGACC TTCTTCTCTA ATCACTTGGC GTAAAATCGC GTCTGATTCT CTAACGATTT 1080 

CTAGTTTATC TTCAGTAATT TCTCCAAGTA CACGAATACC AAGACCAGGT CCTGGGAATG 1140 

GTTGTCTCCA TACTAAATGT TCTGGAATAC CTAACTCAAT ACCTAATTTA CGTACTTCAT 1200 

CTTTAAACAA TGTATTGATT GGTTCGATTA ATTCGAATTC CATGTCTTCT GGTAATCCAC 1260 

CAACATTGTG GTGTGATTTG ATTGTTTGTG CTGTTTTAGT ACCTGATTCG ATGACGTCTG 1320 

TATATAGTGT TCCTTGCGCA AGGAAGTCTA CACCTTTCAG TTTTGATGCT TCATCATCAA 1380 

ATACGTATAC AAATTCATTA CCAATGATTT TACGTTTTTG TTCAGGATCT GAAACACCTT 1440 

TTAATTTATT CATAAAGCGA TCTTTCGCAT TAACACGAAT AATATTCATG TTGAAACCTT 1500 

CACCySAATTG CTCCATAACC ATGTCGCCTT CACCTTTACG AAGtAAGCCA TGGTCTACAA 1560 

40 AGATACATGT TAGTTGATCA CCTATTGCTT TATGCAATAG TACAGCTACA ACAGATGAAT 1620 

CTACGCCGCC ACTCATCX»CA CATAATACAC GACGGTCTCC TACGCGTTGA CGAATCTTTT 1680 

CAATTTCGAT TTCGATAAAG TTTTCCATTG TCCATTGACC TCTACAATCA CAAACACGAC 1740 

GGACAAAATT ATTTAATAAA TCATTACCAT ATTCTGTATG ACGTACTTCT GGATGGAATT 1800 

GAACACCATA AATGCGACGT TTCrTATCTT CGATTGCTGC ATAGTCTGTG CTTGGGCTAT 1860 

CAGCGATAAC TTCAAAGCCT TCTGGAATTT CAATAACTTT ATCAGAATGA CTCATCCAAA 1920 

CAGTTTGTTC TGCTGGTAAG CCAGCGAATA ACTCATCTGA CTTCGCATTA ATGATTGCTT 1980 

TACCGTATTC ACGTTCATTG GCACGTTCAA CTTTACCACC TAATAATTTA GTAGTTAATT 204 0 
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TAAATGAACC TTCTTCATAA ACTGAATTTG 
TCATTTTCTT AATTTCTTCA ATTGAAATTT 
TTTCACGAAT TCGGCGTGTA ATTAATTGGT 
TTAACTCTTG TTCTTTTGCC ATTTCCATAT 
AGTAGTTCGG TGATTCTTTC GTAATTTGAA 
CAGGACCCAT ACGTGTAAAT TGTGCTTCTT 
TATAACCCAT ACCAGCACGC ACACCGCCCA 
CTTTATAAGC CGTACGTCCT TCGATACCTT 
CTTGGAAGTA ACGGTCGTTT GAACCTTTTT 
ATACTTTATA TTGTCTACCT TGGAAAATTT 
CTAATAAGCT ACCTAACATA ACCGCATGTC 
CTGAGAATTT AATACCACCA TCAGCAATGA 
CACAATCATA AATTGCTGTA ATTTGTGGTA 
AAATTGAACC TGGGCCAATA CCAACTTTAA 
TTGTTGCTTC TGCAGTTGCT ACGTTACCTG 
TAATATGTTT CACTTGATCG ATAACACCTT 
AGACATCCAC ACCTGCTTCG ACTAATTTTT 
CAATTGCTGC GGCTACAAGT AGACGACCAT 
TAACTTTTTC AATATCTTTA ATAGTAATAA 
GTAACTTTTC AATCTTATGT TTTTGGAGAA 
CTGH5AGCTGT AATTAAATTt TCTTGCGTCA 
CAATAAAACG TAAGTCACGG TTTGTTAAAA 
CAACAATTGG TACACCTOAA ATACGGTATT 
TTTCTTCTGG CGTTAAGAAA AATGGGTTTG 
TTTGAACTTC GTCCGCTTGT TCTTCAACGC 
CTTGACGAGC CATAGCAATC GCCATTTTAG 
TAACTGGAAT ATTTAATTTA ACTTTGTCTG 
GTAAAATATC AGATTGTGCT GGAATTAATA 
CAAATTTACT TTCCCACATT AAAAACAGCC 



GACCACCTGA 
CATGATCGTG 
TGTATTGGCT 
TTGTCGTTCT 
TATTATGTGG 
CGCGTAATTC 
TTAATTGGTA 
CAGGAACAAA 
CCATCGCACC 
CTGTTGCGCC 
CACCAGCAGC 
TAGCTTTACC 
CACCAACACC 
CAATATCTGC 
CTACTAATGT 
TAGAGTGACC 
GAGCACGAAT 
GTTCATCTTT 
GACCTTCTAG 
TTTTTTCTGC 
TTACATCTAC 
TACCTACTAA 
TACCCATTAA 
AAATGACACC 
CCATATTTTT 
ATTCAGTTAC 
ATAATTGTAC 
ACACATCATC 
TCCATTTTTC 



TAAGATAATA 
TAATTCACTA 
ACCAAAGTCT 
CCTTTATCTT 
ATGGCTTTCT 
TCTTAAATCG 
AATTGTATCT 
TTTCTTAGGC 
TAAAGAGCCC 
TGGGCTTTCT 
TAATGCTTTA 
ATGTTTGCGT 
TGCTACAACA 
ACCCGCTTCA 
GATTTCTGGG 
ATGTGCTGTA 
ATCAGTATCT 
TGCTGCATTA 
ACGTCCGTCT 
TTCTTCAAGT 
AATTTTAATC 
GTTGCGATCT 
TGCTTCTGCT 
ATTTTCTGAG 
ATGAATAACA 
AGTATCCATA 
GCTTAAATCA 
AAACGTTAAT 
AAATTAATTA 



CCTTTTGGAT 2160 
TAAACGCCCA 2220 
AAGACAAGGA 2280 

AATTAGAATG 2340 

GCTAAACCAG 2400 

TGTGAACCAG 24 60 

TGTAACGCAC 2520 

GCTTTGTCCT 2580 

ATACCACXX3T 2640 

TCAGTACCTG 2700 

ATGATATCTC 2760 

GCTTCAGTtG 2 820 

CGCGTCGTAC 2880 

AATAAATCTT 2940 

TAAGTCTTCT 3000 

TCGATAACTA 3060 

TTTGAAATAC 3120 

GGGAATTCGA 3180 

TTAACTAATG 3240 

GTTGTATTCA 3300 

GAGAAGTCTT 3360 

TCTTTATTAT 3420 

TCATAAACGC 34 80 

CGTTTTACTT 3 54 0 

CCTAAACCAC 3600 

CCAGCAGAAA 3660 

ACGTCTTTCG 3720 

GATTCTTTTG 3780 

GTTATATTAT 3 840 
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GCAGAGATTG CGCCTAAAAC AATTCCGTTT TGAGTCAACC ATGCAAATTG TTCACCTAAA 3 960 

C C TTTAAATG CTTGTGGTAC AGCGCTTATA CCAGTACCTA AtCCTACTGA TACAGCGATA 4020 

ATTAATAAAT TGTTTTGATT TTTAAAATCG ATATGTCCTA ATATACTAAC ACCATATGCC 4 0 80 

ATTACCATGC CAAACATAGC TATCATCGCA CCGCCTAACA CAGGTAGCGG TATGATATTT 4140 

GCTAATGCGC CAAGCTTAGG TATACAACCA CATATAAGTA ATAACACGAC CATGCCGTAT 4 200 

ATAACATTGT TTTTCTTAGC GCCGGATAAA GAAACAAGTC CTACATTTTG CGAATAGGCT 4 260 

GTATACGGAA ATGAATTGAA TATAGAACCT AAcACTATCG CTAGACCTTC CGCAGTATAA 4 320 

,5 CCTTTACGAA AATCTTTTCT TTCTAACTTC TTACCGGTAA TTTCACTTAA CGCATGATAG 4 380 

ACACCTGTCG ACTCAATTAA ACTAACGATA GCTACAATAA AGAACACTAA CGTCGATGTC 4440 

ACATCAAAGC TAAATCCAGA GAATCTAAAC GGCACTGGGA TGCCTAACCA ACCGGCATGA 4 5 00 

20 TTGACTTGAT TAATATCGAC CATCCCAAGT AAGCCAGCAC CTATCGTTCC TAAAACGAGT 4 560 

CCAATTAATA TGGCAATACT CTTAATAAAT CCAGTTGTGA ATCTTTGTAA AAGAAGAATA 4 6 20 

ATGATTAATG TCATTAAACC TAACAAAATG TTCTTAACAT CTCCATAGTC CTTTGCACCT 4680 

TGACCTCCAG CTAAGTAATT CATTGCTACT GGCATTAAAT TGATACCAAT GATAGTAACA 474 0 

ACACTACCCG TTACTACTGG TGGGAAGAAT TTTACAAGAT GTGAAAAGAA AGGCGCGATG 4 800 

ATAATAACTA ATATCCCTGA TAAAAATAGC GAACCATAAA GTACATCTAT TCCTTTCGTT 4 860 

TGACCAATTA AAATCATGGG CGCAACAGCC GTGAATGTAC ATCCAAGAAC GATTGGTAAT 4 920 

CCTGTTCCTG TTACTTTATT GGCTTGTAAA AATGTGGCAA CCCCACACAT AAATATATCT 4 980 

ACTGTAACTA AGTAAGCGAT TTGTTCAGGT GTAAACTTCA AACTTGTACC AACAATGATT 504 0 

GGAACTAAGA TAGCACCTGC GTACATAGCT AAAAGATGTT GAACACTTAG GATTAAATTT 5100 

TTCATTATTC TTCTCCCACC AATGTCACTT TGTTTCCTTC TAGTGAAGCA ACCTTGCAGA 5160 

-^o GAGAAGAAAC TGTTAAACCT GCTTCTTCTA AACGTTGATG CCCATTTTGO AAACTCTTTT 5220 

CAACAACAAT ACCAATACCA GCTGTCTTAG CATTCGCTTG CTGTGCGATA TCGTATAATC 5280 

CTAATGAAGC ATCACCATTT GCTAAAAAGT CATCGATGAT AAGTACAGTA TCTTCTTCTG 5340 

ATAAAAACTC TTTTGAAACA ATGACCGTAC TTGTTTTATT TTTAGTAAAT GAATGAATAG 5400 

ATGTTTCATA ATAACCATCC GTCAAAGTGC TAGGTTTTGC TmTTCGCA AATAAACATG 5460 

GCACATCAAA ATGCAGTGCA GCCATGATTG CAGGTGCGAT ACCGGAAGCT TCAATGGTTA 5520 

AG A TTTT A GT AATCCCTTTA TCTTTAAATT GCTCGTAAAA AGTGCX3ACCA ACTTCATTCA 5580 

TTAACTTTGC ATCAATTTGA TGATTTAAAA ATCCATCGAC TTTTAAAATC TTCTCATCAA 5640 
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ATTTGTGTGA AACATTTTGC TCTTAAATTG GTGCTAGATA CAAAAAAATC CCCAAACTAA 

ATAATAGTTT CAGGGTTTAT GAGTGAACGa ACATGCATAA CGAATTTGTC ATGCAATCAA 

5 

TGTAAAAGAA GTTTCATCAA ATAAACTGTG ACCATCATAT AAAATGATAT AAATCACCCA 

CCATGGTTAC AATTTAATGG CTGAAGCTAC TCCTAGTATT GTGTTGTTAC TCATAGTCAT 
GTCGTTCAAG GCAACATGGT AGAAACTTCT AAAGCCATAT TCTTTAGATT ATATGAGTTT 6000 
ATGTAAATTA TTTAACGATA ATAGCAAATT TTCGGCATTT TTTCAATAAC TGCTTAGGTA 6060 
ATCTTTTAAT AGTTTT 
rs (2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



6076 



60 
120 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 
ACCGCCGATT GATAGCTTTA CTGCTGCGAT AAAACTGCCT TCAACAATTG GTGCATCAAC 
TTTTAACACA CGATGATTAC CATCATACAT TTCAATTGCC ATATCTACAT TCATTTCTGA 

AGATCCAATA TCGTAAAAAC ACAATGCATC ATCCTCTAAT TTAGTCAAAA CTTCTTGGAT 180 

GATATCAAAT GAAGTTCCAA TTGAACCATC TGGTAATCCC CCGATTGGTA TAATATCAAC 24 0 

GTCACCTGCC ATTTGCTTTA ACAAAGATTT TGTACCACTT GcAATTTCTT TACTGTGGCT 300 

3^ AACAAGTATA ATTTTAGGCA TTATCTTCAT CTCCAATCAA AGCGTTTAAA ATATAAACCA 360 

TACTTTGAGC ACCTGGATCA ATATAACCTT TTGATTCTTC TCCAAAATAT GCAGCTCTAC 420 

CTTTCGTTGC TACCATATCT TTTGTATTAT CTGCTAATTG CTGTAAATCA rTGAATGTTA 480 

AAGTTTCACC ATTTTTAAGC TTCTCTGCTG CTCGCGCTAC TACATCATAC ATTGTCTTTT 540 

CATTTAAAfeT AACTTTACCA CGTGATTCAA CCGCTTCGGC AAATGCCTGA ATTAGTGTAA 600 

TGAAATCTTG ATTATCCATA TCATCTTTGG TGACTGCAGA CATTTTAACA AAGCTAAAGC 660 

CATACAGTGG TCCTGATGCA CCCCCAACAT TTGACATCAA TGCCATACCA GTTGATTTGA 720 

ACAATGATTG CATTGAGCTA TCATCAAGTT TGTCTTTAAG ACTACTAAAC CCACGAACCA 780 

TGTTAACCCC GTGGTCACCA TCACCAATTG CTCGATCTAA TTCAGTTAAT TCAGATTCAT 840 

GTTTTTTAAA CGTTTCTTCT AAATTTAATA AACGTGCTTT CATATCATTC ACTTTCATTT 900 

GTGCCACCTC ATAAATGTAT ATTTATTCAT ATTCACTTCT TATTTAAAGT ATTGACTTGT 960 
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ACCTTGCATG TCTAAAGATG TCATATAATC ACCAACAAAC CATTTAGCAA CATTAACCGT 1080 

TCTTGCAGCT AAATTTTGTT GAATATATTT AGTTACGATA TTTAATTCAG ATAACGGCGT 1140 

ACCACCCATA CCATTTACCA TTAATATGAC ATCATTGGCA GTAACTTCTT TATACAATTC 1200 

GTCTAACAAT GTTCCAACAA TATGATCAAT ATCCTTTACT TCTTCCCTAT GAATACCTTT 1260 

TTCACCATGT ATACCAATAC CGATTTCCAT TTTGTCGTCT TCAATATCAA AGCCATATTT 1320 

TCCAGTAGTT GGAACAAGCG GTGGCTCAAT TGCCATACCA ATACTTTTAA TTTCAGGTAA 13 80 

CAACGCTTCT ACACGCGATT TTATCTCTGT TAATGAATAA CCTTTTTCAG CAAGATAACC 1440 

GGCAAGCTTA TGAACAAACA CTGTTCCTGC AACACCACGA CGTTGTACTT CGTTTGTCAC 1500 

AGCAATGTCG TCACX5AACAA TAACAGTTTG AACATTTATA CCTTCCATTT CTGCAAGCTC 1560 

TTGTGCCATT TCGAAATTCA TCACGTCACC TGCATAGTTT TTTACAACTA GTAATACACC 1620 

20 ATCACCAGTA TCTACTGCTT TAATAGCTTC TAATATTTTA TCAGGTGTAG GTGATGTAAA 16 80 

TACTTCGCCA CAAACCGCTG CATCTAGCAT ACCTTCTGCA ACAAAACCGG CATGCGCAGG 1740 

TTCATGTCCG CTTCCACCTC CAGAGACTAT TGCTACACCA TGTTCTTTCT TAGCTTTTTT 1800 

TACAATAACT GTATTAGCAA TCAGATCTAA CTCTGGGTGC GCAATTAACA ATCCTTCAAG 1860 

CATATCAGTT AAAAATGTTT CTTTTTTATT GATTAACTTT TTCATCATGT TGTACCTCCT 1920 

TGGTATTATC AGTCATAGTA TAAAACAGAA TAAATGAATG CGCTATCATA AAGAATTAAC 1980 

CTGATACCGT TATCAAATAT ATTTTTGATA AGATCCTCTT GATAACTTAA TGGTTCATTA 204 0 

TTGAAAAAAT AAATAGTCTT GAGTGGCTCA TTAAATAAAA AATGCTATGA AAAGCCTTAT 2100 

AATAAAGTGC CTTTCATAGC aATAAGTTGT GTCCATTGAC ACTATACATT TTTCGTTTTG 2160 

TACATTAAAT ATAAGAAATA CGGTGCACCA ATAATTGCTA CGACAATACC TGCTGGAACC 2220 

CCAC.CTGGTT GTAATACAAT TTTGCCAATT GTATCAGCTA TAACAAGTAA ACATGCCCCT 2280 

40 ACTAAAATGG CAATTGGTAA AAACAACTGG TGACGTGGTC CAACGATACG TTTGGCAATA 2340 

TGCGGACCCA TTAATCCGAT AAACGAAATT GAACCTGCTA CTGCTACAGC AGCAGATGAT 2400 

AACATCACTG CGATAAAGAA TAATATTAAA CGTTCTCTGC TTAACCTTAC ACCTAGACCT 2460 

CGTGCAATAT TATCACCCGT ATGAATAATA TTTAGTGTAT TCGATTTAAA TAGTAAGTAA 2520 

GGAATAATAA TCAACACCCA CGGTAAAAAT GCAATGACAA ATGGCCATTC GTCACCCCAA 2580 

ATATTACCTG CAAACCAAGC AGCGATGAAA TCAGATTGCT TATCATCAAA TTTTGACATA 264 0 

ATTGTAATTG AGCCACCATA TAATGCTGTT TGTAAACCTA CACCTATTAA TACCATACTC 2700 

GCAGGTGTAA CACCTTCATT TTTATTAAAA CTGaAAATAA AAATAATCaA TGCAGTGgTG 2760 
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CCAATTGCAA TAAATAATGC GATTGCAAAT CCGCCACtGC GTTAATACCT AATATACCTG 2880 

GTTCAGCTAT TGGATTTTTT GTGACACTTT GCACTATTGC ACCACTAATA CTAAgCtGCG 294 0 
CCAGCCAAAA TAGTAATCAT CATCCGAGGT AACCTGAAAT CTAATAAGAT TAACTCATCA 
ATGGCATCAC CTTGTCCAPlT TAAAGTTTTG AAAAATCTTT CAACAGGTAT GTTGTATTCA 
CCTGAGGTAA TACTCCAAGT ACAACCTAGA AATAGTAGAA TGCTAAAAAC AGCCAGTGCT 
ATCAATTGTC rGCGTTTATT ATTTGAACTA ATCATATTGA GCGTCCTCCT TTTTTAACTA 

AATATAAAAA GTAAGGAACA CCGATAAATG AAATGATTGC ACCAACAGGC GCTTCTCCTA 324 0 

,5 AATATCGTGC TATCACATCG GCAACAAGCA CGAGTATCCC ACCTAACAAG GCTGTTAATG 33 00 

GTAGAATTTT AGCATAATCA GTTCCAATTA AAAATCTTGC TATATGAGGT ACCATCAAAC 3360 

CTACAAATGC AACTTGTCCA GCGATAGCAA CTGCAATACC TGCTAGAATC ATAGCAATAA 34 20 

TTAAACATAT GCCTCTGATC ATTGTTACAT TTTGACCTAA ACCTTTAGCT AATGATTCAC 3480 

CAAGATTTAA AATGGTAAGT TGTTTACTAA TTGTTAATAT .AATGAATAAC GCAATACCAA 3540 

TTAATGGAAT TGCCCACTTA AGGTGTGACC ATGTTGTGCC TGAAACGCCT CCAGCAGTCC 3600 

AAAATGTTAC TGTTTGATTT AGTCTAAAAG CTAATGCAAT ACCTTGACTT AGCGCTGTTA 3660 

ACATAGCACT TACTGCTGCA CCCGCTAAAA TAATACGCAT CGGATTAAAT CCATCACGTC 3720 

TAGATCGGCC TATCATTAAT ACAATAGCAC CTCCTAGAAT AGCACCTAAA AATCCAGCAA 3780 

ACATCAATAT TAAAAATGAA GTGTTTGGTA AAACTGCATA TGTTAATGCT AAAGCAAATG 3 840 

AAGCACCTGA ATTTAAACCT ATGAGCGCCG GATCAGCAAG ACCATTACGA GTAACACCTT 3 900 

GTATAATCGC ACCAGAAACT GCAAGCGCCA TACCTACAAT TACTGCTGCT ATATTrCTGG 3 960 

GAATCCTAAT CTCATTGATG ATGTTTTGCT GTTGATTGCT AGGATTATAA TTAAAAATAG 4020 

CCTCTATAAT TGTAGAGGCT TGAATTTTGG CGTCACCTAT TAATGTAGAA ATAAATAGTG 4080 

40 TGATTAGTAG TATCATACTT AAACCTATAA TATAGGATAA AAACTTCAAT GGCGTTGGGT 414 0 

TCTCTCTATT TGTCATGTTA ATTGTCCTTT TTATCATATT AACTTACTTA ATTAAGAATA 4200 

AGCTCTGCGA CATAAGTCAT AAGTTACCAG TAAAGGTTTT CCAGTTTTAG GATCTTTACT 4260 

TAAAACAACA TCAATATTAA AAACTTTTTC TAATATTTCC TGTGTTAATA CGTCTTCTGT 4320 

TGAACCTGTA GCGATGATAT CCCCTTCTTT CATCGCAATA AGATGATCTG AGAAACGAAT 4380 

CX5CTTGGTTG ATATCATGAA GAACCATGAC AATTGTACAA CCTTGTTCCT GATTTAGCTT 444 0 

CTGAACTAAT TCTAGTATTT CTAATTGATG ACAGATATCT AAATATGTTG TTGGTTCGTC 4500 

TAAAAAGATA ATATCAGTTC TTTGTGCTAA TGCCATTGCA ATCCAAACAC GTTGTCTTTG 4 560 

55 



1148 

BNSOOCID: <EP 07SfiS10A9 I ^ 



w 



IS 



EP0 786 519 A2 

TGCCCAATCA ATtTCTTTCT TAtCCTCAGC AgTTAATCTA CCAAATCCTT TTTGATGTGG 4680 

AAAACGACCA TATGAAACTA ATTCCCCAAC AGTTAAGCCA TCTGCTACTT CaGGTGaTTG 4 740 

aGGTAAAaTG GCtATTTTTt TGcAATCyCy TTCGTAGATt GTGtATGAAT ATTTTCACcA 4 800 

TCTAAAAATA CTTCGCCTTC TTTAACTGCC AATAAACGTG ACAATGCCTT TAGCAAAGTA 4860 

GATTTCCCGC AGCCGTTAGG ACCAATGATT GACGTCACTT TGCCATCTGG TATTTCAACA 4 920 

TCTAATTTAT TTATAATCGT GTTATCCCCG TAACCAATTT TAACTTGTTG TCCATGCAAA 4 980 

CGATTCATAA TTTCCCTACT TTCAATAAAA TTCTTTCTGT TTATAAAAAA TAATTTCTAT 504 0 

TTTTAAATTA TCAATTTTCA AAGACATCCC AATTGATAAT GATTATCATG AACATCATTA 5100 

TAACATTTTT CAATCTTATT GACTAACATT ACTTTTTAAA TTGGATAGCT CGATTTGTCA 5160 

TGTCTTGTAT ATTACTTTTA TAAAATAAAA AACGCCCACA GATAAGTCTT CATAGTTCAA 5220 

20 AAACTTGTCC GTGGACTTCT ATTTAAGTAT GTGTGCTCAT ACCATTTATT TATTCATCTG 52 80 

CAAGAAAGCC ATTACCATAG ACATCTCTTA CATCATGAAT TACGAGGAAT GCATCTTTAT 534 0 

CGATTTGTTT AATTAATCGC TTTGCTTTTG AAACTTGTGT TTTAGAAATA ACAACGTATA 54 00- 

AGACATCTTT TTCTTCACGC GTATAATAGC CATGTCCGTT TAAAATGGTT AAACCTCTTC 54 60 

CAATTTGCTC GTCTATTGCT TTGGCAAGTT TGTCXSGGATT AGTTGAAATA ATCGTCATAG 5520 

CTTTTTTAGT GTTTAAACCT TCTATGACAT ATTCCATCAC TTTTGTTCCT ATATAAAGTG 5580 

ATATTACTGT TACTAATACT TTATCAAGTG GAATAACTGT AAGTGAAATT GCAACAACGA 564 0 

TCATATCGAA GAAAAGCAAA GCATATGGCG TGCTTACATC GAGGTATTTC GTTGCAATTC 5700 

TCGCCAAAAT TGTrGTACCT GCTGTTGTAC CGCCTGCAAG GATAATTACT CCGATTCCTA 5760 

GTCCAACGCT TACACCACCA AAAATGGCAT TCACAATGCT GTTTCCAGTT TCTACTTGCC 5820 

ATGaStCTGT TAAACTCAAA AATATTGAAA TAAGAATTGT TACAAGAATA GTTAAGTACA 5880 

40 TACTTCTCTT ACTCAAAAAt TTATAAcCTA TGGCAATCAA TACTGCGTTG ACCAAGAAGT 5940 

TAGTGATGGC TGGTGAAATA TGAAACGCAT AATATAAAAT AATTGCTAAA. CCTGTAACCC 6000 

CX3CCTTCACC TAAGTTACCA GAAAT^ATAA ATGCATTTAC ACCTGCAGCA AAGATAAATG 6060 

AACCTAAGAC AACTAGTATT AAATCTTTAA CCGTTTTATT CACGAAACCA TCCCCTTTAT 6120 

ATATTTATTA GACTAT 6136 
(2) INFORMATION FOR SEQ ID NO: 308: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 
GATATCGTAS CTACTGAAAA GTCATCACCA CCATGGAATG ATITCTTTAA ACGTTTTAGC 
TTCTATGCAA TTGCAATTCA ATACTTTGTT GTACAATTTA TCATTACATT ATTCTTAATT 
TGGTTACCGA CGTATTTAAC AGAAGTATTC CACGTTAACT TTAAAGAAAT GAGCATTAGT 
TCAtTACCTT GGTTATTAAT GTTCTTCTTA ATCTTATCAG CAGGTGCAAT TTCTGACCGT 
GTATTAGGAT TAGGTCGTTC AAAATTCGTA GCTAGAGGTG TAATTGCAAT TGCAGGATTT 
ATTGTGTTTG CAGTTTCAAT TATCTTTGCT GTACGCACAG GAAATTTATA TGT AAGTATT 
TTCTGGTTAT CACTAGGTCT TGGTGGTATC GGTATTTCAA TGGGTATGAG TTGGGCTGCA 
GCAACTGACT TAGGACGTAA CTTCTCTGGT ACAGTATCAG GGTGGATGAA CTTATGGGGT 
AATATAGGTG CATTAATCAG TCCGCTATTA GCAGGTCTAT TCGTAGAACA TTTGGGTTGG 
ACAATGACAT TCCAATTGTT AATCGTTCCA GCAGTAATCG CTGTGATTAT GTGGTTCTAT 
GTGAAACCAG ATCAACCTTT AATTGTTAGT GATGATAAAG CAATAGAAAA ATAATTTAAA 
CAAGCAGTAA GCTTTCACAT AGTTGGGGCT TATTGCTTTT TTTGCGTTGA AATTGAAACT 
TTTTAAAACA GATATGGTTT AAGATGAAAA TGAAGTTATT GAAATGATAT ATGTAAAGAA 
ATAAGGTTTT AAAACATTAG TCAGGTAACG CTTGTaAAAG TACATATAAA TTTTAACTAG 
CGCAAAGGTG GGCGACCAAA GtTcAACGAT GTTAAATAAC aTTAGrAATT AATTTTAATT 
GGACTTTAAA AGTTTTTAAA TTTAGATAAT TGAGCATAAG GTGTTATAAT GACATATGTT 
GCGTAATTAA AATTTATAGC AACAAATTCA TTTTAACTAT GCTAATAAAA AGATTATGGA 
AATATTTTGA CAAGGAAAGG AGAAGTCGAA ATGACATCTT TTTGACATCA CTCATAAAAA 
TCAACCGACT TAACTTAGAC TTTTATAAAG GTGTAAGACA GGGACTGTTA ATGATTATTC 
CTGCAATAAT CGGTTACTTA TGTGGTAATT TCCAATTTGG ATTATTAGTT GCAACCGGAA 
CACTAGCCCA TATTTATGTT TTTAAAGGTC CGTCGCGATC TAAGCTGCGA ACTGTAATAA 
TTTGTAATTT AGCGTTTGCA ATATGTATGA TGCTTGGTAC GCTAACAGCC AAAACGCCAC 
TCGTTTTTGG AATGACATTA TTAATTGTTA CGGTTATACC ATTTTATATA TTTACTGCCT 
TAAAAATAGC TGGACCGTCA TCGACATTCT TCATTGTGAC ATTCAGTCTA CCCATTAACT 
TACCTATAGC TCCCGAAGAA GCATTATATA GAGGCTTTGC GATTTTAGTA GGCGGTATAC 
TTGCCACTAT GATGGTGTTA ATCACGATCG TATTTTCTAA AAACAAAGCT GAAGAACAAG 
CAATTCAAAA TGATTTTAAA CTCATATCTA AGTTGTTACA CACTTATAAT GATAAATCTG 
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TCACTTCTAC TTCAAGTAAC GATAAATTAA GTAGACGTTT 
ACACATCTGC CCAAGGGATT TATTCTGAAC TGTTAGAGTT 
^ CATTGCCAGA TGAGTTAATT GAAATGATGG ATCATATCAT 

AGGAAAATGT AAGATATTGG CGAAAAGAAG TGACAGTAAC 
TCAACCATAT ATTGAAAATT GATGAAATGG TGCATGCAAA 

10 

AAGCAGACAT GCGAAAACCT TTATATAGTA AACGCATTTA 
CtATkGTTTT TAGAAATACA TTGAGATATA CAGCGATTAT 
CGTTAATGTT TGATTTTGAA AAAGCATACT GGATACCGTT 

75 

TAGGAACATC AACTATACAT GCAATCGAGA GAGGTATGGC 
TAGGTGTGTT AGTACTTTCA GTCATATTGT TGTTTTCAAT 

20 TTTTAATGGG CATTGCAGCA TTGTTTACTG AAGCATTGGT 

CAGTAGTTTT TATTACAATA CAAGTTATTT TAATGAACGG 
CAATTAACAT TGCGTTTCCA AGAGTTATTG ACGTTGCAAT 

25 TAGGTTTATT TGTCCTTGGA CAACGTACCG CATCCGCATT 

AAGTTGTTCG TAAAGAAGCA ACGCTCTTTC ATTATTTATT 
(2) INFORMATION FOR SEQ ID NO: 309: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 



^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
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CAAGCACATT 


ACGATGGTCT 


AATACTTTGC 


TAATAATTTC 


TTCTTGTCCT 


GGTCGAAACG 


60 


TTTCATACCC 


AAAGTAATGC 


GATAATGTTT 


GTTGCATCAT 


AAATTGACCC 


CTTATTGTTG 


120 


TTCTTTAATT 


TCTTCTAACT 


CACTCCATCT 


TGTGATGTCT 


AAATCATATT 


GAATTTCAAG 


130 


TTGTTCTTTT 


TCTTCGTTTA 


ATTCTTTAAT 


TTTCCCATAA 


TCTGCACTTG 


CCTCAATCAT 


240 


GAGCACATCA 


ATTTCTTCCA 


TTCTTACTTC 


CGCTTGTTCT 


ATGCGTTTCA 


TCAATTGTTC 


300 


ATATTCTAAT 


TTTTCTTTAT 


ATGATAAACC 


ATTTTTCTTA 


CGTACAGTTG 


TAGAAGATTT 


360 


AGATTGTTGC 


TTCAATGTGG 


ATTTATTTTT 


ATCTAATGAT 


TTTTTATAAC 


TTTCATAATC 


420 


TTCAAAAGTT 


CCGATAATCT 


TTTCCATCTG 


ACCATCATGA 


ATAAACCAAT 


ATGACTGTGC 


480 
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CCAAAAATTA TTATTATTAC 1740 

GAACGCTAAA CAAATTCGAC 1800 

TGCACAACTA GATAATAGTG 1860 

AGAGGAATTT CAAAATTTAT 1920 

TGAAGCGCGT ATTGCGTATG 1980 

TCaAAATTTA ACaTTAGACT 204 0 

GATGATAGCG ATATTTATTG 2100 

ATCTGCACAT ACAATATTAC 2160 

ACGAG G TTTA GGTACTATTT 2220 

ACCAACACCT GTTGCAGTAA 2280 

GGGAGCAAAT TATGCGATTG 2 34 0 

ATTAGCATCA CAGAATTTAA 24 00 

GGGTATTGTG ATTGCAATCA 24 60 

GCTTCCTAAT GTAATGGCTG 2520 

TTCTGAAAAT CAATAT 2576 
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AATATAATCT TCAAGTATTG TTAAAGTCTC AGTATCTAAA TCATTTGTCG GTTCATCTAA 600 
CAACAGAACA TTTGGCTGGT GTACGAGTAG ACGTAATAAA TACAAACGCT TTTGCTCTCC 660 
ACCAGATA 

(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 
CATCAGTTTA TTTTGAAAGG CAATGCGATC ATTTCATGTA TTTATGTTGT TTGAAACATC 60 

GTTAGATAAC AATAGTGATA TTGCACATTT TAAGCTGAAG ATGGAAAAAT ATTCGATAAT 120 

CAAATAAACA ATGAATTTTA GAAGGTACAA TGACGTTCAC TAATTTAAAT ATAGCTGAAT 18 0 

GTGTTGGTGA GTGATGTTCA CTATAGATTT ATATTAATAT ACAAAGACAA AGGTTGTTAA 240 

TTTTTATTAA GCGTTAGGTT GAATGTATGA GAATTTTAGA TTTATAATAG AAGATAGAAA 3 00 

CGAAAATTTT TCTTAAAAGC AGTAATGTTG ACTCAAAATA AGCTATAATA ATGACACTTA 3 60 

TTTAATTGAT TAACATTTGC TAATAAATAT CAATATAGAA TATAACTTTC CAATAATGAC 4 20 

TGAGAAAATC GAAATGTCAG TCTCGAATCA TATAATTAGA AAATTGATTA TTTTCTGTCA 4 80 

ATTTAGGGTT GAACTATACA TATGATATTG TTAGAATATT TTTTAACATT ATATTTTATT 540 

GCTTTAAAGT GGAATATACT TGAAATAATT AGTAGAGGTG AGTAAGGATG AGTAATAAAT 600 

TAGAATCATA CAGAAGTGAG ATTGTATCAC TGAATCATCA AATTTTAGAC TTATTATCTA 660 

AACQTGGTGA ACTAGCACAA AAAATTGGGG AAGAAAAATT AAAACAAGGT ACACGTATTT 720 

ATGATCCACA ACGTGAAAAA GAAATGCTTA ACGACTTAAT CGATAGTAAC AAAGGACCAT 780 

TCAACGAT7UV TACTATTAAQ CAATTATTTA AAGAAATTTT CAAAGCCTCT ACAGATTTAC 840 

AAAAATCTGA AAATGAAAAA CATTTATATG TATCACGTAA GTTGAAACCT GAAGATACGA 900 

TTGTAACATT TGATAATGGG GGCATTATAG GAGACGGCAA TAAATCATTT GTATTTGGGC 960 

CATGTTCAGT TGAATCATTT GAACAAGTTG AAGCTGTTGC TAAAAACTTA CATGCTAAAG 1020 

GTGAAAAATT TATTCGTGGC GGTGCATTTA AACCACGTAC ATCACCATAT GATTTCCAAG 1080 

GCCTAGGTGT TGAAGGACTT AAAATACTTA AACAGATTAA AGATAAATAT GATTTAAATG 1140 

TTGTCAGOGA AATCGTAAAT CCAAATGATT TTGAAGTGGC TGATGAGTAT TTAGAOGTAT 1200 
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AAAAGCCTAT TCTATTAAAA CGTGGTTTAT 
CTGAATACAT TGCTTCACAA GGTAATCAAA 
^ CTTATGAAAA GGCGACACGT AACACTTTAG 

GTACACACTT ACCAGTCATG GTAGATGTTA 
TACCAACTGC GAArsAgCAT TAGCAGTTGG 

w 

AGATCCATCT GTTGCACTTA GTGATGCGGG 
ATTTTATGAT GAATTAAAGC CTTTAGCTGA 
TTCCAAGGAA ACTATAGACT ACTTAACTAA 
GAATCGACAA TTTTAAACTT ACAGCCATTC 
ATTTTTTGTA TAGAAATAAA GGTATACTGC 

20 TGTTCAATTT CACAATTATT AAAGATTATG 

GAATTATCTG TAAAATATAA TAGATTCACA 
ACAAATGCAA TTGATATTTG AAAAGGCTTT 

25 GTTTATATAA AATTAAAATA TATCGTTCGT 

TTTTCGCTAA ATTTTTCCAA ACAAAAATAT 
GAAAATGAAA TGAAAACATG TTATTATAAT 

30 

ACAGTTACTA TATATGATGT AGCAAGAGAA 
GTTGTTAATG GGAACCAAAA TGTTAAAGCA 
AAGCGTTTGA ATTATCGTCC AAATGCTGTT 

35 

ACAGTAGGTG TGATCATTCC AGATATATCT 
CTT<»AGATA TTGCAACAAT GTATAAATAT 
40 CCTGAAAAGG AAAAAGAAAT TTTTAATAAC 

TTCCTTGGTG GTACAATTAC TOAAGAAATG 
GTAGTAGTAT CAGGAACAAA TGGTAAGGAT 
ACTGAAGCTG CGAAAGAAAT TACGGGAGAA 
TTAGTAGGTG GAGAACATTC TAAAAAAGCT 
GTGTTAAATA AAAATGGCCT TCAATTAGGT 

50 

TATAAAGAAG GCX5TAAAAGC TTTTGCCAAA 
TGTATCAGCG ACGAAGAAGC AATTGGTATT 

55 
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CTGCTACAAT CGAAGAGTTT GTTTATGCAG 1320 

ACATTATTTT ATGTGAACGT GGAATCCGAA 1380 

ATATTTCAGC AGTACCAATT TTAAAACAAG 1440 

CGCATAGTAC AGGTCGTAAA GATATCATGT 1500 

TGCTGATGGA GTTATGGCTG AGGTGCATCC 1560 

TCAACAAATG GATTTAGATG AATTCCAAGC 1620 

TTTATATAAC GCTAAAAAGT TAAAATAATA 1680 

TATGTCATGT TGAAGTAGAA TATTATCTTT 1740 

TAAGAGTATA TTACTTTTAG AGTGGCTATT 1800 

ACTTAACGAT TGTTATAATA CTTCGACACT 1860 

ACTGATAGCA GTAATTAAAA TTATAACTAT 1920 

CATTTGTTGC TGAAATGTGA ACATTTTTCA 1980 

CTCAAAACAT TACAATTAAA AATGAAAAAA 2040 

TATCATTTAG CGTTTGTTTT TATTTCAAGC 2100 

GTTACTGTAA ATTAAAATAT GGTAAACTAT 2160 

GAATAAAACG TTTACAAGGA GGAAATTATG 2220 

GCGCGTGTCT CTATGGCCAC AGTGTCX5CGT 2280 

GAAACTAAAA ATAAAGTTAA CGAAGTCATT 2340 

GCTAGAGGTT TAGCTAGTAA AAAGACAACA 2400 

AATATCTATT ATTCACAACT TGCTCGTGGA 2460 

CACTCAATTA TTTCAAATTC AGATAACQAT 2520 

TTATTAAGTA AACAGGTTGA TGGTATTATT 2580 

AAAGAATTGA TAAATCAATC ATCTGTACCT 2640 

GCACATATAG CATCAGTTAA TATTGATTTT 2700 

TTAATTGAAA AAGGCGCTAA ATCATTTGCT 2760 

CAAGAAGATG TTTTAGAAGG TTTAACTGAA 2820 

GATACATTGA ATTGTTCTGG TGCTGAAAGT 2880 

ATGAAAGGCA ATTTGCCAGA TGCCATTTTA 2940 

ATGCATAGTG CAATGGATGC TGGTATTAAA 3000 



NSOOCID: <EP 07865 19A2_I_> 



1153 



10 



25 



30 



35 



45 



SO 



EP0 786 519 A2 

CCACAACTTT CTAGTGTTAT TCAACCATTA TATGATATCG GTGCAGTAGG GATGCGCTTA ' 3120 

TTAACAAAAT ATATGAACGA TGAAAAGATA GAAGAACCAA ATGTAGTTTT ACCTCACAGA 3180 

ATTGAATACC GAGGAACTAC AAAATAAATT CACAAAATTA GGCATTCATC TAACGACCCA 324 0 

AATTATATGG GTGTTGGAAG AATGCCTTTT ATTTATCTTT TAAAATCGTT GCAGATTAGG 33 00 

TTACTTATTG ACGAGTAGAT TCGTACCAAC TCGCTATATG TAAAGCTAAT TTTTTATTTT 3 360 

TTTCACTAAT TTCTTTTGTG CGGGGGACAT AGGTATAATC ATTTAAACGA TCTTCCCATC 34 2 0 

TTTTAGGTAA TAATTCAGAT GAATAATGTT TCCATTTATT AATCCATTCT AACGGTAAAT 34 80 

^5 AACCACTTTG AATTGGTTGA TCAATTAAAC TTAAGAATAC ATGACTCCAT GCACGTGGTA 3540 

CGACTCTCCA AATATTGTAG CCTCCGCCAC CAAACATAAT TACCTTTCCA TTCGTATAAG 3600 

AATCAGCTAA ATATTTTACA AAATATGGAA TTTCATATAA TGAATGTAAC GTACAATTTA 3 66 0 

20 GATGAGTTAG TGGATCACGA TAATGTATAT CGACACCATT TACGCTTAGA ATAATATCAG 3 72 0 

GTTTAAAACT CTTTACGACA GGCTCAACTG TTAATTTAAA ACACTCCAAA AATGATGCAT 3 780 

CTTCTGTATA CGGTTCAAGT GGGACATTTA CAGTGTGTCC ATAGCCGATA TCTTCACCGC 3 84 0 

GCTCAGTATA GTGACCAGAG CCTGGGAAAA GAAATTTTCC GGTTTCATGG ATAGAATAAG 3 900 

TAGTAACATG GTTATCGGCA TAGAAACTCC ATTGTGTACC ATCTCCATGA TGTGCATCGG 3 960 

TATCTATGAT TAAAACGCGT TGATTGTATT CTTTAGCTAA GTATTGTGCG GTAATTGCAA 4 020 

TATCATTGTA TATACAAAAA CCACTTGCTC GACGAGGTTG AGCGTGATGC AAACCACCAC 4 08 0 

CTAAGTGACA ACCATTTAAT ACTTTGCCTG ACATAATAAG ATCTGCTAAA GTTAAAGCGC 414 0 

CTCCAACAAT TGTGGCACTA TGGCGGTGCA TATGCTTAAA TTGACCATTC TCTTCATCAT 4200 

TTAATCCATA TTTCTTAGCC TCATCTTCAC TGATAATGCC ATGTCAAGCA TGCTTAATAG 4260 

CTTCGAOGTA ATCATATTTA TGAATTAACA TTAATTCGTC ATCTCTTGCA ATTCTAGGTT 4320 

40 GTACTATTTG TTCTGGAGAC AATAAATTTG CATTCAAAAG TAGCTCTGTT GTTAATTTTA 4380 

AACGCAT3TC ATTGAAGGGA TGTTGGTCAT GAAATCGATA TTGTAATAAC TTATCTGAAT 4440 

AAACATATGC AGTTTTTGAT GAATGTTGTT GCATATAATC CCTCCXSATAT TCCAAAAATT 4S00 

AAAAGAAAAA CCGATTCATA TAACGAATAT CATCAAACGC TTGTTGCTGT TCTAATGTAA 4 560 

TGTTTTTGCC AATTCTTGCC ATTAAACAAT TAGCTGGATG ACTTGTTATT TCTGGATCAT 4620 

CTGTAGCGAA TATTTCAAGT CCACCAGTTG CCATTAACCG CTGCATTAAT TTTTTATAGT 4 680 

CAAATACATC TAACTTTGAA TTTTTTAAAT CCCAATGCCA GTAATATTCT GTAGTTATAA 4 740 

CGATATAATT CTCXSAATTCT GGTGTAGAAA GGCTAAGTTG TATCAGCTTT TCTGCAAGTT 4 800 
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TACCTGTAGA CCAGCGCTCA ATTTCATCAG GATAGTGGAA AGTGACATAA CCCACAATGA 4 920 

GTTGATCTTG GCGAGCAACA TAAATTCTAC CTTCTGGTAA TGTTGTAATT TCTAACAAAG 4 980 

CTTTATACTG ATCTTCAGCA TCTCTAAATG CGGTTAAATG CGCATCGAAA GTAAGCGCTT 5040 

TCAAATCTtC GTGTGTTAAA GGACCTTCAA TAACAAATTG CTTGTCATGA ATGTAATAAT 5100 

CTTCGGATTG ATACGTCTTT AAATGATTCA TATTTTCAAC TCCTCAATCG ACGTCGTGAT 5160 

TGTATTAAAT TCATTATATA GAAaATTTAC AATAATTAAT ACTAGAAAAA GGATAAAAGT 5220 

AAAAATTTTG AATAATTAGA AATGTTATGT ATAATATTGA GAAAGAAAGC GTTTTCACAT 5280 

AACAAAGGGG GAGTTTCAAA TGAAAGTCGA AGTTTATAAA GGAGCGCAAG GTAAACATAA 5340 

CCTTAAAGAT TATGAAGAAA CATATAATAC TTTTGATTGG AAAGACGTAG AACAAGCATT 54 00 

TTCTTGGAGT GAAACTGGAA AAATGAACAT GGCATATGAA TGCATAGATC GCCATGTAGA 5460 

20 TCAAGGATTA GGGGATAAAA TAGCGTTAAA TTACAAAGAT GAGCACAGAA AAGAATCGTA 5520 

TACTTATAAA GATATGCAAC GGTTATCTAA TAAAGCAGCG AATGTTTTGT CTGAACATGC 55 BO 

AGAAGTTGAC AAAGGTGACA GAGTATTTAT ATTTATGTCG CGTACACCTG AACTATATTT 5640 

TGCGTTGTTA GGTGTTTTAA AAATTGGTGC AATTGTTGGG CCGTTATPTG AAGCATTTAT 5700 

GGAAAAGGCA GTTGCGGATA GATTAGAGAA CAGTGAAGCT AAAGTGTTAA TTACTAATAA 5760 

GGCATTGTTA CCTCGAGTAC CTGTAGATAA ATTACCAAAC TTGAAAAAAA TTGTTGTCGT 5820 

AGATGAGGAT GTAGAAGACA ATTACATAGA CTTCATTAGT TTGATGGAAA CTGCTAGCGA 5880 

TGAATTTGAC ATTGAATGGT TAAAGTCGGA TGATGGTTTG ATTTTACATT ATACATCAGG 5940 

TTCTACTGGG CAACCTAAAG GtGTATTGCA TGTTCAACAA GCAATGTTAG TGCACTATAT 6000 

TTCTGGAAAA TATGTATTAG ATTTACAAGA AGATGATGTT TATTGGTGTA CAGCAGATCC 6060 

AGGrrGGGTT aCAGGAACAT CTTATGGTAT TTTTGCACCA TGGTTAAATG GCGCTACAAA 6120 

40 TTGTATAGCT GGTGGTCGCT TTTCGCCAGA ACW3TGGTAT AGTATGATTG AAGATTTTAA 6180 

AGTGACGATT TGGTATACGG CACCAACAGC TTTAAGAATG TTAATGAGTG CTGGTGACGA 6240 

TATTGTTGAG AAATATGACT TGTCATCGTT ACGTTCGATT CTATCAGTAG GTGAGCCTTT 63 00 

AAATCCTGAA GTTATAAAAT GGGCGAAAAA AGTATACGGT TTAAOGGTGT TAGATACTTG 6360 

GTGGATGACA GAAACAGGTG GACATATGAT TGTTAACTAT CCAACGATGG ACGTcAAGCT 6420 

TGGCTCAATG GGCAAACCAT TACCTGGTAT TCAAGCTGCA ATTATCGATG ATGCAGGGAA 64 80 

TGAATTACCA CCAAATCGAA TGGGCAACCT TGCTATAAAA AAAGOCTGOC CATCAATGAT 654 0 

GTATCGTATC TGGAAGAATC CAGAAAAATA TAAATCATAT TTTATTGGAG ACTGGTATGT 6600 
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TGATGTAATT ATGACAGCTG GTGAACGAGT TGGACCATTT GAGGTTGAGT CTAAATTGGT 6720 

TGAACACGAA GCAGTTGCCG AAGCAGGAAT TATTGGTAAA CCTGATCCGG TTCGCGGTGA 6780 

5 

AATAATTAAG GCGTTTGTTG CACTGAGAAA AGGATATCAA CCAACAGACG AATTAAAAGA 6340 

AGAAATTCGT ATATTTGTTA AAGAAGGTTT GTCGGCACAT GCAGCACCAC GTGAAATCGA 6 900 

ATTTAAAGAT AAATTACCTA AAACACGGTC AGGTAAAATT ATGAGACGTG TATTAAAAGC 6960 

TTGGGAATTA AATTTAGATG CTGGGGATTT AAGTACAATG GAATAATGAC ATGAATGTTA 7020 

TTGAAGATTT TTTTCGAAGA ATAAAGGGTG ACAACATATT TCATGTCAAT GTTTAAATAA 7080 

75 TCGTTTACTT TACGATAAGC AATATAAAGA ACTGTTAACT TGTGTCATAT CATTTCGTAG 7140 

AAAGCATTTG AAAATGATGA CATAACAATA ATGGCATATC TTTATATTGC TTTTTATTTT 7200 

TAATATGATC TTTGGAAGAT GATTATTTTA AATAATAGAA AAATATAGTT ATCAATAGTA 7260 
TCAAGCGCTA AAAGTTGTAT AATACAAAAC TTTAATAAGT GAATTTATTG CAAAAATGAA 
AGCGCTAACC CGATTTAGTC GACAAGTTTT TAACAGTTCG TTATTATATG AATGTAAGTA 
AAAATTTCTT AGCTACAACT TACATATTAT AAATGCATAA ATTAAACAAA AAGGGGCGAA 
AAAAGTTGAC TCATTTATCA GATTTAGATA TTGCGAATCA ATCAACACTA CAACCAATTA 
AGGATATTGC TGCATCAGTA GGTATTTCAG AGGATGCATT AGAACCTTAT GGTCATTACA 

AAGCTAAAAT CGACATTAAT AAAATTACGC CAAGAGAAAA CAAAGGGAAA GTTGTTTTAG 7620 

TAACTGCGAT GAGCCCAACA CCAGCTGGTG AAGGTAAATC AACGGTTACA GTTGGTTTAG 7630 

CTGATGCATT CCATGAGTTA AATAAAAACG TTATGGTTGC ATTAAGAGAG CCTGCTTTAG 774 0 

35 GACCAACATT TGGTATCAAA GGTGGTGCGA CTGGTGGTGG TTATGCGCAA GTCTTACCTA 7800 

TGGAAGATAT CAACTTACAT TTCAACGGAG ATTTCCATGC GATTACAACT GCAAATAATG 7360 

CATtSTCTGC GTTTATCGAT AATCATATTC ACCAAGGTAA CGAATTAGGA ATCGATCAAA 7920 

GACGTATTCSA GTGGAAACGT GTATTAGATA TGAATGATCG TGCACTTAGA CATGTAAACO 7980 

TTGGGTTAdG TGGACCTACA AATGGTGTAC CACGTGAAGA TGGCTTTAAT ATTACAGTAG 804 0 

CGTCTGAAAT TATGGCGATT TTATGTTTAA GTAGAAGTAT TAAAGACTTA AAAGATAAAA 8100 

TTAGTCGTAT TACTATTGGT TACACTAGAG ATCGCAAGCC AGTTACAGTT GCAGATTTAA 8160 

AAGTGGAAGG TGCACTTGCA ATGATTTTAA AAGATGCAAT AAAACCAAAC TTAGTACAAT 8220 

CAATTGAAGG GACACCTGCA TTAGTTCATG GTGGACCATT TGCGAATATC GCACACGGTT 32 80 

GTAACTCAAT TTTAGCAACT GAAACAGCAC GTGATTTAGC TGATATCGTT GTAACGGAAG 834 0 

CTGGATTTGG TTCAGACTTA GGCGCTGAAA AATTCATGGA CATTAAAGCG CGTGAAGCAG 8400 
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7320 
7380 
7440 
7500 
7560 
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GTGTAGCGAA 
ATTTAGAGCG 
ATGCATTTAT 
ATAACGTACG 
TAGCAAATGA 
TAGAATTACC 
AAGTAACGTT 
ATAATTACCC 
TAGGTGCACC 
GATTTATCGT 
CAGCATTAAA 
TGTCAATTGT 
TATTTCAGCA 
ACAATGTTTG 
GAAAGCGATG 
AATTGGGATG 
ATAATCTTAT 
ATTCTATTAG 
TTACTTCTCA 
AAAATCAAAA 
CTGctACGCA 
ATGCAATTAA 
TCAATTTCCA 
AAGATCCAGC 
CTGCTTCAAC 
TTGTATCATA 
GCACACAAGA 
ATGATTATAC 
CAGATACAAA 



AGATAATTTA AAAGAAGAAA 
TCATGTTAAT AATATTAAAA 
ACATGATACC GATGCAGAAG 
AATTGCCTTA ACTGAAGTTT 
AGTATTAGAA GTCATTGATC 
ATTAGAGCAA AAGATTGAAA 
TAGCAGTAAA GCGCAAAAAC 
AGTATGTATG GCGAAAACAC 
ATCAGGATTT GAAATTACAA 
AGCGTTGACA GGTGCAATCA 
CATGGATGTT ACTGATGATG 
TTAATAAAGA TAAGTAAATA 
GTTCGGTTTT TTGCTGTGCT 
AAGGTGGTTA ATTAATGTAT 
CAATGAATAT TCCATATTAA 
CGCTACAATT AACACTAATA 
AGGAGTTGTT AACAACATGA 
AAAATCAACT CTAGGCGTTG 
ACATCAAGCA CAAGCAGCAG 
TAATAATGCA ACTACAACTC 
ACCAGCAAAC ACTGCX^AAAA 
AGATCCTGCA TTAGAAAATA 
GTTATTAGAT AAAAACAATG 
AGATGTGTAT TACACTAAAA 
ATGGAAGAAG TTTGAAGTCT 
TAGTCCTGTA CCAGAAGACC 
ATTGAAAATT GTTTCTTCGA 
TAAATTAGTA TTTGCTAAAC 
TGATGCAGTA QTAACGAATG 



ATGTAGAAGC AGTAAAAGCA GGAATTGTTA 8520 

AATTCGGTGT AGAACCGGTT GTTGCAATTA 8580 

TAGAATATGT AAAATCTTGG GCTAAAGAAA 864 0 

GGGAAAAAGG TGGTAAAGGT GGCGTTGACT 8700 

AACCTAATTC ATTTAAACCT TTATATGAAT 8760 

AGATTGTGAC TGAAATCTAT GGCGGTTCAA 8820 

AATTAAAACA ATTTAAAGAA AATGGTTGGG 8880 

AATATTCATT CTCAGATGAT CAAACGTTGT 8940 

TTCGTGAATT AGAAGCGAAA ACAGGTGCAG 9000 

TGACTATGCC TGGTTTACCT AAAAAACCAG 9060 

GTCATGCAAT TGGGTTATTC TAATAAATCA 9120 

GTTTAATAGA CCGGACTGTT GGAGATGCAT 9180 

AAAAATAGAT TCAATTTGGC GAATCTAACG 924 a 

ATGAAGATAA AAAGTGGGCT TGAAGAATAG 9300 

AAAAAATTAA TAAAATAGGT TGCAATATTT 9360 

ATTGATATTG ATAATTATTA TCAATTAAAT 94 20 

ACAAACATCA CCCAAAATTA AGGTCTTTCT 94 80 

CATCGGTCAT TGTCAGTACA CTATTTTTAA 954 0 

AAAATACAAA TACTTCAGAT AAAATCTCGG 9600 

AGCCACCTAA GGATACAAAT CAAACACAAC 9660 

ACTATCCTGC AGCGGATGAA TCACTTAAAG 9720 

AAGAACATGA TATAGGTCCA AGAGAACAAG 9780 

AAACGCAGTA CTATCACTTT TTCAGCATCA 9840 

AGAAAGCAGA AGTTGAATTA GACATCAATA 9900 

ATGAAAACAA TCAAAAATTG CCAGTGAGAC 9960 

ATGCCTATAT TCGATTCCCA GTTTCAGATG 10020 

CTCAAATTGA TGATGGAOAA GAAACAAATT 100 80 

CTATTTATAA CGATCCTTCA CTTGTAAAAT 1014 0 

ATCAATCAAG TTCACTCGCA AGTAATCAAA 10200 
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AGGCAACGAC CAATATGAGT 
CGTCAAGCCA ACCAGCTCAT 
AGTCAAGTAA TCAGTCGGAT 
ATGCAATTAA AAACCCGGCT 
TTGATTTTCA AATGAAAAAT 
TTGAACCAGC AACTGTCATT 
CAGCTTCAAC ATGGAAGAAA 
TAGTATCATA TGATTCTGAT 
CGAGAGAAGT TAAAATTGTG 
ATTATACGCT AATGGTCTTT 
AAGAAACATA CAATTTACAA 
GACAAGTTTA TGAATTAGAA 
ATAAAAAGAA ATTAGATCAA 
CGGAATTTGA AAATGTtACA 
TTGTTGTTTT TGAAAGTGAA 
CATTCTATAC AGCAACTTTA 
GTTACTGGAA AGATTTAATT 
AAAATAATTC TAGAACGCTG 
TTGTTAAAGT CGTTGTGGCA 
ATCAGGATAT CAATACAAAA 
ATGTACAAAC AGGACAAGAA 
CTGCAACAAA TCCTAAAGAT 
TGGTTAAAGA TGCTGATAAT 
CCGATATGTC GGATAATAAT 
TTGCCAAAGA TACTGATAGA 
CTAATGTCGA TACTGATAAA 
ATATTGCCGA TAAAAATAAT 
ATTATAATAA TACAGACAAA 
TTCATAAAAC TGTAGATAAA 



CAACCTGCAC 
GAAACAAATT 
GTTAATCAAC 
ATCATCGATA 
GATAAAGGTG 
TTTACAAAAA 
TTTGAAGTTT 
AAAGATTATG 
TCATCTATTG 
GCACAGCCTA 
AAATTATTAG 
AAATTACAAG 
ACTAGAGTAG 
CCTACAAATG 
GAAAATAGTG 
AATGGTCAAA 
GTAGAAGGTA 
ATTTTCCCAT 
AACATTGGTT 
GATGATGATA 
GGTAAGGTTG 
GCGTCTGATA 
AATATTGATA 
CACTTCGATA 
AATGTGGATA 
GACTCTAATA 
CATACTGGAA 
GTTACTGACA 
ACAGTGAAAA 



AACCAAAATC 
CTAATGGTAA 
AGTATCCACC 
AaGAACATAC 
AAAGACAGTT 
CAGGACCAAT 
ATGAAGGTGA 
CCTATATTCG 
AATATGGTGA 
TTACTAATAA 
CTCCGTATCA 
AGAAATTGCC 
AGTTAGCTGA 
ATCAATTAAC 
AGTCAGTTAT 
AATATGTAGT 
AACGTGTCAC 
ATATACCTGA 
ATGAAGGTGA 
CATCACAAAA 
CTGATACAGA 
AAGCAGATGT 
AAGATGTGCA 
AATATGATTT 
AAGATGCCGA 
AAAATAAAGA 
AAGCAGCAAA 
AAAAAACAAC 
CAAAAGAAAA 



GTCAACGAAT 
TACTAACGAT 
AGCAGATGAA 
AGCTGATAAT 
CTATCATTAT 
AATTGAATTA 
CAAAAAGTTA 
TTTCCCAGTA 
GAACATCCAT 
CCCAGACGAC 
CAAAGCTAAA 
AGAAAAATAT 
TCAAGTTAAA 
AGATTTACAA 
GGACGGCTTT 
GATGAAAACA 
TACTGTTTCT 
CAAAGCAGTT 
ATATCATGTC 
TAACACGAGT 
TGTAGCTGAA 
GATAGAACCA 
ACATGATGTT 
AAAAGAAATG 
TAATAGCGTT 
CAAAGTCATA 
GCTTGACGTA 
TGAACATCTG 
AGCCGGCACA 



GCAGATCAAG 
AAAACGAATG 
TCACTACAAG 
TGGCGACCAA 
GCTAGTACTG 
GGTTTAAAGA 
CCAGTCGAAT 
TCTAATGGTA 
GAAGACTATG 
TATGTGGATG 
ACGTTAGAAA 
AAGGCGGAAT 
TCAGCAGTGA 
GAAGCGCATT 
GTTGAACATC 
AAGGATGACA 
AAAGATCCTA 
TACAATGCGA 
AGAATTATAA 
GAACCGCTAA 
AATAGCAGCA 
GAGTCTGACG 
GATCATTTAT 
GATACTCAAA 
GGTATGTCAT 
CAGCTGAATC 
GTGAAACAAA 
CCGAGTGATA 
CCATCGAAAG 



10320 

10380 

10440 

10500 

10S60 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 
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CATGGTGGGG CTTATATGCG TTATTAGGTA TGTTAGCTTT ATTCATTCCT AAATTCAGAA 12120 
AAGAATCTAA ATAATTAnCT AAATATAGCA TATGTATGAT TAACTTTGTA GAC 12173 

5 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 
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CAACATTAAT 


ATTGATATTA 


AATCTTCCTG 


GATAACGTGC 


TTGTTCGAGT 


GATAAGTATG 


60 


CACGCACTTG 


ACTTAACTCT 


TTATCTAAAG 


TAATCGTATG 


TTGCTTAGAG 


CCTTGTAAAT 


120 


TCGCTCTGAA 


AAAATAACTC 


AATTCTAATA 


GTAACTCTCG 


TGCCrriTCG 


CTATTTATTC 


180 


TAACTAAAGC 


TGAGATCGTG 


TTAATTGAAT 


TGiAAGAAAAA 


ATGTGGACTC 


ACTTGTGCCT 


240 


GTAATGACTT 


AATCTCAGCA 


TCTTTCAATA 


ACTTACTTTG 


CGTTTCGGCT 


TCACCAAGTT 


300 


CAATTTGGCT 


ACTAAAAATA 


TTTGCCAATC 


CTTCTGCAAG 


TTGACGTTCC 


ACAAAAGTTA 


360 


AATCATTAGG 


GTTTGTAAAA 


TACATCTTCA 


ATGTACCGAC 


GATAGAACCA 


TGCATCTCAA 


420 


GTGGTATCAC 


GATAGCTGCT 


CT AAGCGGG C 


AATTCGGATG 


ACTACAACCA 


ATCTCTTCTT 


480 


TAGTATGAAC 


TTCTTTCAAC 


^^^^^^^^^^^^^^ ?i ^^^^ 


i WAAa AGATC 


TTTAGACAGA 


CTTGTTAATA 


540 


TTTCATTTGT 


TGGTATGTGA 


TGATCACTAC 


CTGCACCTAC 


ATGCGATAAG 


ATTTCATTTT 


600 


TGCTTGTAAT 


TGCTACGGCA 


GATACTTTCA 


TTAAATTTTT 


AATAATCATC 


GCAATTTGCT 


660 


GTGCCGATTC 


TCTATTCAAT 


CCTTCTTTAA 


AATACGGCAA 


TGTCTGGTTC 


ATCAATTGCA 


720 


GTACATCATG TGTTTGAACA 


GCCTTCATTT 


GCTCCTCTTG 


CTTTAATGTT 


GAAATGATAA 


780 


TAGACATAAA 


AATCGCCGTA 


CCAACGCTAT 


TAACAATAAT 


CATTGGTAGT 


GCAATTAATG 


840 


ATATGAGGTC 


AACCGCATAT 


GCTTTGTCGT 


GGGAAAATGT 


TAAAATGCTC 


AACATTTGAA 


900 


TCATTTCCAT 


AACAATTCCA 


ATCATGGCAC 


TTTTCGCAAT 


ACTCGGGTAA 


CGCTTGCGTC 


960 


TTTGAGCTTG 


TAAGCCAAAA 


TAACCAGCAA 


TTATACCAAT 


AAATATAGAT 


GAGATAAGAT 


1020 


AAACTTGTGC 


ATCCGCCCCA 


CCCATATACA 


CTCTGAAAAT 


ACCTGAAATA 


ACGCCAACAA 


1080 


ATAGACCTAC 


AAAAGGGCCA 


CCAACTAATC 


CTGCGACACC 


TATCGTTAAT 


ACACGTGTGT 


1140 


TAGCTAAAGA 


TACATCATCA 


TCTAAACGGA 


AGTACACACT 


TCCTGACAAA 


CTATGTTGAT 


1200 


GATCGATGAC 


GATACCAGTT 


AAATTAGACA 


TTAAGGCAAA 


CAAACTGAAA 


ATAATACATA 


1260 
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(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 7972 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 





TATAAATATT 


ATTTTATTAT 


CGTTTATCGC 


TATTTGTGTT 


ACATAATCAA 


AACCATAAAT 


60 


IS 


TCTTACTCAT 


TCAGATTTAC 


CCAATATTTT 


TACTTTTATA 


ATGTAATGCG 


TTTTATCCAA 


120 




GTTATTrrrr 


AAAAATAAAT 


ATTGAATTnG 


GGGCTGnTTT 


CATGTCATTA 


AGAGATGAAG 


180 




CATTGGAAAT 


GCACAAACGT 


AATCAAGGTA 


AATTAGAAGT 


TAAACCAAAT 


GTAAAAGTTA 


240 


20 


CTAATAAAGA 


GGAATTAAGT 


TTAGCATACT 


CACCTGGCGT 


TGCTGAACCG 


TGTAAAGATA 


300 




TTTATGAAGA 


TAAAAGAAAA 


GTATATGATT 


ACACAATTAA 


AGGAAATACA 


GTTGCAGTTA 


360 


25 


TTACTGATGG 


AACAGCGGTA 


TTAGGTTTAG 


GTAACATTGG 


ACCTGAAGCA 


AGTATTCCTG 


420 


TAATGGAAGG 


TAAAGCAGTA 


TTATTCAAAA 


GCTTCGCTGG 


TATCAATGGG 


GTGCCTATTG 


480 




CGTTAAATAC 


AACTGATACC 


GAAGAAaTCA 


TTAAAACAGT 


TAAGTTGTTA 


GAACCTAATT 


540 


30 


ATGGTGGTAT 


TAATTTAGAG 


GATATTTCGG 


CACCACGTtG 


TTTTGAAATT 


GAAGAACGAT 


600 




TGAAAAAAGA 


AACTAATATT 


CCGGTATTCC 


ATGACGATCA 


ACATGGTACA 


GCAATTGTAA 


660 




CATTGGCAGG 


TTTGGTAAAT 


GCATTGAGAG 


TTGTTAACAA 


AGATATTGCT 


AAAATAAAAG 


720 


35 


TTGTACTAAA 


TGGTGCTGGT 


GCAGCAGGAA 


TAGCCATTGT 


TAAATTACTA 


TACGCGTATG 


780 




GTGTAAGAAA 


TATGGTTATG 


TGTGACTCAA 


GAGGCGCAAT 


TTTTGAAGGA 


CGTTCATATG 


840 




GTATGAATCC 


TACGAAAGAT 


GTTGTAGCAA 


AATGGACAAA 


TAAAGATAAG 


ATTGAAGGGT 


900 


40 


CTTTAGAAGA 


AGTCGTAAAA 


GACGCAGATG 


TATTTATCGG 


GGTTTCTGTA 


GCTAATGCGC 


960 




TGTCACAAGA 


TATGGTTAAG 


AGTATGGCAG 


ATAATCCAAT 


TATATTTGCA 


ATGGCTAATC 


1020 


45 


CAAATCCTGA 


AATAATACCT 


GATGATGCCA 


AAGCGGCAGG 


TGCACGAGTT 


GTTGGTACAG 


1080 


GACGTTCAGA 


CTATCCTAAC 


CAAATTAATA 


ATGTATTAGC 


TTTCCCTGGT 


ATTTTTAGAG 


1140 




GTGCATTAGA 


GGTTGAAGCT 


ACACATATAA 


ATGAAGAAAT 


GAAAAAGGCA 


GCTGTAGAAG 


1200 


SO 


CGATTGCTGA 


TTTAATCGAT 


AGTTCTGAAT 


TAAATGAAGA 


CTACTGTATC 


CCAGGACCGT 


1260 




TTGATAAACG 


TGTAGCGCCA 


TCAGTTGCTC 


GTAATGTTGC 


TAAAGCGGCA 


ATGGAATCTG 


1320 




GAGTAGCTAG 


GATTGAAGTT 


GATCCGCAAG 


ATGTGTATGA 


TAAAACAATG 


AAACTTACAG 


1380 



ss 
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ATTAAAATGA TGAAAATGAA AGTTTATGAT AAACATTCAA CAGTCAAACG AATATAAATC 1500 

AAATAAATTT AAACCCGTTT TTAACTGGTC AAGTTCAGTT TAAGGCTCTA AATGGTTAGA 1560 

5 

ACAGAGGTTA TTTGGAGGTT TTCCTATGTT TAAAGATTTT TTTAATCGAA CAAAGAAAAA 1620 

GAAATATCTT ACAGTACAAG ACTCTAAAAA TAATGATGTG CCTGCAGGTA TTATGACTAA 1680 

GTGTCCAAAG TGTAAGAAAA TTATGTACAC AAAAGAATTA GCTGAAAATT TAAATGTGTG 174 0 

10 

CTTTAATTGT GATCATCATA TTGCTTTAAC TGCGTATAAA CGTATAGAAG CAATTTCTGA 1800 

TGAAGGATCA TTTACAGAAT TCGATAAGGG AATGACCTCT GCGAATCCAT TAGATTTTCC 1860 

75 aAGTTATTTA GAAAAAATTG AAAAGGACCA ACAAAAGACA GGTCTTAAAG AAGCAGTTGT 1920 

GACTGGTACA GCACAACTAG ATGGTATGAA ATTTGGCGTT GCTGTCATGG ATTCACGTTT 1980 

TAGAATGGGA AGTATGGGAT CGGTTATCGG TGAAAAGATA TGTCGCATCA TTGATTACTG 204 0 

20 CACTGAGAAC CGTTTACCAT TTATTCTTTT CTCTGCAAGT GGTGGTGCAC GTATGCAAGA 2100 

AGGTATTATT TCCTTGATGC AAATGGGTAA AACCAGTGTA TCTTTAAAAC GTCATTCTGA 2160 

CGCTGGACTA TTATATATAT CATATTTAAC ACATCCAACT ACTGGTGGTG TATCTGCAAG 222a 

25 

TTTTGCATCA GTTGGTGATA TAAATTTAAG TGAGCCAAAA GCGTTGATAG GTTTTGCAGG 2 2 80 

TCGTCGAGTT ATTGAACAGA CAATAAACX5A AAAATTGCCA GATGATTTCC AAACTGCAGA 2 34 0 

ATTTTTATTA GAGCATGGAC AATTGGATAA AGTTGTACAT CGTAATGATA TGCGTCAAAC 24 00 

30 

ATTGTCTGAA ATTCTAAAAA TCCATCAAGA GGTGACTAAA TAATGTTAGA TTTTGAAAAA 246 0 

CCACTTTTTG AAATTCGAAA TAAAATTGAA TCTTTAAAAG AATCTCAAGA TAAAAATGAT 2 520 

35 GTGGATTTAC AAGAAGAAAT TGACATGCTT GAAGCGTcAT TGGAACX3AGA AACTAAAAAA 2580 

ATATATACAA ATCTAAAACC ATGGGATCGT GTGCAAATTG CGCGTTTGCA AGAAAGACCT 2640 

ACXSACCCTAG ATTaTATTCC ATATATCTTT GATTCGTTTA TGGAACTACA TGGTGATCGT 2700 

"^o AATTTTAGAG ATGATCCAGC AATGATTGGT GGTATTGGCT TTTTAAATGG TCOTGCTGTT 2760 

ACAGTTaTTG GACAACAACX3 TGGAAAAGAT ACAAAAGATA ATATTTATCG AAATTTTGGT 2820 

ATGGCGCATC CAGAAGGTTA TCGAAAAGCA TTACGTTTAA TGAAACAAGC TGAAAAATTC 2880 

AATCGTCCTA TCTTTACATT TATAOATACA AAAGGTGCAT ATCCTGGTAA AGCTGCTGAA 2940 

GAACGTGGAC AAAGTGAATC TATCGCAACA AATTTGATTG AGATGGCTTC ATTAAAAGTA 3 000 

CCAGTTATTG CGATTGTCAT TGGTGAAGGT GGCAGTGGAG GTOCTCTAGG TATTGGTATT 3060 

SO 

GCCAATAAAG TATTGATGTT AGAGAATAGT ACTTACTCTG TTATATCTCC TGAAGGTGCA 3120 

GCGGCATTAT TATGGAAAGA CAGTAATTTG GCTAAAATTG CAGCTGAAAC AATGAAAATT 3180 
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GGTGCACT^TA 


. AAGATATTGA 


. ACAGCAAGCT 


TTAGCTATTA 


AATCAGCGTT 


TGTTGCACAG 


3300 




TTAGATTCAC 


' rrOAGTCATT 


ATCACGTGAT 


GAAATTGCTA 


ATGATCGCTT 


TGAAAAATTC 


3360 


5 


AGAAATATCG 


GTTCTTATAT 


AGAATAATCA 


ACTTGAGCAT 


TTTTATGTTA 


AATCGATACT 


3420 




GGGTT'rrACC 


ATAAATTGAA 


GTACATTAAA 


ACAATAATTT 


AATATTTAGA 


TACTGAATTT 


3480 


10 


TAACTAAGAT 


TAGTAGTCAA 


AATTGTGGCT 


ACTAATCTTT 


TTTTAATTAA 


GTTAAAATAA 


3540 


AATTCAATAT 


TTAAAACGTT 


TACATCAATT 


CAATACATTA 


GTTTTGATGG 


AATGACATAT 


3600 




CAATTTGTGG 


TAATTTAGAG 


TTAAAGATAA 


ATCAGTTATA 


GAAAGGTATG 


TCGTCATGAA 


3660 


IS 


GAAAATTGCA 


OTTTTAACTA 


GTGGTGGAGA 


TTCACCTGGA ATGAATGCTG 


CCGTAAGAGC 


3720 




AGTTGTTCGT ACAGCAATTT ACAATGAAAT 


TGAAGTTTAT 


GGTGTGTATC 


ATGGTTACCA 


3780 




AGGATTGTTA 


AATGATGATA 


TTCATAAACT 


TGAATTAGGA 


TCAGTTGGGG 


ATACGATTCA 


3840 


20 


GCGTGGAGGT 


ACATTCTTGT 


ATTCAGCAAG 


ATGTCCAGAG 


TTTAAGGAGC 


AAGAAGTACG 


3900 




TAAAGTTGCA 


ATCGAAAACT 


TACGTAAAAG 


AGGGATTGAG 


GGCCTTGTAG 


TTATTGGTGG 


3960 




TGACGGTAGT 


TATCGCGGTG 


CACAACGCAT 


CAGTGAGGAA 


TGTAAAGAAA 


TTCAAACTAT 


4020 


25 


CGGTATTCCT 


GGTACGATTG 


ACAATGATAT 


CAATGGTACT 


GATTTTACAA 


TTGGATTTGA 


4080 




CACAGCATTA 


AATACXSATTA 


TTGGCTTAGT 


CGACAAAATT 


AGAGATACTG 


CGTCAAGTCA 


4140 


30 


CGCACGAACA 


TTTATCATTG 


AAGCAATGGG 


CCGTGATTGT 


GGAGATCTAG 


CATTATGGGC 


4200 


TGGATTATCA 


GTTGGTGCTG 


AGACAATTGT 


AGTTCCAGAA 


GTGAAAACAG 


ATATTAAAGA 


4260 




AATAGCTGAT 


AAAATTGAAC 


AAGGTATTAA 


ACGTGGTAAG 


AAACACTCAA 


TCGTTCTTGT 


4320 


3S 


AGCAGAAGGT 


TGTATGACTG 


CGCAAGATTG 


TCAAAAAGAA 


TTATCACAAT 


ACATCAATGT 


4380 




TGATAATAGA 


GTGTCTGTGT 


TAGGTCACGT 


TCAACGTGGT 


GGTAGCCCAA 


CAGGTGCGGA 


4440 




TAGAeTTTTA 


GCATCACQTT 


TAGGTGGATA 


TGCGGTAGAC 


TTATTAATGC 


aAGGTGAAAC 


4500 


40 


AGCTAAGGGT 


GTTGGAATTA 


AGAACAATAA 


AATTGTAGCA 


ACATcrrrixi 


ATGAAATTTT 


4560 




TGATGGTAAA 


GATCATAAAT 


TTGATTATAG TCTATATGAA CTTGCTAACA AGTTATCTAT 


4620 




ATAAGATTTC 


AGGAGGAATT 


ATAAAATGAG 


AAAAACTAAA 


ATTGTATGTA 


CAATTGGACC 


4680 


45 


AGCTTCAGAA 


TCAGAAGAAA 


TGATTGAGAA ATTAATCAAT GCTGGTATGA ACGTTGCACX; 


4740 




ATTAAACTTT 


TCACATGGTA 


GTCATGAAGA 


GCATAAAGGT 


AGAATTGATA 


CAATTCGTAA 


4800 


50 


AGTAGCTAAA 


AGATTAGACA 


AAATTGtAGC 


AATTTTATTA 


GATACAAAAG 


GTCCAGAAAT 


4860 


TCGTACGCAT 


AATATGAAAG 


ACGGTATCAT 


TGAACTTGAA 


CGTGGCAACG 


AAGTTATTGT 


4920 




TAGCATGAAT 


GAAGTTGAAG 


GAACACCTGA 


AAAGTTCTCA 


GTAACATATG 


aAAACTTAAT 


4980 
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TAAAGATATT GACCATGCTA AAAAAGAAGT TAAATGTGAT ATTTTAAACT CTGGTGAGCT 5100 

TAAAAACAAA AAAGGTGTTA ACTTACCTGG CGTAAGAGTA AGTTTACCTG GTATTACAGA 5160 

AAAAGATGCT GAAGATATCC GTTTCGGTAT TAAAGAAAAT GTTGACTTCA TTGCAGCAAG 5220 

TTTCGTACGT CGTCCTAGTG ATGTTTTAGA AATTCGTGAA ATTTTAGAAG AACAAAAAGC 52 30 

TAACATTTCA GTATTCCCTA AAATTGAAAA CCAAGAAGGT ATTGATAATA TTGCGGAAAT 534 0 

TCTTGAAGTG TCTGATGGTT TAATGGTTGC ACGTGGTGAC ATGGGTGTTG AAATTCCACC 54 00 

TGAAAAAGTA CCAATGGTTC AAAAAGATTT AATCAGACAA TGTAACAAAT TAGGTAAACC 54 60 

^5 AGTTATTACA GCTACACAAA TGTTAGATTC TATGCAACGT AACCCACX5TG CTACACGTGC 5520 

AGAAGCTAGT GACGTTGCCA ACGCAATCTA TGATGGTACA GATGCAGTAA TGTTATCTGG 5580 

TGAAACTGCT GCTGGTTTAT ATCCTGAAGA AGCTGTTAAA ACAATGAGAA ATATTGCTGT 5640 

20 ATCAGCTGAA GCAGCCCAAG ATTACAAAAA GTTATTGTCA GATCGTACTA AATTAGTTGA 5700 

AACTTCATTA GTGAATGCTA TCGGTATTTC GGTTGCACAT ACAGCTTTAA ACTTAAATGT 5760 

TAAAGCAATT GTAGCTGCTA CTGAAAGTGG TTCAACGGCA CGTACTATCT CCAAATATCG 5820 

TCCACATTCA GACATTATTG CGGTGACTCC AAGTGAAGAA ACTGCACGTC AATGTTCAAT 5880 

TGTTTGGGGA GTTCAACCTG TAGTTAAAAA AGGACGTAAG AGTACAGATG CATTGTTAAA 5940 

CAATGCAGTT GCAACAGCTG TTGAAACTGG TAGAGTATCT AATGGTGATT TAATCATTAT 6 000 

TACTGCTGGT GTACCAACTG GTGAAACTGG AACTACTAAT ATGATGAAAA TCCACCTAGT 6060 

TGGTGACGAA ATTGCTAATG GTCAAGGTAT TGGACGTGGA TCAGTTGTTG GTACTACGTT 6120 

AGTTGCTGAA ACTGTTAAAG ATTTAGAAGG TAAAGATTTA TCTGACAAAG TTATCGTTAC 6180 

TAACTCAATC GATGAAACGT TTGTACCTTA TGTAGAAAAA GCTTTAGGCT TAATTACAGA 6240 

AGAAAATGGT ATTACATCAC CAAGTGCAAT TGTTGGTTTA GAAAAAGGTA TTCCAACAGT 6300 

40 TGTAGGTGTA GAAAAAGCTG TTAAAAACAT AAGCAATAAC ATGTTAGTTA CGATTGATGC 6360 

TGCTCAAGGT AAAATCTTTG AAGGATATGC AAACGTACTA TAATTTATAA AAAAACGTCT 6420 

TTCCATTTAT CAACAATGGA AAGGCGTTTT TTGGTTcATC TGGTAmTA TGACGTAATT 64 80 

AATAGGTTAT TTGATAATGA TAGTGTATGA ATGGCAATCT ATATAAATGT TTATATCTTT 6540 

TATACATGTA CATTATCACC TTCAAACTTT CACTCATATT ACTTTGGAAA TTTATTATAA 6600 

AATAGAAGTA TGGATGTATT TCTGAAATGA TACATTATTA AATAGATGAG AAAGTAAAAG 6660 

TTTTGAGCCA AGTACGCAAT TTAATATTAT AAGTTGCATA TAAAACAGGA TGGGACATAA 6720 

ATCCCTAAAA AAACAGCAGT AAGATAATTT TCAATTAGAA AATATCTTAC TGCTGTTCTC 6780 

55 
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tCTTCGACTG 


GCACTGCTCC 


CTCAGGAGTC 


TCGCCATTAA 


TACTACGTAT 


TAACATGTAA 


6900 


5 


TTTTACTTTT 


ACATACTTTA 


AAAAAATAAA 


ACACTTTGCC 


CAACTTGCAC 


ATAAATGTAA 


6960 




AATTCAATAA 


AATGAATTTT 


CTGTGTTGGG 


TCCCTTCGTA 


TAATTTAATA 


AATACCACTA 


7020 




AACTAAATTA 


ACGAGGTGCC 


TTATGTATAA 


AATTTATAAC 


ATGACCCAAC 


TTACACTACC 


7080 


10 


AATAGAAACC 


TCTGTTAGAA 


TTCCTCAAAA 


TGATATTTCG 


CGATATGTTA 


ATGAAATTGT 


7140 




TGAAACX5ATA 


CCTGATAGCG 


AATTCGATGA 


ATTCAGACAT 


CATCGTGGCG 


CAACATCCTA 


7200 




TCATCCAAAA 


ATGATGTTAA 


AAATCATCTT 


ATATGCATAT 


ACTCAATCTG 


TATTTTCTGG 


7260 


IS 


TCGTAGAATA 


GArAAATTAC 


TTCATGACAG 


TATTCGAATG 


ATGTGGTTAG 


CTCAAGATCA 


7320 




AACACCTTCT 


TATAAAACTA 


TTAATCGTTT 


TAGAGTGAAT 


CCTAATACTG 


ATGCGTTAAT 


7380 




TGAATCTTTA 


TTTATTCAGT 


TCCATAGTCA 


ATGTTTAAAG 


CAAAATCTTA 


TTGATAATAA 


7440 


20 


TTCAATrrrr 


ATTGATGGTA 


CAAAAGTAGA 


AGCTAATGCC 


AATAGATATA 


CATTTGTGTG 


7500 




GAAGAAAAGT 


ATTCAAAATC 


ACGAATCGAA 


ATTGAACGAA 


AATTCAAAAA 


CATTATATCG 


7560 


25 


TGACTTAGTT 


GAAGAAAAAA 


TAATACCAGA 


GATAAAAGAA 


GATGGAGATA 


GCGATTTAAC 


7620 


AATAGAAGAA 


ATAGATTTAA 


TTGGTAGTCA 


TTTAGATAAA 


GAAATCGAAG 


ATTTAAATCA 


7680 




TTCTATTGAG 


AACGAAGATT 


GTGCTCAAAT 


TAGAAAACAG 


ACCCGTAAAA 


AAATAACTGA 


7740 


30 


GATTAAGAAG 


TTCAAAAAGA 


AATTTGATGA 


TTATTCCGAA 


AGAAAAAATA 


AATATGAAGA 


7800 




ACAAAAATCG 


ATTCTTAAAG 


ATAGAAATAG 


TTTTTCTAAA 


ACTGATCTGA 


TCATGATGCA . 


7860 




ACTTTTATGA 


GAATGAAGGA 


AGACCATATG 


AAAAATGGCC 


AACTTAAGCC 


AGGATACAAT 


7920 


3S 


TTACAAATAG 


CGACAAATTC 


TCAAAAATGT 


TTTATCCTAT 


GACCTATTTC 


AA 


7972 



(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : dotlble 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 
ATTTTTTAGT TAATTGTCTT TCTTAAAATA ATTTTAGCTT TCATTAAATT AAACAATTTT 60 
ACAAGCTTGG AACACCAATC AAAATCCTAA GTTCTAAAAT GCAATATTAG TAGTCGTTGA 120 

SO 

CTGAATGAAC ATATGCTTAT AATATTTTTT TGCAATGCTA GTCAAGTTGA TTTATGCTCA 180 
CAAGGATATG CGATTTATAT TTTCTTACAA CAATGAAAAT GCCTGATACA ATGCGATCCT 24 0 

55 
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10 



IS 



20 



2S 



30 



35 



40 



45 



SO 



AATTAAATTA TTGTTGTTTT AACCATTGTA CTATCGTCGG AATCATAAAT CCTGTTGGCC 360 

CTTTTGGACC ATTATATGAA GCTTTATTAG TCGTTGCTGG ACCTTGCAATA TCGAAATGAA 420 

TGTGAGGTGT TTGACCACTA AAATGTGTTA CAAAACTTGC CGCAAATAGC GCTTTACCTT 4 80 

GTCCATTCGT ATGGTTAACT AAATCAGCGA TATCACTGTG TTTAATACTT GCACGTTCGG 54 0 

TTGCAGTAAT CGGTAATTCA AATACCATTT CATCGACTTC AGAACTTATT TGTAATATAT 600 

CGTTTAATAT CACTTTACTA TTCGATTCAA ATGCAGCAGC TTTATCATCG CCTAGTGCAa 660 

CAATTGCTGC ACCCGTTAAT GTAGCAAAGT CCATAATCAC ACTAGGCTGA TATTGATTTG 720 

CATAAAACAC AGCATCTGCA AGGACTAATC TACCTTCAGC GTCTGTATTC ATTACTTCTA 780 

CAGTTTCACC ACTTAATGCT GTAAATACAT CATCTGGcTT CATTGATGCT TCATTTATCA 84 0 

TATTTTCAGC ACACGCAAGC ACTCCGACAA TATTTACAGG CAGTTGTAAA CGAcTAGCCG 900 

CTTCAATGAT ACCAACGACA TTCGCAGCGC CACACATGTC AAACTTCATT GTAGCCATGC 96 0 

CATTCTTCGT TTTAATACTA TAACCACCAG AATCATACGT TATACCTTTA CCAACTAAGG 1020 

CAATTGGTGC TTCATCTTTG TCTTTGCCAT TATATGTGAT^ GGTTACTAAt CTCGGTTTAT 10 80 

GCTTACTACC TTTACCTACT GCTTGTAAAA GTCCGAATCC TTCAGAAACT AAAGTGTCAT 1140 

AATCTTTAAC ATCTACTTTG ACCTTTGTAT TTTTAAAATG ATTAACAATA TCTTCTGcAA 1200 

ATGTTTGTGG TGtTAATACA TTCGGTGGCA TATTACTAAA GTCTCyTGcC AAATTAATGG 1260 

aTTGGcCAAT ACTGATACCC TCATGAATAA AATCTAATTC AATTAATGAT TCACTAATAA 1320 

GGkTTAAATT CGTCTTAAAC GG tGCCTTCT kACTTgrTTT ATAATGATCG aATyCATATG 13 80 

kTGcACGCTC ACTTTGGAAT ACCGCATGCC ATTAATACAT CTGgATAACT GgATCcATAT 144 0 

TTTGgAnATA AATGaATCCa TAAGTAAATA CGTATCTTCa ATGTGTTCTG ACTTTATGTA 1500 

TTGGAAAAGA TGTCCCCaTA TTTTCAACAT ATCTTGATAA tACCTGTCTT TAAGTTTCCT 1560 

AAACCAACTG TAATTAAACX3 ATATGTTTGA TCTTGTACAT CAAATGCAGT TGTATAAATC 1620 

TTCCCAACTT TACTACCAAT AATATGTTGA TGTTTTAGTC TTTCAAGTGA TTCTGTAATA 1680 

TCGATATGAT TAAAACTAAT GCX3CTCTAAC TGATTTAAAT GTTCTGGTAT ACCAATAATC 1740 

AATGTATTTA TTTCGTTGCT TAGTGTGTTA TTTAATTTAA AATTCATAAT GTACCTCCTT 1800 

AGATTTGATA TGTTACTCAC TTCAAATTGT ACAACAATAA AGCCCTcAGT GACACTGAAG 1860 

GACTTTATAA ATGAATAATT TAATTTTATG TGTTTAGCTG AAACTTTATT TAAAGCCTAG 1920 

AATTTACCTT TTTTGAATGC TAAACCGATA CCACCGATTT TGAATACCGC ACGTGTATCA 1980 

ATCACTTTTT TCATGAATGC TGCTTTTTTA CCAGCGATAG GTTTACCAAA TACCATACCT 2040 
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GTTGATTCAC CGTTTAAGAT GCGTTTAATG TTTTTAGCAA CACTTTCACC TTCTTGCATT 
GCAATTTGTG CTGTAGTTGG TAATGGACGT TCTTCTCCAG CTGGGATAAA CGCTGAACAG 
TCACCAATAA CAAAAATGTT GTCGTAACCA TTGATTGTTA AATCTTGCTT TGTAACGATA 
CGTCCACGTT TAACGCCTTC AAATGATTCT TCCATTAATT TACTACCACG TACACCAGCT 
GCCCATACTG AAGTACCTGC ATTTAATTGT TGTTTTTCAC CATCTACTTC AACTACAAAA 

10 

CCTTTTTCGT TACAAGCAAC GATTGGTGTA GCAATTTTAA ATTCAACACC GCGGTCTTCT 

AAGTAGCTAA cTGCGTGGTT AACTAATTCT TCTGAGAACA TTGGTAACAT TTTAGGTGCT 2S20 

GCTTCAACAC AAGTGATTTT AACTTTATTT TGATCCACAC CATATTTGCT ACATAATTCA 2580 

GGAATTCTGT CTGTTAATTC ACCTAAGAAT TCAACACCAG TGAATCCAGC ACCACCAACT 2640 

AAGATAGATA AATCGTTATC ATCTTTTTCT TTTGATGCTG CATAGTTAGC AAATTTGTCT 

20 TCGATATGAC GTGATAATTC ACGTGCTGTG ATAACATTTT CAATTTGGAA AGCATGATCT 

TTCATACCTT CGATGCCGAA TGTTTCACTA ACGAAACCTA ATGCTACTAC TAAAATATCA 

AAGTCATAAA TACCTTGATT TGTTTCTACC TTTTTAGCAT CACGGTCAAT TTTTGTTACT 

TCTGCtTGAA CAAAGTTCAC TTTGTCTTTC TTCAAGACAC TTTCCACAGG ATATAATACA 

TCTTCATAGT TTAGTGTACC TGCTGATGCT TCATGTAACC ATGTTGCTTC ATAGTGATAT 

TCATTTTTAT TAATAAGCGT AATTTCTGCT TCTTCTGTTG ATATCGCTTT TTGCAATrTA 

30 

GTTACAGTTT GTAAACCTGC ATAACCAGCA CCAAGTACAA GTACTTTTTT ACGATCTTGA 

GCCATTTAAT TnCACCTAAG CnTTCATATT TTTTTAACCA AATGCTGATA ATTAC 

(2) INFORMATION FOR SEQ ID NO: 314: 

3S 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
40 (D) TOPOLOGY: linear 



2700 
2760 
2820 
28B0 
2940 
3000 
3060 
3120 
3175 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

CATCATTATT AAAGATTTTC AATCAATACA gAATCACAAT ACGTACGCAT TGTGCACGAT 60 

AAAAATACAG ATGTGTATAT TAACTATGAA CTACAAGAGC AACTAACGAA CAAAGCTTAC 120 

ATTGGTGATC ATATTTATGT TGAAGGGATA TGGCTCGAAG TACAAGCTGA TGGTTTAAAT 180 

GTATTGAGTC AGAATACAGT GGCATCGTCA TTAATTCGCT TAACACAAGA GATX3CCACAT 24 0 

GCACAGGCAG ATGATTACAA TACGTACCAT CGTTCGCCAA GGATTATTCA CCGTGAACCG 300 
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TGGCGTTCCA TTATACCGCC ATTAGTAATG ATTGCTTTAA CTGTTGTCAT CTTTTTAGTG ' 420 

AGACCAATTG GTATTTATAT TTTAATGATG ArTGGTATGA GTACAGTAAC GATAGTATTT 480 

GGTATTACAA CGTATTTCTC TGAAAAGAAA AAGTATAACA AAGATGTTGA AAAACGAGAG 54 0 

AAAGATTACA AAGCTTATTT GGATAATAAA TCTAAAGAAA TTAATAAAGC GATTAAAGCA 600 

CAACGTTTTA GTTTGAATTA CCATTATCCA ACGGtTGCTG AAATTAAAGA TATCGTTGAA 660 

ACGAAAGCAC CAAGAATATA TGAnAAAACC ATCGGCATCA TC 702 
(2) INFORMATION FOR SEQ ID NOi 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 
TATGTTCCGA CAACGAAACA AAGTGTAATT ACAAGAGCAA AGATAACTTT GAATGTTTGT 60. 

AAACGTCCAT CTTTACCTTC AGTTAAATGC ATGAACATTA ATAATTGArG TCCTGCTTGG 120 

ACGAATGCAA AGCCAAAGAT AATTGTCAAC TTCGCGTGGA ATGTTAATGA CGTGTATAGT 180 

GTTACGTAAA CTGCTAAAAG CGTTAATACG ATAGATGCGA TAAATCCTAC AGTATGTTTC 240 

ATTATTGTAC TCATCCGCTA TACACCATCC CTATCATATA TACGGCAGTA AAGATGAAAA 300 

CCCAAACAAC ATCTAAGAAG TGCCAGTATA AACTTACTAT AAATAATTTT GGCGCATTAT 3 60 

ATTTGTCTAA TCCGCGTCGT TGGATTTGGA TTAATAAACA AATGGCCCAA ACGATACCTA 420 

GCGATAOGTG ACAACCATGC GTTCCTAATA GGATAAAGAA ACTAGACCAG TAAGAACCAA 480 

TTGTTGGGTT AACGCCTTCT GATGCATAGT GTGCGAATTC ATAAATTTCG AATCCAACAA 540 

40 AGACTAAACC TAAAAGTAAC GTAATGATCA TCCAAAACAT CATTAACTTT TGTTTTTCTT 600 

GGq3CATGTA GTAAATAGCA ATACCACATG TGTAAGAACT GAATAATAAT GCAAACGTCA 660 

TTATTAAAAC AAGAGGCAAT TCAAATAACT CAGTAGTCAT TTTACCTGCA TAATCGCCAC 720 

CATGTTGCAA AGTTAATAGT GTCGCAAATA GGGTACCGAA TAACGCAAAT TCGGCTGTAA 780 

TGAAAATCCA AAAGCCAAGC TTATTTAATT CGCCTTCATG TGTGCGTGAA TCAATAGTGT 840 

TTGTATCATG ACTCATGACT TACAGCCTCC CTTTCTTTAA TACGCGCTTC TCTTAATCTA 900 

GCTTCAGTTT CTGCAACTTC AGCAGCAGGG ATATGATATC CGTGATCGAT TTGGAAACTC 960 

CGATAAATCA TAGTACCAAA AATACCGAAT AAACAAATTA ATGCTGGAAT TACA G TTTCG 1020 
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TTGTTTGGCA 


TATGAATGTC TTTGTAATTA 


TGGTTGTCTA AGTAATGACG ACCATGTTCT 


1140 


5 


TTCATATCAA 


CAAATGTGTC GTAGTCATTC 


CAATCTGGTG 


TAATGGCAAA 


GTTGTATTTA 






GGTGGAATTG 


CTGATGCAGT AGTCCACTCT 


AGAGTACGAC 


CAAGGCCATC 


CCAGTTATCT 


T O C 




CCAGTTGCTT 


CACGTGGAGA TTTGAAGTGA 


CTGTATACGA 


TACTAACAAC 


AAGGAATAAG 


1 1 o 
X o ^ U 


10 


AATCCGATTG 


CCATTAATAA TGCACCGATA 


GTTGAAATTA 


AGTTTAATAA 


GAACCAACCA 


X ^ o U 




TCTGATGGCA 


TATAAGTGTA TAAACGACGT 


GGCATACCAT 


CTAATCCAAG 


AATGAATTGT 


1 d. d n 

X *t *t \J 




GGTAAGAAAC 


AAACGTTAAA TCCGATCATG 


AAGAACCAGA 


AGCACCATTT 


GTTTAATGTT 


1 ^ o n 

X 3 vf U 


IS 


TCGTTTAATT 


TGTAACCCAT CATCTTTGGA 


TACCAGAAGA 


TTAAACCAGC 


TAAGCAGGCA 


X 3 o U 




AATACAACAC 


CAGTAACCAA TGTATAGTGG 


AAGTGAGCTA 


CTAAGAAGTA 


CGTATTGTGA 


X b 




TATTGATAGT 


CAGCTGATGC CATTGCTAAC 


ATTACACCCG 


TAACACCACC 


TAAAAGGAAG 


Xb oU 


20 


TTAGGGATAA 


ATGCTAATGA GAATAGCATT 


GGTGACTCAA 


ATGTAATACG 


TCCTTTATAT 


X /4 U 




AATGTTAATA 


ACCAGTTAAA CAATTTCACA 


CCAGTTGGAA 


TACCAATCAG CATTGTTGAA 


X o UO 


2S 


ATTGAGAAGA 


ATGAGTTGAT TAAQGCACCA 


TTACCCATTG 


TGAAGAAATG 


GTGAACCCAA 


1 Q c n 
X a b u 


ACTAAGAAAC 


TAAGGAACGC GATACCGGCA 


GTTGCCCATA 


• 

CCATACTTTG 


ATGTCCGAAT 


1 Q *i 
X 




AAACGCTTAC 


GAGCGAATGT CGGGATAATT 


TCTGAGTAAA 


TACCAAATGC 


TGGAAGGATA 


1 Q O f*l 

X ? ts U 


30 


ACGATATAAA 


CTTCAGGGTG CCCCCATACC 


CAGAAGAAGT 


TAGCCCAAAG 


CATTGGCATA 




CCGCCATGTG 


CAACTGTGAA GAATGCTGTG 


TCAAATATTC 


TATCAGTTGT 


CATTAATGCT 


^ X u u 




AACGCTACTG 


TTAAAGGAGG GAAAGCAAGA 


ATAACAATTA 


ATGTAGTAAT 


AAATGTTGTT 


^ X Q W 




ACTGTAAACA 


TTGGCATTTG CATAAACTTC 


ATAGTTGGTG 


TTTTACATCT 


TAAAATTGTT 






ACAAAGAAGT 


TGATACCTGT AGCTAAGGTA 


CCAAGCCCTG 


AAATTTGTAT 


AGCTATTAAG 


^ ^ O w 




TAATAGTTAA 


CACCCGGACC AGGACTGAAT 


TCACCTGCTA 


GTGGCGCATA 


GTTTGTCCAA 


^ J 1 w 


40 


CCAGCTGCTG 


GTGAACCACC AATAATAAAT 


GACAGGTTGA ATAAAATCAT ACCTGCAAAG 


2400 




AATAGCCAGA AACTTACX5TT GTTTAATACT 


GGGAATGCAA 


CATCACGTGC 


TCCAATTTGT 


2460 




AATGGAACAA 


CX3ATATTCCA TAAACCAAAG 


ATAAATGGCA 


TTGCCATGAA 


GATAATCATG 


2520 


4S 


ATTACACCAT 


GTGTACTAAA AATTTCGTTA 


TAGTGGTTAG 


ATTCTAAAAA 


TTTGTTATCA 


2560 




GGTACTGTTA 


ATTGCGCACG AATAAGTAAC 


GCATCAATAC 


CACCACGGAC 


GAACATTAAT 


2640 


SO 


ACGGCACAGA 


TTAAATACAT AATACCGATT 


TTCTTATGGT 


CTACAGATGT 


GAACCATTCT 


2700 


TTGTAAAGAT 


ATTTCCATAA TTTAAAGTAA 


GTAATTACTG 


CGATTAAACC 


AATAACTAAG 


2760 




AATGGGGCAC 


CAATTTGTGC CATTGTAATC 


ATcCAGTTAC 


CTTTAACTAG 


TAATTGATCC 


2820 
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TTGAAATTTT CTTCATTTCT TTCGCATTTT TCGATTCATC TTTCTTGAAC TCATTGTTAT * 294 0 

ATGGTTCGTC ATTTCCAAGA ATCATCAACT TCATACCATG TCGTTTATAG TTCGCATTTG 3000 

TAATTTGAGC TTTACGAGCA GGTATTAATG GTTTGTCTGA TACATCTTTA AACATATTTT 3060 

CTTCACTAGT GAAGTTTGGA TCTTTCAATT CGAAATTGAA ACGTTTATAT GCATAGAAGA 3120 

TGTATTCTGG ATCGGCTGCT GGATCAACAA ACGCCATATG TGTACCATTA AATTCTAAAG 3180 

CTTTATTAGG TGTGCTTGGT AATAATTGTT TATCAAATGT ATCTTGATCT AACGTTTTCT 3240 

TACCTTTAAC TTCTTTCACC CATTTGTCGT AGTCTTTTTG ACTAACGGCA TTTACTTTAA 3300 

ATGTTTGACG TGAGAATCCT TCACCATTGA AGTTAGAGTT ACGACCTCTG AACGTACCAG 3 360 

TTTGAGATGC TTCTAACGTC CAATTCATTG TCATGCCAGT CATGGCATAT TTTTGACCAC 34 20 

CTAATTGTGG AATCCAGAAA CTTGTCATTG TATCCATAGC TTGAAGCTTA AATACAACAG 34 80 

GACGATCTTT AGGGATTGTT AATGTATTAA CAGTCTCTAT ATGTTCATCT GGATAAGCAA 3540 

AGAACCATTT GTATCCTGCA CTTACTGCAT ATACAACCAT TGGATCTTTC TCACTCTTCG 3600 

GTGGTTTTTC GTAATCGTAT AAAGTTTTAA CTGTAGGAAT AGctAAAGCA GCAACGATTA 3 660 

25 TGATAGGTAT TACAAACCAT ATTGTTTCAA TGATGGCATT ATGGTGCATC TTACCAGATT 3720 

CGGCATTCTT ATTATAACTA TACTTGTAAA TAAAAATGGC GAACATGCCA AGTACAACX5A 3780 

AACAAATAAC AAGCATGAAG ACGATTGAAT AAAGAATCAA GAACTTCTGA CTACTTGCTA 3 84 0 

SO CTGGCCCTTT TGCGTTGAAA ATTTCTATAT TTGAACAACC ACTAAGTAAA ATTAGTGTGC 3 900 

CAAATAATAG AAGCAAAGAC TTAAATTTTG ACACTTTTTT GACCTCCTAA TACTACAAAT 3960 

GTAGGGCTTA ACATTAATTT TAAGTTATTA CACAATATTT ACAAGGGCTT ATGGGAAAAA 4020 

AATTAATAAA ATTGTATCAA AAATGTTGAT AAATCAAGGT GTGACGTGGG TTCACACATT 4 080 

TGTTAAAATT ATGTGTACAT TTTGTQACTA ATAGCGTTTT T 4121 
(2) INFORMATION FOR SEQ ID NO: 316: 



20 



35 



40 



45 



SO 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 3i6: 
CGAGTGAGTA CAmACATATT TTTATTTGCA AGGGGTAAAT GGCATATAAC TATCTTTTTT 60 
ATGTAAGCTG GTATAAAATT TATACTAATA GGAGGGATAG TATGAATATA GTAGGGCATC 120 
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TTGGATTACG ATTAGTTGAA AAGTCGGTTA ATCAAGACAA 


. TCCTTCAATG 


TATCATTTGT 


240 




TTTATGGGGA CGAAgTAGGT ACAGCCGGAA CAATITTAAG 


CTTTTTTGAA ATTCCCAATG 


300 


5 


CGGGTCATAA 


. GCAGCCAGGT ACTGAAACGA TTTATCGATT 


TTCATTATTA 


GTACCAAATC 


360 




AAGCGGCACT 


TCATTATTTT GAAAAACGTC TTGAGAATAA 


TGGTATTAAG 


TCTGAACGTT 


420 


10 


TGTACTATCT 


TGGACAAGAA GGTGTTGTCT TTAAAGATGA 


AGACGACTTA 


GAAATCATAT 


480 


TGCTTGTTAA 


TGATAGTTTT GAAGTACCAC ATCAATGGCA 


ACATAACGCT 


TATAGTGAAA 


540 




TACCTCAAGC 


ATATCAAATT TTAGGAATAG GGCCAGTCGA 


ATTAAGAGTT 


AGAAATGCAG 


600 


15 


CGCGTACGGT 


AGAATTTTTG GAAAATGTCT TAGGTTATCG 


CAAAAGAGAT 


AATAAATCAT 


660 




TCGATGTGCT 


GACATTAGCA CCACAAGGTT TATATTCGGA 


TTTTGTAGTT 


ATTGAGCAAC 


720 




AGGGACAACG 


TGAAAGACCT GGACGAGGTT ATATCCATCA TATTGCAGTT 


AATACACCAC 


780 


20 


AAATGAGTGA 


CTTAGATGCA ATTTACAAGA AATTACAACA 


ACAACCACAA AGTAATTCAG 


840 




GTATAATTGA 


TCGCTATTTC TTTAAATCAT TATACTATCG 


CCATAATTCA ATTATGTATG 


900 




AATTTGCGAC 


TGAAGCGCCT GGATTTACTA TTGATACACC 


TGTTGAACAA 


TTAGGAAGfC 


960 


25 


AATTGAACTT 


GCCTGACTTT TTAGAAGCAG AACGTGAACA 


AATTGAAAGT 


AAGTTACACG 


1020 




AAATATAAAG 


GAGAATGTTT AATGGCCAAA TTAGAAATGA 


ATAAAAATAC 


GCCTCTTGAG 


1080 




TTTGGTTTGT 


ATTCCTTAGG TGATCATTTA TTGAATCCAT 


TGAAAGGTGA 


AAAAGTTAGT 


1140 


30 


TATGAGCAAC 


GTATTAATGA AATTATTGAA GCAAGTAAAT 


TAGGAGATGA 


AGCAGGTATT 


1200 




GATG'l"ri"I''l'G 


CAGTTGGTGA AAGTCATCAG GAGCATTTTA 


CAACACAGGC 


ACATACGGTT 


1260 


35 


GTGTTAGGTG 


CAATTGCCCA AGCGACAAAG CATATTAAAG 


TTTCAAGTTC 


TTCAACGATT 


1320 


ATTAGTGCAc 


AGATCCTGTA AGAGTATTTG AAGACTTCGC 


GACATTAGAT 


TTGATTTCTC 


1380 




ATGGTAGAGC 


CGAAATTGTA GCTGGCAGAG CATCAAGAAC 


AGGTATTTTT 


GACTTGTTTG 


1440 


40 


GCTATGATTT 


AAAAGACTAT GATGAATTGT TTGAAGAAAA 


ATTAGGTTTA 


CTTTTAGAGT 


1500 


TAAATAAAAC 


TGAGCGTATT ACTTGGTCTG GAAAATATCG 


TCCAGAACXT AGAAATATGA 


1560 




AAATATTGCC 


AAGACCAATC GATAATATAT TGCCAATATG 


GCGTGCTGTT 


GGTGGTCCAC 


1620 


45 


CTGCAAGTGC 


TATTAAAGCX; GGAAAACAAG GTGTGCCAAT 


GATGATTACA 


ACCCTTGGTG 


1680 




GCCCAGCAAT 


GAACTTTAAA GGTTCTATAG ATGCTTATCG 


TCAAGCGGCA 


ACTGAAGCAG 


1740 




GTTTCGATgc 


TTCGCCTAAG TCTTTACCAG TAAGTACAGC 


GAGTCTGTTT 


TATACAGCTG 


1800 


SO 


AAACAACTCA 


GGATGCTATG AGAGAATTTT ATCCACATTT 


GAATACAGGG 


ATGTCATTTA 


1860 




TTCGTGGTGT 


TGGTTATCCG AAACAGCAAT TTGCTAATTC 


GTCAGATTAT 


cx;agaagcgc 


1920 
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GTCATCAACG TTTTATGGCA CAGCTTGATT TTGGCGGTGT GCCATTTGAA AATGTTATGA 2040 

AGAATATTGA GTTAATTGGC AACGACATTA TACCGGCGAT TAAAAAGCAT TTATCAAAAT 2100 

AGGAGGGGCG TCATCATGAA TATTGTATTA TTGTCAGGTT CCACAGTAGG TTCTAAAACG 2160 

AGAATTGCTA TGGATGATTT AAAAAATGAA CTAGAAGTCA TCAATGAGGG ACATCAAATA 2220 

GAGTTGATGG ATTTACGAGA ACITGAATTA GAATTTAGCG TTGGAAAGAA TTATCTAGAT 2280 

ACTACAGGAG ATGTATATAA ATTAACGACG TCGTTAATGC AGGCTGATGT GATTTTTATT 2 34 0 

GGTTTTCCAA TTTTTCAAGC TTCCATCCCT GGTGCTTTGA AAAATGTGTT TGATCTACTT 24 00 

CCAGTCAATG CGTTTCGTGA CAAGGTAATA GGACTTGTAG CGACAGCAGG TTCTAGTAAA 2460 

CATTATTTAA TTCCTGAAAT GCATTTAAAA CCAATATTGA GTTACATGAA AGCACATACG 2 520 

ATGCAAACGT ATGTATTTAT TGAAGAGAAA GATTTTTCAA ATCAACAAAT TGTCAATGAT 2580 

GATGTTGTAT TTCGGTTAAA AGCGTTGGCA CAATCCACAA TGCGAACTGC CAAAGTACAA 264 0 

CAACAAGTGT TTGAAGAAGA AAACAACCAA TACGACTTTT AAAGTATAAA AATAAGACGC 2700 

TCGGCACACT AAATTTGTAA GTGTTTGAGC GTCTTTTCAT ATTAACTATA TAGCCAATGA 2760 . 

2s ACGACGATAA AGGCAAGTGA TGACAAGCAT ATTGAGGTAA TAATGATTGT CATAAGCGGT 2 820 

TTAAGTGCGC GATTTTTAAG ATCTTTAAAT GCAACATTTA ACCCTAAAGC AACCATGGCC 2 8 80 

ATTAATAAGC AAATTGTTGA TACAGTATTT AAAATATTTA GCAATGCTGA CGGAATAGTT 294 0 

30 ACATATGTAT TCACTAAGGC CATAATGACA AATCCAATTA AAAAGTATGG AATGCTTATT 3 0O0 

CGACCCTTGC TAGATGATTC TGATGAACGG AAACGCATAA TTAAAATAAG TACGATGGTT 3 060 

AATGGAATCA GTAAGAATAC TCTACCAAGT TTACCAAGAA GTGCAATTTT AAGTGCATCA 3120 

CTACCACCAA AGCCACCAGC TAAGACAACG TGTGCAATTT CATGAAGACT AACACCAGAC 3180 

CAAGCGCCAT AAACATTTGT CGTCATTGAA AAGATAGCGT AGATAGCTGT ATATATAAGT 3240 

GAAAATATCX3 TACCAATCAA TGCGATGATA CCGATACTAA TAGCTGTATC CTTTTCACGT 33 00 

GATTTGAATA TTGGAGCGAC TGCGGCAATA GCAGCAGCAC CACAAACXSCC TGTGCCGACA 3360 

CCTAGTAATA ATGCGATGTT TTTGTCACCA TGCAACAGTT TGTTGACAAA GAGCATCATT 3420 

ACAATACTGA AAATAACGAC ACCTACATCG ATGGCTAATA GTTTACTACC TTGACCGATA 34 80 

ATATCGAATA TATTGAGTTT AAGTCCATAT AGGATGATTG CAAATCTTAA TAAATATTTA 354 0 

GATGAAAACG TAATACCTGA GCTATATTGT TCAGGATATC CTCTAA^GTG ACGATATAGA 3 600 

SO ATAGCGATTA ATATCGCGAT AGTTAATGCG CCAACCTTAT CTAGGATTGG CAATTTAGCT 3660 

GCTAAAAAGC TAAATAATGC GACTATAAAT GTTAGTGATA GCCCAATCAT AAAATGCTTA 3 72 0 
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ATTTTTAAAT 
GGAAACTTGG 
AAAGAAGCTG 
AGAGAAATTG 
AATTTCGAAA 
CAATGCGTAA 
TGAGGATGTG 
GAAGCTCATG 
GAATGGTTAC 
AGAGGACTTT 
ACGCTAGTTG 
GGGTACTATT 
TTGATAAAGT 
AATAATGAAA 
CGTGACAATG 
TCAGAATACG 
CAGCGCTCTT 
ATATAATGAT 
CAGCCGTTTT 
AAGAACACAT 
ACAACGCTAT 
AAGGATGATT 
ACAAAAGCAT 
AAATTTGTTA 
GTTCGTATGT 
AAAATTGATG 
TATCATAAGA 
GATGCACAAC 
ATCACACCAA 



ATAAATTTGG 
ACATCAATCA 
ACGAAAAGTC 
GGTCCGTAAT 
AGAAATTCTA 
CATTAATATG 
ATGGTATGTT 
ACACAGAAGC 
CTTGGGTAGA 
TGCAATTTGC 
GTGTCATCGG 
TAGATAAAGA 
ATTGTTTCGA 
AAAGCCGGGC 
AATTACTAAA 
ACCAAAAATG 
TTATTTAATT 
TTTGATAATT 
AAGTGTAATG 
CTATGGGGAT 
TATTAGCATC 
CAAAGACATC 
ATGATAAATA 
AAGCX3ATTGA 
ATTATGAACG 
CACGTCTTGC 
TTGAAAAAGC 
AATTATTGAA 
AATTAATGTT 



AATGAATAAT 
AAGTAATAGG 
agCTTGcAAT 
TTCTACAGAC 
CAAGCAATGC 
TTATTATAAT 
TGGAATGAAA 
GCTTTTCAAT 
TGCAACTGAG 
TGATGGTAAT 
TTTACATGAA 
ATTTGAGGGT 
AGAGCTTGAC 
TATTCCTGAA 
TGGTATTTAT 
ACAAATTAGA 
TTTGAAAATA 
ATTATCAACT 
TATTATTTTA 
AATAGAATTT 
AACX3TTATTA 
GAAAAAAGAT 
TACTGATGAA 
AAATAATGAT 
CTCTGAACCA 
AGATATGAAA 
ATTATACGAA 
AGATGCAAAA 
ACAAGGTTCT 



AAAGTAGTGA 
CACTACAACG 
AATGTGCAAG 
AATGCAAGTT 
AAGTTGGGGA 
GATAATTTAC 
GTGAATGAAC 
TTAGTCAATC 
CAACCATCAG 
GGATTTCAGT 
ATTAATCACA 
CATGGGATTA 
TTAAACCGAA 
AGGCTGGGAT 
TCATCGAGTT 
CTTACAAAAG 
AAAGGTTGTT 
AGATGATGTt 
GCGTGTAGGG 
CTATAATGAG 
ATTGCTGCAT 
GATGGTGTTA 
CAGTTAAATG 
ATGGCCCAAG 
GTTGCAGAAG 
GAAGAGAAAA 
GATAAGAAAA 
GAATTGCATG 
GTTGACCTAT 



TTAAATTAAG 
CTTATTGGCG 
TTGGGGATGG 
GGCGGGGCCC 
AGGACAACAA 
AGAATTATAT 
AAATAACATT 
GTTCAAGAAA 
ATAOGCGTGC 
GTGGCATTTG 
TGCACAGAAA 
TGACACAAGC 
TTGAGATTAG 
TTACTAGAGA 
ACATCTATAG 
AGTGATGACA 
GACAGTATTA 
TATGGGAGGA 
AATGCGAAAA 
GTGTCAAAAT 
GTGGGAACXSA 
AAGCAGAATT 
AATTTTTAAA 
CAAAAGCGTT 
CATTTGGAGA 
AGGAAAAAGA 
TTGATGATGT 
CCAAAGCTGA 
TAAATGAAGT 



TTGTGTGATA 3840 
GGGCCCCAAC 3900 
GCCCCAACAT 3950 

CAACATAGAG 4 020 

ATTTAAGATA 40 80 

GAAAAATGAA 4140 

AAAAATTTTA 4200 

TTCACTTAGG 4 2 60 

ATTTATTAAA 4320 

GTATGAAGGA 43 80 

AACTTCATTA 4440 

AGTTGAGGCA 4 500 

TGCCGCAGTT 4 560 

AGGTATGTTA 4 620 

TTTATTAAAA 4680 

TTTAAAATGG 4740 

TTTTATAACA 4 8 00 

TGCTTTAAAA 4860 

TAATATTTAT 4 920 

GAAAAAGTTA 4 980 

CGATAGTAAG 5040 

AAAACAAGCA 5100 

AGGTACAGAA 5160 

ATATCCAAAA 5220 

TTTAGATCCT 5280 

ATGGTCAGGA 534 0 

GACTAAAAAA 5400 

TACATTAGAT 5460 

TGCAACTTCT 5520 
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GTTGAAGGCG 


CACAAAAAAT 




X 1 1 iWW\w^ 1 A 


TTTTAGAQAA 


31 & Ik H ^ Ik T^lk Ik Ik 

AAAAGATAAA 


S640 




AAATTAAGTG 


ATGATATCCA 


AATGAACTTC 


GATAA&^T/^& 

X /UV\V9 1 \J£% 


Al k-AAl XAX X 


AT*?! Ik Ik T* >k»T» 

vivaAxAAATAT 


5700 


5 


AAAGATAACA 


ACGGCGGTTA 


TGAGTCATTT 


GAAAAAGTAT 


GGAAGAAAGA 


X AAAuVjA 


5760 




TTTGCGGATG 


CTGTTAATGC 


ATTAGGAGAG 


C CACT AAGT A 


A A ATrSTS rTYlT 
MMn X LjVjL. 1 1 


v*AX X AU. XvoAA 


5820 




TGACAAATTA 


TGAACAAGTT 


AACGATAGTA 


\>rV9\vAnX X X x^ 


A AGAGGTA/^ A 


^^^^^^^^^^^^ A A. ft A 

X 1 1 X XijAAAA 


5860 


10 


TGTTAGGTAT 


TGGCGGTGCC 


GGTGTTGCAA 




1 1 w 1 L\j\a X 


AijvJAX vj X tJO 1 


5940 




CTTTCAAATC 


AATGTTCAAT 


ACACCAGAAG 


ATGPGGAAAA 


A n A TYSOm* A T 


CjAAI X 1 lAIo 


€000 


15 


GTAAAGTGCA 


ACCAGGCATT 




r^G^* A A A A A a r* 


n X ViV-AA X 1 X 


GTTGCGTTAG 


6060 


ATTTGAAGTC 


AAAAGATAGA 


GATGCAATTA 


AGGr*A ATY^*r^ 


'PA AAA Ar**IY5/^ 


ACGGTTATGG 


6120 




CTGATCGTAT 


GATGGATGGT 


GATAryXfSTTG 

X raVw/rO X X VJ 


Gr* A A Gr* r'Y; A ri 


X AAK-AAX X 


TTAATGCCAC 


6180 


20 


CAGTAGATAC 


CGGTGAATCG 


X X X n\j 


GTGGA AGT'AA 


^^T*T*a a ATT 

X X AA^uA X i 


AuCTTTGGGA 


624 0 




TTAGTAAGTC 


TTTGATGAAG 


AAAATT^^^^f 


T* A ^f^^*!* A7°iT a A 


AAl iuV-^UAT 


gcctttaaag 


6300 




ATTTACCGCA 


TTTTCCGAAT 


GATCAGTTaA 

X \»>/^w X X 


T* A G A rrj A xnr A 


f a/Ti/^/^ a *p^v*'p 


fk ^1k ^^1k TV* Tk 

GATATTATGA 


6360. 


25 


TTCAAGCATG 


CTCAAATGAT 


TCGPAAGTAT 

X \^\J\.J\£WJ 1 X 


1X1 ViiaIIjV. 


viOX XCAXAAT 


TTAGTTCGTC 


6420 




CATTTCGAGA 


TATTGTTAAG 


GTACGTrTGGCS 
vjx#%^— vjx x^'Wj 




111 Inl 1 L« 1 


tiCTAAAGGTA 


6480 




AGGAAACACC 


TAGAAATTTA 


ATGGCATTTA 

'^XVJwVpi/^X X xx% 


AAGAIYIGAAG 


AaX XAAX(^l»X 


a^/^a a/^a a*T»ik 
Av^AAGAATA 


654 0 


30 


ATCAACTTAA 




TTTATTGATG 


ACGGATGGGG 


GAAAGAXGGA 


A. ^^^^^p A ^^^^^^ ^^^^ 
A^ X X A 1 X\7\.Vj 






TTGTCAGACG 


TATTCAAATA 


CACATTGAAA 


CGTGGGATGG 


1 /W« X Vj\^VjV^ X \J 


/^a ar*a a ^a a/^ 
vjAAIjAAw\Au 


6660 




AGGCTACATT 


TGGTCGGAAA 


CGACATAGTG 


GTGCAC!CGTT 


AA GAGGTGGG 


ft ft ft ^5 ft ^^^^^^^^^^^ 


£ "7 A 


35 


ATGAAATTGA 


CTTAAAAGCG 


AAAGATAGTC 


ATGGCGAGTA 


TATTATTGAT 

XAX XAX XVM%1 


AA AGATGG/^^ 


o /a (/ 




ATACGAGGCT 


AGCGAAAGAA 


GCAAATACGT 


f^AATn^TTAGG 

X X X X 


T* A G A GG GT'P'^ 


A A TT A fYS'WV2 
AAX 1A1\9X\JVJ 


oo40 


40 


ATGGTACGGA TGACCGGACA GGTAACTTCX3 


AAACAGGCTT 


W X X X X X n X X 


f^^^T^rn^p^ A A A 

U\.X X X X\>^vw« 




AAGCGACAAA ACAATTTATC GATATACAAA 


ATAATTTAGG 


TAGTAATGAT 


A A ATT A A A*TVi 
AAAl 1AAA1\J 


£ OCA 




AATATATTAC 


ACATAGAGGT 


TCTGCTTCAT 


TTTTAGTATT 


AGGAGG' 1 Y J*IT 


AGT A AGGG A G 


/ U^U 


45 


GATACCTTGG 


TGAAACATTA 


TTTGACTAAA 


TTTGTAGGAA 

XXX w X #\ 


TYIGTAATAAG 


1 UV.. X V9WX A Xu 






GTGTGTAGCT 


TTGGGTTACT 


GAAAAGTCAG 


GCAGCAGAAC 


AACAAAGTAT 


TAGTGATGTA 


7140 




TATAGTGTGA 


TAACX3GATGC 


GAAATCTGCA 


CTTTCTAATA 


ATTCGATATC 


GAATGACAAT 


7200 


50 


AAGCAGAAAG 


CAATTGAGCA 


AGTGGTAAGT 


GCAGTTAAGA 


AATTATCGCT 


TGAAGATAAT 


7260 




AGTGAAAGTA 


ATGCTGTCAA 


ATCAGATGTG 


AGAAAGCTTG 


AAGATGCAAA 


AGCGAATGAT 


7320 
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GCTAGTAAAG ATGCGGGTTC TAAAATTAAA CTATTGCAAC AGCAAGTCGA TGCTAAAGAT 


7440 




GCTGCGATGA CAAAAGCGAT TAAACSATAAA AATAAAGCX3G AATTAGAATC TTTGAACAAT 


7500 


5 


AGTTTGAATC AGATTTGGAC AAGTAATGAA ACAGTGATTC GCAATTATCA CGCAAATCAA 


7560 




TATGGACAAA TTGAAGTCGC ATTATTACAA CTTAGAATTG CAATTCATAA GTCACCATTA 


7620 


10 


CATACGGCAA AAGTGTCACA TGCTTGGACA ACTTTTAAAT CAAATATTCA TCATGTCGAT 


7680 


AAAAAAAGTA ATACGTCTGC AAATGATCAA TACCATGTAT CACAATTAAA TGATGCGTTA 


7740 




GAGAAGGCGA TTAAAGCTAT CGACGACAAT CAATTGTCGG ATGCTGcaTg TGCGCTTACA 


7800 


15 


CATTTTATAG AAACTTGGCC GTATGTTGAA GGTCAAATTC AAACTAAAGA CX3GTGCTTTG 


7860 




TATACGAAAA TTGAAGATAA AATACCATAT TATCAAAGTG TATTAGACGA ACATAATAAA 


7920 




GCACATGTGA AAGATGGTTT AGTAGATTTA AATAACCAAA TTAAAGAGGT TGTTGGCCAT 


7980 


20 


AGTTATAGCT TCGTCGATGT GATGATTATC TTTTTACGTG AAGGGCTAGA AGTGTTGTTA 


8040 




ATTGTAATGA CATTGACTAC CATGACGCGT AATGTAAAAG ATAAGAAAGG GACTGCAAGT 


8100 




GTGATTGGTG GTGCAATTGC CGGACTTGTA CTGAGTATTA TCTTAGCAAT TACGTTTCTA 


8160 


25 


GAAACTTTAG GGAATAGTGG CATTCTTCGT GAAAGTATGG AAGCGGGATT AGGTATCXSIT 


8220^ 




GCGGTCATAT TAATGTTTAT CGTTGGTGTT TGGATGCACA AACGTTCAAA TGCAAAACGT 


8280 




TGGAATGACA TGATTAAAAA TATGTATGCT AATGCGATTA GTAATGGTAA TTTGGTATTG 


8340 


30 


TTAGCGACGA TTGGTTTAAT ATCTGTGTTG CGTGAAGGTG TCGAGGTTAT CATTTTCTAT 


8400 




ATGGGGATGA TAGGTGAGCT AGCGACCAAA GATTTTATTA TTGGTATTGC TTTAGCTATC 


8460 


35 


GTTATTTTAA TCATCTTTGC ATTATTATTT AGATTTATAG TTAAATTAAT ACCTATTTTC 


8520 


TATATATTTA GAGTGTTGTC GATdTTATT TTTATTATGG GATTCAAAAT GCTTGGCGTA 


8580 




AGTATTCAAA AGTTACAATT ATTAGGTGCG ATGCCAAGAC ATGTTATTGA AGGATTCCCA 


8640 


40 


ACGATTAACT GGTTGGGCTT TTATCCAAGT TATGAACCAT TGATAGCACA AGGTCCTTAT 


8700 


ATTATGGTAG TTGCTATCTT AATCTTTAAA TTTAAAAAAT AAAAAACAGG CCGAGTGCCT 


8760 




Gaxxiixxtx^ TTGCTATATT GGAAATATTC GGTATTGCAG TATAACGATA ATCACAGCAT 


8820 


45 


TGATTCGTAT AAGGTTAATG TGTTGGCGGT TTGCCTCGGC ATGTGAACTT AACGATGAAC 


8880 




ATACTGAACT CAAAGAGCAA TATGAGTGGC AATGTGAGTA ATATATTTAA TCTTAAATCG 


8940 




GGTGGTGCAA TGATACTTGC TAATACAAAG CAAGCGAAAT AAATATATTT ACGrTAATGT 


9000 


50 


TTCAATGATG TGGTATCTAT AAGACCGAAT TTTGCAAGAC CCATAAATAA TATTGGTAAT 


9060 




TGAAATAGAA GACCAAATGT GAATAACCAA CGTATGAGTT CAATCAAATA TGCTTTAAAG 


9120 
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40 
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50 



GGAAAGCCAA CATAAAATGC AAAAGCGACG CCAGCACAGA ATAATAACAC GCTGAAAAAA ' 924 0 

CTATATTTAT AAATAAATTG ACGTTCATTA TTATGTAATC CAGGTGCAAT GAATGCCCAC 9300 

AATTGATAAA 9310 
(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

IS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

ATTAGTATTA CACACTAATG TAAATAGATT GGTTGGAGAA GAGATATTTG CTAATAAGTG 60 

CCTTGCTAAT AATGATGTTC AAATTATGAA TTCAATAAAA AAATTAATTG AAGCTGAGTT 12 0 

ATTAACAAGG ACTAATGATT TTGAAGTTAG TATATATAAA AAGACAAGAC CTGAATTACA 180 

AAGTATTTTA AAAAGTTTTG GTATAAAAAC AACAGGTAAT -AAACCTGACT TAATTAAACG 24 0 

TATTGACGAC AATTTTCATA TTATTAATAA CTTAGATTTA CCATATGTAT ATATACCAAT 300 

TAAGAAAGGA GAAGAAATTT TAAAGAAAAC AGAGTACTTA ACCTCTTTTA TACAAAGTTA 360 

TGGTGAAATT TCTCTTGAGC GTGCTTATTA TTTGGTTGAA AACTATATAG ATGAAAATTG 420 

30 CGATGATAAA GTTGCAGAAA TATACAAGTT TGAATTTCAA AGAAAATATG ACAATGGCGA 4 80 

GTTTGATTTT AATCATGGAT ATAATTTCGA ATTGAATATG TTGATAGATC ACTATAAAAG 540 

AGATGTAAAA GACTACGATA ATGCCAGAAA GTATTCAAAT ATTTATCTTT ACTTTGGTTT 600 

GAGAGATTTC TTAAAAAAAT TAATGAGTAA TTATTCATAT TACGATAgTA AAGGGAATAT 660 

AGATTTGAAC GAAATACAAA ACGATCTGAA TAGATTTATC AACTCTAGCG CTTCTGGTAT 720 

GTACGAGCGA TTAATATATA ATGAAAATTT ATCCAATAAT ATTATGTTTG AATTATTTAA 780 

AAAGGACACA CAAGATTATA GTGATTTGGA AGAACAATTG ATTGAAAAGT TCATAAACTA 840 

TGTAGTGTCT AATGTAAAAA AAGAAAGTAG GAGTAATACT CTTATAGAGT TATCAAAAAT 900 

TTTAGAGAAC GGATATACAA TTGATAAAGA AGAATTTAAA AAAGAAGATG ATTATCTTTC 960 

TAAGTACATA TTTACTGACA TAGATTATTT GAAAAAGTTA GAATCAAAAA TAAACGTTGC 1020 

TATTGATATT CGAAGTGGAG AAATTCATTT GGTATTAGAT GATGATAGCC TTGATATATT 1080 

AATACAAAAT CAAAAATACG GCAATGAGTT TTGAGTCATA ACTAAATATA ATATGTTGAA 1140 

GAGAGGTTTT ATTTAAATGG CTAAAATTGG TTATGATCGT GTATCAACGA AAGATTAATA 1200 
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AACGTACAGA GCTTGTTAAG TGTTTAGATT ATTTACGAGA GAGCGACACA TTAGTTCTCT 
ATCAACTTGA TCGGTTAGGT AGAACGACAA AACTATTaAT TGAATTaTCA CAATGATTCG 
ATGATAACGG AATTGACTTA CAAATTAGTA ACATGAACAT TTCAACGAAA GACACAATGG 
GCAAAATGTT TTTTACGATG ATGAGTGCAT TTTTCaGGAT TAGAAGTTAA TTTACTATGT 
GAGTGTATAA AATAGACTTA GCAGCAACAA GAGCGAGAGG CCGAAAAAGC GGGCGCCCCT 
CTTTACCAGA GAATAAAAAA TGAGAAATTA AATTTTTATA TGATGAACAA ACGATAACAG 
GGGAAGAAAT AGCTAGTTAG ACAAGGGTAT GTCACTCAAC TGTTTATCGA GTTGATTAAG 
AAATGAAAAA ACTTATACTA TGAATTACTG TTTAAAAGTG TGCATGTTAT AATATTTATT 
GAGCAAGTTG GATAGATGGT GGCTAATCTC TTAATAAAGG GGTCATCCCT ATGGTTATAG 
TTGTTACTCC TAGnAAAGGA CTAGCATGTC TGATTTTGAA ATGCTTATGG TrGTATTAAC 
20 AATCATTGGT TTAGTATTGA TTAGTACTCA AGACCATAAA AAATAaCCTT CTATTCGCTT 

TGACCGGCAT TTTTGAAGGC TATTTTTAAA TAAAATATAA GGTCACCGTC ITTTTAACGG 
GCTCATTAGG GTAACATGTT TCCGAGTGTT GCCCTTTTTG TGTTTCAAGA GTTAATGATT 
2B TTCATCTTTT GCTTCTACTT GCTACAAATA TATTTTAACA CATTTTrCrr ATGAATTGTA 

GTTCTGAACA TAATCAGAAT TAATAAAACC AACTTTCCAT PJZt^QQI^P^ PiTACAATTAA 

AAGTATAGAA TGTAATCACG AATTATATAA AAAGTATTGG AGACCTCTTC ATATATAGAT 

30 

AATTGACTTA GTTATTTTAG AAAGAAGCCC CTAaCAACTA AAGTTGAAAA ATAGAGGAAC 
ACAGTTgGAT TACGCATCAA CTGCATaAGg CCCCTAAmAA CTAAAGTTGT AAGGGGCyCT 
AAAATTTATT TTGGTTGATT GTCTTCTGGT TTATCTGAAG TCATTGTTTT TGTTGTATTA 
TCATTTAATG AATCTTGTGC TTTTTCTGTT TTTGTTTCAA CAGATCTTOT CACTTTATCA 
TTTTCTACTT TTGTATTTTT TGGTTCAGTG ACAACTTTTT TATTCTTGTC AGCTGTTTTT 

^ ACTTTATCTT TAGTAATTTT TTGACCACGT TTTAAGTAAT ATTXSAACGAT ACCCATTAAA 

ATGATTGCAT GAATAACAAC GAATAAAATA ATTGTCATTA CTGTATAAAC TCTTATX3ATA 
TTTTCTGCAA CACTTTGAGA GAGTGATTGC GTATTAAATG AAATCAAGTA ACCAACAGGT 

4S GTTTTAAATA AAATAATAAT TAAGTTTAAT AATAGTATTC CAATGAAAAA TTTGAAAAAT 

GTTTTTTGAC CATTTTTCAT TGCTTTAAAT CCATTAGCTA AGTGTGTTTT TACTTTATCG 
TTAGTTGATT CaACGAAACT AGTCaTAAAG TTGAAAATAG GTATTAACAA TAACCAAGTA 

SO ATAATTGCTT TAATCAATAA AACAACAATG ATTATGATGC TTTGAGTTX3T AATAGCAATA 

CCAATCAGGT GTGTACTATC TGCATATGAT GATTGTACTG AATTCATAAT CATTTCTCAT 
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ATAGATACAA GACCGATCAG AACGCTCTTT AAATAATTAC CTTTTTTAAA TGCAATAAAT 3120 

AAATCGGTAA ATTTTACTTT TTCATGACTC ATTGCTCGTT TCATAACATT TGTAATTCCG 3180 

ATAAAAATTT GAACTAAAAC AAACAGTGAA ACTACAGCTG CTATAAGTAA TACAGCGATA 3240 

A C TTTTAAAT ATGCATCTAC TGGTGGTTTT TGCCCAAATT GTGAATAAAT TGCAATAGTT 3 300 

TGTGCATACT TAGCAAGCTG GAAATTAGCA AGTAAATATA CA/fCTGCAAT TACAGCAAAT 3360 

GCTATAAGTG CATATATCAA TGTTATTAAT AATTGTGGTT TAGCATTTTT AAATGCTGAT 3420 

TTAAAGTAAG TAAACAAAGT GGTGCCTCCT TTTCTTCA 34 58 
(2) INFORMATIGM FOR SEQ ID NO: 318: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 
25 GCTTGCAAGA TGCTTTCATT AAAGACAAAG TCATCGATAT TATGAATATG TTCCAAAATC 60 

ACACTGATAT CACTTATACG TTGAATAAGT CTCATGCACA TATTATATGT ACGCCAGAaA 120 

TATTTGCTAA ATTGTTACAT ACGATTGCAA CTAGAAATAT CGACATTCTA TCTGCCAATT 180 

ATAGATCGTC ATCTATGAGC AAAGCTCGTA TATCATAATA AAAGACACAT CTATACGATG 24 0 

ATCATTTACA ATGATTGCGT ATAGTGATGT GTCTTTTTTT GTATTATTTA TCTTCGGAcG 3 00 

ATTGATCCTC AACCATTTGT TCAAATGCTT CACGAACTTG TGGTACTGTC ATACCTACTA 360 

CAACTTGGAT ACTCTTACCA CTCTTTACTA ATCCGTGTGC CATrTGTTGA TGCGTGAAAT 420 

ATTCCGTATC TGCAACTTTA CTTTCATCAT AAACAGTTAA GCGTAATCTT GTTGTACAGT 4 80 

TAGTAACATC TTTGATATTT TCTTTGCCGC CTAAACCGTC GAGGTAATAT GCAGCTTTGT 540 

CTTCGTATTC ATTACCAGAT GAAGCACGTT TAGAATCTAC GCTATCACCT TTTTTATTTT 600 

TGTAATCTTG CTTAGAAAAT AATTTAACTT CCTCTTCTGT TTyCTTACGT CCAGGTAATG 660 

GAATATCAAA TTyCAAAATT AAGAATCTGA ATAAG 695 
(2) INFORMATION FOR SEQ ID NO: 319: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 base pairs 
5^ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 
AATATTGaTa TTATATATAG TCGAATCaAT GATGACCCaC ATCGACCtAA TGCAAATAAT 
AAGACAATCA AGCAGTTAAA TGACTTGTAT TGCTCGCGGA ATTTAAGATA TAATGAAATC 
GCATTCAAGA AATTTGATAG TCATTTGCTA TCAATTTCAG AAATATATTA TGAATTGCTA 
AATTTAATAA AAGCGAGTGA TCAGTATTAG AGAGAATAGA GCGTTAAGAC TCTATCGCCG 
AAGtGCAAGT AATTTATTAC GAAACTCTCA GGcAAAAGGr TAATACTGTA ACGCGTTCCT 
GAATTGGTGA TTTATAAACA GGGTAGCGAT TGCTATCCTG TTTTTATAAT TTTAAGGGGG 
TATTTCAATG TCAAGTGATT TAAAACAAAC ACCTTTATAT CAAAATTATG TTGATAGAGG 
TGCAAAAATT GTGGAATTCG GAGGATGGGC GATGCCTGTT CAATTTTCAA GTATTAAAGA 
GGAGCATAAT GCTGTTCGAT ACGAAATTGG CCTGTTTCAT GTTAGTCATA TGGGTGAAAT 
20 TGAAGTAACA GGTAAAGATG CTAGTCAGTT TGTGCAATAT TTATTATCAA ATGATACTGA 

TAATTTAACT ACTTCAAAAG CATTATATAC TGCTTTATGT AATGAAGAAG GCGGTATTAT 
TGATGATTTA GTAATATATA AATTAGCTGA CGACAATTAT TTATTAGTTG TTAATGCTGC 
TAATACTGAA AAAGATTTTA ATTGGATTTT AAAACACAAA GAGAAATTTG ATGTTGAAGT 
ACAAAATGTA TCAAACCAAT ATGGTCAATT AGCAATACAA GGACCAAAAG CtAGAGATTT 
AATTAATCAA TTAGTTGATG AAGAkGTAAC TGAAA 
(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5897 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



25 



30 



so 



^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

TCTTTTTACG AAGGCTACCG TTTAGGATCT GATAACTTAA CTGTAGGAGA AATTGTATTT SO 
AATACAGCGA TGACAGGTTA TCAAGAAAGT ATTTCAGATC CATCATATAC AGGTCAGATC 120 
45 ATTACTTTTA CGTATCCATT AATCGGTAAT TATGGTATCA ATAGAGACGA TTTTGAATCA 180 

TTAGTACCTA CATTAAACGG TATTGTAGTC AAAGAAGCGA GTGCGCATCC AAGTAATTTT 240 
AGACAGCAAA AQACACTTCA TGACGTTTTA GAATTGCATC AAATTCCAGG GATTGCAGGT 300 
GTTGATACAA GAAGTATTAC GCGTAAAATT CGACAACACG GTGTGTTAAA AGCTGGTTTT 
ACTGATCGAA AAGAAGATAT TGATCAACTT GTCAAACATT TACAACAAGT AGAATTACCT 



360 
420 
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AGTGTCGTAC TTGTAGACTT TGGTAAGAAG CAAAATATTG TTCGAGAATT AAACGTCAGA 540 

GGTTGTAACG TCACAGTTGT ACCATATACA ACTACTGCCG AAGAAATTTT AGCAATGGCT 600 

CCAGATGGCG TTATGCTATC AAACGGACCA GGTAATCCTG AAGTTGTAGA ATGTGCGATT 660 

CCAATGATTC AAGGAATTTT AGGGAAAATT CCGTTCTTTG GTATCTGTCT AGGACATCAA 720 

CTTTTTGCAT TATCTCAAGG AGCAAGCTCA TTTAAAATGA AGTTTGGTCA TCGTGGTGCG 780 

AACCATCCAG TTAAAAATTT AGAGACTGGA AAAGTTGATA TTACGAGTCA AAACCATGGA 84 0 

TATGCAATAG ATATAGATTC GTTAAAAAGT ACTGATTTAG AAGTTACTCA TCTTGCATTA 900 

AATGATGGTA CTGTAGAAGG TTTAAAACAT AAAACATTAC CAGCATTTTC TGTTCAATAC 960 

CATCCTGAAG CAAATCCAGG ACCGTCAGAT TCAAACTATC TATTTGATGA TTTTGTAGCA 1020 

ATGATGACTA ATTTTAAGGA AAAGGAGCGT CATATCAATG CCTAAACGTA ATGATATCAA 1080 

AACAATTTTA GTAATAGGGT CTGGGCCAAT TATCATAGGT CAAGCAGCTG AATTTGATTA 114 0 

TGCTGGAACA CAAGCATGTC TAGCTTTAAA AGAAGAGGGA TATCGAGTTA TTCTTGTAAA 1200 

TTCAAATCCA GCGACAATCA TGACTGATAA GGAAATTGCG GATAAAGTAT ATATCGAACG 1260 

GTTAACTCAT GATTTTATAG CGCX3AATTAT ACGTAAAGAG CAACCTGACG CTTTACTTCC 1320 

AACTTTAGGT GGTCAAACAG GTTTAAACAT GGCGATTCAA CTACACGAAA GTGGTGTGCT 1380 

TCAAGATAAT AACGTCCAAT TATTAGGAAC TGAGCTAACA TCAATTCAAC AAGCAGAAGA 144 0 

CCGTGAAATG TTTAGAACAT TAATGAATGA TTTAAACX5TT CCTGTACCAG AGAGTGACAT 1500 

TGTAAATACA GTAGAGCAAG CCTTTAAATT CAAAGAGCAA GTGGGATACC CGCTAATTGT 1560 

TAGACCGGCA TTTACGATGG GTGGTACCGG AGgCGGTATT TGTCATAATG ATGAAGAATT 1620 

ACATGAAATC GTCTCAAATG GTCTTCATTA TAGTCCAGCA ACGCAATGTT TATTAGAAAA 1680 

ATCTATCGCA GGTTTTAAAG AAATCGAATA CGAgTAATgC GTGaTAAAAA CGATAATGCC 1740 

ATCGTTGTAT GTAACATGGA AAATATTGAT CCAGTTGGTA TTCATACAGG CGATTCAATT 1800 

GTTGTGGCTC CTAGTCAAAC ATTATCAGAT GTTGAGTATC AAATGTTACa TGATGTTTCA 1860 

TTAAAAGTTA TTCGAGCTTT AGGTATCGAA GGTX3GTTGTA ATGTTCAATT AGCATTAGAT 1920 

CCCCATTCAT TCGATTATTA TATTATAGAA GTAAATCCGC GTGTATCACG TTCATCAGCG 1980 

TTAgCTTCAA AAGCAACAGG ATATCCTATT GCAAAATTAG CTGCTAAAAT CGCGGTTGGT 2040 

CTAACATTAG ATGAAATGTT AAATCCAATT ACAGGAACAT CTTATGCAGC GTTTGAACCA 2100 

ACTTTAGACT ATGTGATTTC AAAAATACCA AGATTTCCTT TTGATAAATT TGAAAAAGGA 2160 

GAACGAGAGC TTGGCACACA AATGAAAGCA ACAGGTGAAG TTATGGCCAT TGGTCGAACT 2220 
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TTACCAAATG GTGAAAGCTT CGATCTTGAT TATATTAAAG AACGTATTTC ACACCAAGAT 
GATGAACGAT TATTTTTCAT CGGCGAACAA TTAGAAGAGG CACAACATTA GAAGAAATTC 
ATAATATGAC TCAGATTGAT TACTTCTTCT TACACAAGTT CCAAAACATT ATTGATATTG 
AGCATCAATT AAAAGAGCAT CAAGGTGATT TAGAATATCT TAAATATGCA AAAGATTATG 
GATTTAGTGA TAAAACAATA GCGCATCGCT TTAATATGAC GGAAGAAGAA GTATATCAAT 
TGCGTATGGA AAATGATATT AAACCTGTTT ACAAGATGGT TGATACTTGC GCAgCTGAAT 
TTGAATCTTC AACACCATAT TATTATGGTA CATACGAAAC TGAAAATGAA TCCATAGTTA 



2700 

,^ CTGACAAAGA AAAAATCTTA GTATTAGGCT CTGGACCAAT TCGAATCGGC CAAGGTGTAG 27S0 

AATTTGACTA TGCGACAGTT CACGCCGTTT GGGCAATTCA AAAAGCAGGG TACGAAGCGA 2820 

TAATTGTGAA TAACAATCCA GAAACAGTTT CAACAGACTT CTCAATTTCT GACAAATTAT 2880 

20 ACTTTGAACC TTTAACTGAA GAAGATGTGA TGAATATCAT TAATTTAGAA AAACCTAAAG 294 0 

GTGTCGTTGT ACAATTTGGA GGACAAACAG CGATTAATTT AGCAGACAAA TTGGCTAAAC 3000 

ATGGTGTTAA AATACTTGGT ACTTCACTAG AAAATCTAAA TCGTGCTGAA GATAGAAAAG 3060 

AATTTGAAGC ACTATTAAGA AAAATTAACG TGCCACAGCC ACAAGGGAAA ACAGCTACAT 3120 

CACCTGAGGA AGCATTAGCX3 AATGCTGCAG AAATCGGATA TCCGGTTGTA GTAAGACCTT 3180 

CTTATGTATT AGGTGGTCGC GCAATGGAAA TTGTAGACAA TGACAAAGAG TTAGAAAACT 3240 

ATATGACCCA GGCTTGTAAAA GCGAGTCCGG AACATCCX3GT ACTAGTCGAT AGATATTTAA 3300 

CTGGTAAAGA AATTGAAGTT GATGCGATTT GTGATGGAGA AACGGTCATT ATTCCAGGAA 3360 

TCATGGAACA TATTGAACGT GCTGGTGTGC ATAGTGGTGA CTCAATCGCT GTATATCCAC 3420 

CACAAACTTT GAOVGAAGAC GAGTTAGCAA CACTTGAGGA CTATACTATA AAATTAGCTA 3480 

AAGGTTTAAA CATCATTGGC TTAATCAACA TTCAATTCGT TATAGCTCAC GATCGTCTOT 3540 

ATGTTTTAGA AGTAAATCCA CGTTCTAGTA GAACGGTACC ATTCTTAAGT AAAATTACTG 3600 

ATATTCCAAT GGCACAATTA GCTATGCGAG CAATCATTGG GGAAAAACTA ACAGATATGG 36S0 
GTTATCAAGA AGGGGTTCAA CCATATGCTG AGGGTGTCTT TGTCAAAGCA CCAGTATTTA 
GTTTTAATAA ATTGAAAAAT GTTGATATTA CTTTAGGACC TGAAATCAAG TCAACAGGTG 

AAGTGATGGG GAAAGATACT ACATTAGAAA AGGCGTTATT CAAAGGGTTA ACAGGTAGTG 3840 

GCGTTGAAGT TAAAGATCAC GGTACAGTAT TAATGACCGT CAGTGACAAA GATAAAGAGG 3900 
AAGTTGTTAA ATTGGCACAA CGCTTAAATG AAGTTGGCTA TAAAATTTTA GCAACGTCTG 
GAAcAGCTAA TAAATTAGCT GAGTATGACA TACCTGCAGA AGTAGTAGGC AAAATTGGTG 
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TGACTAAAGG 


TAAAGAAGTA 


GAAAGGGATG 


GCTTCCAAAT 


TAGACGTACT 


ACAGTTGAAA 


4140 




ATGGTATTCC 


ATGTTTGACA 


TCTTTAGATA 


CAGCTAATGC 


CTTAACGAAT 


GTAATTGAAA 


4200 


5 


GTATGACATT 


TACAATGCGT 


CAAATGTAAA 


TCAATCAAAC 


TGTATCGGTG 


GGGCTGTAAT 


4260 




TAACCATTTA 


CTTAAAGAAG 


TTTATATTAC 


AGCCTCATTA 


TTTTAATGAA 


TTTCTTAATA 


4320 


10 


TAAAGGGAGA 


CaTATATGAT 


GAAAGATTTA 


CCAATTATTG 


CATTAGATTT 


TGAATCAAAA 


4380 


GAAAAAGTAA 


ATCAATTTTT 


AGATTTATTT 


GATGAATCAT 


TATTCGTAAA 


AGTAGGTATG 


4440 




GAAcrrrrrr 


ATCAAGAAGG 


TCCTCAATTA 


ATTAATGAGA 


TAAAAGAAAG 


AGGCCATGAT 


4500 


15 


GTATTTTTAG 


ATTTAAAACT 


GCATGATATT 


CCTAATACAG 


TTGGTAAGGC 


GATGGAAGGA 


4560 




CTAGCTAAAT 


TGAATGTTGA 


TCTGGTAAAT 


GTTCATGCTG 


CTGGTGGCGT 


AAAAATGATG 


4620 




TCTGAGGCCA 


TTAAAGGATT 


AAGAAAACAT 


AATCAAGATA 


CAAAAATTAT 


TGCAGTAACA 


4680 


20 


CAGCTTACGT 


CAACAACAGA 


AGACATGTTA 


CGACACGAAC 


AAAATATACA 


AACATCGATT 


4740 




GAAGAGGCCG 


TTTTAAATTA 


TGCCAAGTTA 


GCAAATGCAG 


CTGGTTTAGA 


TGGCGTTGTT 


4800 




TGTTCACCTC 


TTGAAAGTCG 


TATGTTGACT 


GAAAAGTTAG 


GTACATCATT 


TTTAAAAGTA 


4860. 


25 


ACACCAGGTA 


TTAGACCTAA 


AGGTGCATCT 


CAAAATGACC 


AACACCGTAT 


TACGACACCG 


4920 




GAAGAAGCAA 


GACAGCTTGG 


TTCGACGCAT 


ATTGTAGTCG 


GTAGACCGAT 


TACACAAAGT 


4980 




GACAATCCAG 


TCGAAAGTTA 


TCATAAAATT 


AAAGAAAGTT 


GGTTAGTATA 


ATGGCTAAAG 


5040 


30 


AAATTGCAAA 


ATCATTATTA 


GATATTGAAG 


CTGTAACATT 


ATCACCAAAT 


GATTTATATA 


5100 




CATGGAGTTC 


AGGTATTAAA 


TCACCGATTT 


ACTGTGATAA 


CCGTGTTACG 


TTAGGTTATC 


S160 


35 


CTTTAGTTCG 


AGGCGCAATC 


CGCGATGGTT 


TAATTAACTT 


AATTAAAGAA 


CACTTTCCTG 


5220 


AAGTAGAAGT 


TATTTCTGGT 


ACTGCAACAG 


CTGnTATTCC 


ACATGCAGCT 


TTTATTGCTG 


5280 




AAAAATTAAA 


ATTACCAATG 


AATTATGTTC 


GTTCATCAAA 


TAAGAGTCAT 


GGTAAGCAAA 


5340 


40 


ATCAAATCGA 


AGGTGCTAAA 


AGTGAAGGTA AAAAAGTAGT 


TGTGATAGAA GATTTAATTT 


5400 




CGACAGGGGG 


ATCTTCAGTC 


ACAGCAGTTG 


AAGCCTTAAA 


ACTAGCAGGT 


GCAGAAGTAT 


5460 




TAGGTGTTGT 


AGCTATCTTT 


ACTTACGGTT TGAAAAAAGC 


AGATGATACA 


TTTAGCAATA 


5520 


45 


TTCAACTACC 


rrriTACACT 


TTAAGTGATT 


ACAATGAATT 


AATTGAAGTA GCTGAmAfflTG 


5580 




AAGGTAAAAT 


TTCTAGTGAA 


GATATCCAAA 


CATTAGTTGA 


ATGGAGAGAC 


AACTTAGCAT 


5640 




AATATAGACA 


CTAGAAGGAG 


GAATTCAACA 


AATGAATGAC 


AAAACATCTA 


ATGATTTATA 


5700 


SO 


TGGGAAGATA 


AAACATTGTA 


ACGAATTTAT 


CAATCATTCA 


AATGATTCCA 


ATCTATCTAG 


5760 




TAGTCACGAT 


GTCX5ACGAAA 


gttcaacx;aa 


GCAAAAACAT 


ATAAAAAATA AAACAACTAT 


5820 
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70 



25 



30 



35 



SO 



TTAAAAACAA AAAAGCT 

(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



5897 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 
TCATTCTAAA TCAACTTATT TTCCATTGCA TAAATTGCTG CTTGTGTACG ATCGCTAACT 
TGTAATTTAC TAAATATATG ACTGACATGT GTTTTAATTG TTTTTTCAGA TACAAATAAA 

GrrrCTGCAA tctctttatt tgttttacct ttaaccattt cacgtaacac ttcaatttct 

20 CTCTTTGACA ACTTATTCGT GTAGTGTGGT TTTTGGCTAA CTGTTTCGAA TACATCTTGT 

GCCTTAGGAT GTATCATTTT TTCACCGTTC ATAACTCGTC TAATAGT-TTC AA-rrAATTGC 
TGAGGCTCAA CGTCTTTCAT TTCATAACCA TCAGCACCTT TATTGATTGC TGAAATTACA 
TGTTCATCAT CAACATAACT TGTTAATACC AAAACTTTAA TATCCGGATA ATCTGCCTTA 
ATATATTCCG TAATTTCAAT ACCATTCATG CCAGGCATCA CTAAATCTAA TAGCACAATA 
TCAGGGTGCT CATGCTCTTT TAAATATTCT AAAAATGTTT CTCCATCTGC AAAGTCTTGT 
AAAACTTCTA TGTTTTCAaT CGTGGaTAAT AAAAATCGCA ATCCTTGtCG CACAATATAA 
kGGwCATCTA CTAATATnAC TTTGTTCATG GGTTATCTCC TTAAAtCAAG CTATTTTATA 
GGAATTGTGA ATTGTATTTG TGTACCCTTT GTTGGCTGAG AATGAAAGGT CACTTTACCT 
CTTAATAATT TAACTCTTTG TTTTATGTTA TTAATACCGT GTGATGAAGC TATCTGAACA 
TTATCGATCT CAAATCCTTG ACCATAATCA ATCACGTCAA TATATAGTAT ATCGTTCATT 
^ TGTTTTAATG TAAGATCCAT TTTATTCGTA TCAGCATGTT TCTTAACArT ATTAATACAC 900 

TCTTGTAATG CTCTGTATAT GTTTTCTrCG ATTTCATTAG ATAAATCGAT TAAACCTTCT 960 
ACATTTACAT TTAATTGTAT ATGCATTAAT TTACTATATG CTGTCAAAGC ATX5AATTAAA 1020 
45 CCITGCTCAA GTCCAACTGG CTTAAGTnGC CAAATCAATG CACGCATTTC ATTAACGGCA 1080 

TTTTGACTCG TTtCCTCAAT CGTCTTGAAT GCTTGTTTAG CGATGGATTC GTrTOACATG 1140 
CCATACGCAG CATGlX^CtGT TAGTTTTACA GAAAATAACA TTTGATTTAC TGAATCATGT 1200 
AAATCTCTAG CTAAACGATT ACGTTCATTA AmTTGCCG CTTCTTTrTC AOGGTCTGTT 1260 
AAATAAATAC GTTTGATGGC TGACCCTAAT TGAAATGCGA CAGACTCTAG CAACTCTAAA 1320 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
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TGACCPGATT 


T A AGTGGTA 
X rvvvj X uvx X £x\^ 


CGTTGCATGA 


TGTGTAATAT 


TGTCATTl''iX» 


GCTAGGAAAT 


1440 




GCTTTAGAGG 


CT AAGTT HAT 

w X ^V%V3 X X X 


AuvjACvAACAA 


1 ltfACL»ATAT 


TCGACGCTTT 


CATTAGCCTA 


1500 


5 


CGTTGATTAA 


ATGCTTTPAg 

X \JVv XXX w 


Ar'Arr* A a/^a a 

AUiA^L.AACAA 


VxA^ V. t-A X vJxT 


TAATATAGTG 


ACAGTGGTCT 


1560 




GCTGTCAAAG 


ATTGTGGTAG 

X X VJ X \JU X 


A G/^ A r* A T*^T 


«A X AUAAGTT 


^ TV * 1 V* 1 T^ 


TACGCTATTG 


1620 


10 


ATGAAAAATA 




CGTGA A ATTA 


V. X KdK^ \^\^ X w\A 


TTAAATATTT 


IV TV JV 

TAACG CAC CT 


1680 


TGGGTCATGC* 


TAT AO A TTT^^ 

X£\ X.n\m£\X X lv« 


XT i 1 lA. 1 1 


TTTAAAAATT 


CGGCAATCTC 


TTTTAATAAA 


1740 




G CTAGTCG CG 


T O^TTTGTTr^ 
XV^^X X XVJX 


ATf^ A A ATr'O 


V. X U W\A X X UA 


X~X~i~X X'ACGTG 


TATTAACTAT 


1800 


IS 


TATAf^ATTGA 


GTTATTATAT 


111 lAiWtTUT 


JV & JV K JV 

TAuACWTAAA 


IV *T*0 IV «T*IV tV IV Ik 

CATGATAAAA 


TGGCCTTGAT 


1860 




*T* A r^r^ A T a r; 

^ 1. KmJV^ X 


X lAlAi 1 lAj^ 


taAGAACTGAT 


TTGTGATATG 


ATATTAAAGA 


CTATAGGAGG 


1920 






AX i iAAAATA 


CuAuAAAACT 


TTAATGACTT 


AAGTTTACGA 


GATATTTTCC 


1980 


20 


AAr'A ArTTA A 
/vW»^v^V» X X 


X i AAA 


AAAGATTTAC 


JV ^P/^ IV T^fVT^ T\ fV IV 

ATCA x X x AAA 


TATGTCTAAA 


GATATTACTA 


2040 




X X / v\ X X #U\ 


A r'^F' r""" i\ 
AL-t, 1 (jCuWVjA 


TTAATGGATA 


AAGTG CAT AC 


TGGCGACGAT 


GTATTTGTTC 


2100 




CAAPHATr^GA 


TGA A A A & ar'T 
XUAAAAAAVj 1 


AATTATGTTC 


CAAG X X ATCG 


TTATGCACAA 


ATTAAATACG 


2160 


2S 


A AG A ff^ a TY^ 2V 


T ATfVOA ATO 


/^•^JV IV JV JV IV 

CjTAATkjAAAC 


CTAAAGGTGT 


TAAGACTCAC 


CCTAATGATT 


2220 




TAAA AGAA&I^ 


V..AAXAV. 1 i lA 


IV TV** TV jv nr*^ iv 


TGATTTACAC 


TATTGATAGT 


GACTATGTCG 


2280 




AAPCAATTPA 




OIV/'?/^ JV ?V IV /^*./^ 

CAvjuAAALZAG 


TAGGATTATT 


AATTGTTGCT 


AAAAATCCTT 


2340 


30 


TAATGAAAAA 
xnnx 


A ATTf*^rT^ AT 
nnX XV^X X\jt/\X 


O AT/^ TT A 


ft IV J^ JV JV TV 

AAQvACAAxCvA 


TV IV ^f^^ 

TAX X ACGCGG 


Ik ^niv Ik Ik % ^ ^ 

ATATACAAAG 


2400 




CAAATGTTAA 


GGPArTTTTA 


^^OTTT A A A A 
^V. Ill AAAA^ 


a IV TV A /^r* IV f* 
(-AV..AAAC oAT 


TGAT ATGC CA 


Ik ^^*^/^^^ Ik Ik Jk 

ATTGGTAAAG 


2460 


35 


ATAAATTCCA 


TTCGAAT AAA 


PG A r*GTr;TGT 


^ X X A^ Xviv3 


'f<f~^ty r^f^f^T^r^ r"* iv 
AL-AV7V.G loCA 


7\ J 1 jk ^ IV Ik ^ IV 

A x X ACACACA 


2520 


TTTTAAfPTf^ 
X X X X x x ^ 


AAAAATGATA 


A A A*^ A Anr^TT^ 


TO fV^y^/'^m A 

X X vid-IuAwx 


•T*0 & fV 9V JV JV 

xCoAAATdAAQ 


TTGGATACTG 


2580 




GACGTAr*TCA 


TP A A ATwfVtT 


GTfy^ATTTAf^ 


OTO A & IV TfW* 
I- X (oAAATTVjCj 


TCACCCTGTT 


ATTGGTGATC 


2640 


40 


CTTTATATGG 


TGATTCAACG 


TTAAGACVLAT 


TAriA ArTTY^ A 
X AVjtAAW. X X VjA 


A AiyPTAOA A A 
AAlj X X AV.AAA 


TVT^TV^ JV^'I^P"^'* 

ATAGAGTTTG 


2700 




TGcATCC CTT 


GACTAAGGAA 


wXV»MX X xv»\»vx 


X X X V. X X X V9^jA 


XiaAV. XAAX X X 


GATTAGTTTT 


2760 




GCATGATATT 


tAAACATGr*A 


ATACOGf^ATT 

nX A\*^VJ^aAX X 


flTA AOTA A AT 
u X AA\. X AAA X 


OA AOTATOA A 
L^\Au X AXA..AA 


^PTV )V IV IV 

V.CTAAACGGA 


2820 




TAGATGGAAA 


ATTATT A ATT 
nx xnx x#\nx x 


TTTTPAG ATTi 
X X X X VaiAOn X 


1 X ULyVj X X X X~X 


X X\9'X X X X~X X A 


CGATGCtTAG 


2880 




GATTTTATAT 


TTTGATATTT 


TAGTAATTAT 


TCATTTTATA 


ACATCCTTGG 


ATAATGACTT 


2940 




GTAGTCTTTT 


TCAACTGCGT 


TACGTGTATC 


TATGGACAAT 


ACATGACATC 


ataagatttt 


3000 


SO 


TATCACAGGT 


TGTTTGGCCA 


ATACATGTAC 


AACAATTCAT 


CATATAAAAA 


ATAGGTTCTA 


3060 




TAATAAAACG 


GACTCCATGA 


AAAGTTTTTC 


CTTTTCATGG 


CTCTATATCA 


AATCAGACTT 


3120 
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CCAAACTTTA TTTTATATTA ATATT^aaTT* » M#r>/^«^yi^^«»«^ ^ 

ATATTTAATT AATGAGGATC TACCATATCT TCTGGTTTAA 


- 

3240 


5 


TC CATGCTT C AAATTGT-rr-r Tr^riTK nr^T^n^ j^^^^^ ^. ^ 

xui i AACAT ATCCAGTTTG AATTGCAGAT TCTTTTAAAG 


3300 




TTAAACCTTC TTTATGGGr*T tttt^t arr'A a towtv^ko/^-w*^ -, 

i. AXrtiijo^Ul ITCTTAGCAA TTTGAGCTGC TTTTTCATAA CCAATATGTG 


3360 




GATTTAATGC AGTAACTAAr* ATTaaTrsaTT r'a^rwr^iv jMv*T*ik 

rtvjiM/\v.iAAt- ATTAATGATT GATTTAAATA ATTATCAATA TTCTCTTCGA 


3420 


10 


TTGGTTCAAT GCCCJ^CTCin^ n^^T^r^T^iK^ -m t. T>.m--ijTUT-.,, 

^xx^-^i (^(^v-LACTGCA CAATTGTTAT TAAATGTTTC CATACCATCA GCTAAAAGAT 


3480 




-r««i xv^x iM^ivjiAiiA TGCATAATAA CTGGTTTATA AACATTCAAT TCAAAGTTAC 


3540 




CTTGTGAACT TGCGAAPPAA r'aar*rr"r»A'r/^ Aa«r«K^^/^«mm . 

i.vjv,vj>\Ai^L-AA CAACTGTATC ATTACCCATT ACTTGGACTG CnACCATTGT 


3600 


15 


w^iUjiTjiAG GATTAACTTT ACCAGGCATA ATTGATGAAC CTGGTTCATT 


3660 




w^wwixM v^MAAiixcTG CCAAACCAGC TCGTGGCCCT GAAGCCAACC ATCTCACATC 


3720 




ATTAGCAATT TTr'ATT^aa/^t* /-^/^rwrvr'WTfn * m,-, 

iit-ATTAAGT CTCCTGCTAA TGCCTTCAAT GTTCCATGCA ATTGAACAAC 


3780 


20 


xiw^ii^ii^u i^t-i-GTAAGTC CGTGGAATTT ATTTTCAGAA GATACAAATG GATAACCCGT 


3840 




ATTTTCTGAA ATATAATrrrr^ f far^yi^pAT*!^ t^/^^'h m «r*w*«^m 

xxwxvj/^rt CCACTTTATC ACCAAATTCA GGATGCGCAT TAATACCAGT 


3900 


25 


ACCAACAGCC GTACTLArTYSa Trirr'aa^RT^T* t^m « i^m^m^^ 

--kv-wv»wt\»v.^w vji>\v.k-A<^u\jA IGGCAAGATT TAAAATGTGC TTCTTAGATT CAGATAACAT 


3960 


TGTTTCACAA Pnr^TT'a ar^r»a T&Ti»/^/-»/-»/-»» j-^j-^^-. . -^j... . _. _ 

j.«j.xxw\^->u\ v-U:Kj1UAAGCA TATAACGCCA GCCACTAATC TCTTGTCCTA GTTTGATCGG 


4020 




CGTTGCATCT TGTAAATRTr; Tar^na^r'aaT -i—i— rn n .r.i. 

>- w * A w w% X V, X X \j i 1 \j 1 L> 1 At-GACCAAT TTTAATAATT GAATCAAATT TATCTTCTkT 


4080 


30 


TTCTTTTf^AAA f2'Ta'T^T^/'^*T^T7\ ^ n /•^rrwtgww m . 

ixv^xAxwwi ^lAiiiCTTA AAAGTTTTAA TGCAGGTTCT AATTTTGTTT CAACCTCTTG 


4140 


ATATAATGCA ACGTG(^ATAf^ r'ar»'nvw»-» n mj-i -n-unj^ — - _ - 
*****wxiwv^ i^v-^jx^cAiAQ* CAGTTGGGaA TGTATCaTTC GaACTTTGAG ATTyATTTAC 


4200 




* X X X A. vjijvj X vjIja 1 At- 1 rxCATCACT TTGATGATCT TTTAAATACA TATT AGCAAC 


4260 


35 


ATAACTTACT ACTTCGTTf'A r'a*PT*r^a'pik*T»^ K/-wT«'WT«<^mr^f«t« 

X xrtv. X X X k-va 1 X UA UA n CATATT Ac mxi'iTiTA CCGCTTCCTG TTTGCCATAC 


4320 




AACTAGTGGG AAGTGTTCAT f^AATToa rr* w & »t« » k k « ^ « 

rtrwjxvjxxcMx uiAAiiUACC TGATAAAATT TGATCACATG CGTATACAAT 


4380 




GGCATCTTTC TTTGCCTttV^ TTAaTTT^r^r* •riMi tv«tv-«« » » - _^ 

x-«*wxxxw ixxwcv-ic^jrc i^AATTTTCC TAAATCAAAA TTAGCTATTG CTGCTGCACG 


4440 


40 


CTTTAGTTiiT Gf'!AAAAr*r*aT aaar^iiri»T*»rr^ ^jkVTwv^^^mms 

X xnv7x xvjx ov-MAAACt-Ar AAACTACTTC GATTGGCATA CGCTCTTTAC CAACTGGGAA 


4500 




X x^^x7 xxxrt ^-iicixici^u TTTGAGCACC CCAATATTTA TCTGCAGGTA CTTCTATTTC 


4560 




TCCAAAAGTA TCATnTTnAA T^rrTT'jir^Tv^Ti /-^»rrwT*^» 

* w»nnv> X X 1^ i\» i i L^AA IT^CTTACTGA CATTCAATTT CTCCCCTTAT CACTGTTTAT 


4620 


45 


TTAACTGTAG TATATCyiTTA AATaa*nf»^Aa -r-r-i— t. j-tj-m iLnun 

x-r^^xvjx/va XAXAICAIIA AATAATTTAA TTGAGCAATT TATGATTAAA ACX3TTTTCAT 


4680 




AATTTGAAAT AAAAATAf^^ TAATrv^r'ftr'r» -tj — j-r «. .-i.n.-- tl fru.L.m 

X x%«rm\x rtMMMAiAUA^. iAATCGCACG TGTTCACCCT TTATTACAGT GATACGGTCA 


4740 


SO 


lACGATTAGT GTGTTATCTA TCATTATTTA GTTATTATTG AACTAAGTTT AATTACGATA 


4800 


CTTTGTTTTA GTAGCTTCAA CCGtAGCAAT AGCTCTAAGT ATATATAATA CAGCACTAAC 


4860 




AATTGTCGTA TATGGATTTA GAGCAACAAG CGTaCCTAAA ACTCCTOTTA AACTCGCATA 


4920 
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TACAATACCT 


GATTGATTAC 


TTTTAATGAA 


TGTTTGCGCA 


TTAACATCAT 


CAATTAATCC 


5040 




11 i"i\*ATAAA 


TTGAGTTGTA 


ATTTTATTAC 


TTTGAAAATA 


ACAGGTAAAT 


ATAATGCCCC 


5100 


5 


AA i IGCCAAT 


GGAAAAGCTT 


TAATTGATAT 


TAAACTTATA 


ATAACTGTTG 


CTATCAATAA 


5160 




TTGAATCCAG 


TATTTTCCTA 


ACATAAATAT 


ATAAATCTCC 


TCTAATTTCA 


TTCTTCAATA 


5220 


10 


GCATATCATA 


ATCTTGGCAT 


ATTAAGAAAC 


GCGGTTTAAT 


GATTTCATTA 


AAAATATTAC 


5280 


TGATAGATGA 


CTTCTTTCAA 


TTATGTCTGG 


AGTAATTAAT 


TATCAATTCC 


GTTTAAATGG 


5340 




TGTTTTAATA 


TTTAAAATTG 


AAClTritJAT 


ATATTACTAT 


GTCTGGTACA 


CAAATCAATG 


5400 


IS 


TTTTATGCTT 


TACAAAGTTA 


TATTGGCAGT 


AGTTGACTGC 


AGTCCACAAC 


ATAGAGGCTT 


5460 




CGGAATGTCA 


GCTTCTATTT 


CATGCAAGTT 


GGTGGAGCTC 


CAACATAGTG 


GAATTGGATT 


5520 




CCCAATTTCT 


ACAGACATTG 


CAAATTGGGG 


AAACGGGCCA 


CAAACTCAGA 


AACTGGTGGA 


5580 


20 


AAGTCAGCTT 


AAAATAACAT 


GCAAGTTGGC 


GAGGCCCCAA 


AATAGTGAGA 


TCGGATTTCT 


5640 




AATTTCTACA 


GACATTGCAA 


ATTAGGGAAA 


CGGGCCACAA 


ACTCAGAAGT 


TGGTGGAAAG 


5700 




TCAGCTTAAA 


ATAACATGCA 


AGTTGGCGGG 


GCCACAACAT 


'agaaaaattg 


GATCCTCAAT 


5760 


25 


TTCTACAAAC 


AATGTAAGTT 


GGGGAAACAG 


CCCCAACACT 


GAAACTAGCA 


GAAAGTCAGC 


5820 




TTCTATGAAT 


ATAATAAAAA 


AGCTAGGTAA 


CAAAATGCTA 


CCTAACTTCA 


TATTCAAGAT 


5880 




AATCAATCCT 


ATTTGATATA 


TGTTCTATAC 


TATACATTAT 


TTACATGATA 


AATAACTGAA 


5940 


30 


TATTACACAA 


TTATAATACT 


TTACTGACTG 


TCTTCTTCAG 


aattcttttc 


TTGATCATTT 


6000 




TGATCAGAGA 


TTTGTTCCaT 


TTCTTTACCT 


AATTCTTTTA 


AATCTTCAAA 


ATCCGTTACC 


6060 


35 


ATACTGTTTT 


CTTCTTCATG 


ATAATTTAAT 


TTTGGATCTT 


TGTCTTTAGA 


CATAATCAAT 


6120 


ACCTCACAGA 


TTTT-JLAATTA 


AGCAAAGCGT 


GACGTAAAGT 


AAGCTTTAAC 


ATCTTCAGGT 


6180 




AAACCTGCAG 
GCTGGTTCGA 


CCGCTTCTTT 


ATCAAGAATA 


ACATTTACCA 


TTCTATGTGC 


TTTTAAATCG 


6240 


40 


AGC'luxi'rrrr 


AccAUTrivr 


TGATATAATT 


TTTCAACTAC 


ATCTCcrrrA 


6300 




TTAGCACCTG 


TCACTACTAA 


GAAAATl'lXrr 


CTTGCTTCCA 


TTAGTCCTTG 


ACGAATACTA 


6360 




ACATTTAACT 


TACCTTGCTC 


ATCGATAGAA 


ACAACTTGTA 


ATGTTAATTT 


CCCTTTATTT 


6420 


45 


TCTTTAGTTT 


TAATCTTATC 


AGCGATTAAT 


TCGATTGCAT 


CTTTTTCATA 


AGCAATTGGA 


6480 




TAAACTTGAC 


CTGCTGGTAC 


ACCTAACGCT 


TCGAAATATG 


ArrrriTATC 


GTCATAATCT 


6540 




AAAATATTTA 


TTTGGCTAAA 


ATCAACAGCA 


TGTTTTTCAA 


CATTTTTCTT 


TAATTCATCT 


6600 


SO 


AGAACTGGCG 


CTTGATCTGT 


ATCTAAATGA 


AAACCTGCAA 


TTGTAGTACjG 


ATTATTGTTA 


6660 




AATTGCTTTC 


TAATAATATC 


AGCAGCATAT 


TCTGCTACAA 


GTTGACTATT 


GTCAAAGACT 


6720 
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10 



15 



20 



25 



30 



35 



40 



45 



SO 



GATATTAATA ATTATTATAC CCTAACTTTC AATATATCAA ACCATTTAAC 
TTATACTCTA AATATAGCAC TTAAGCATCA TTTTTATAAT GAAAATGAGT 
TCAATCCCGG rAAATCTTGT TGACGTAACG CTTCATAAAT TAACAACGCA 
ATAAATTTAA TGAACGAATA TGTTCACTCA TAGGAATTCT TAACGCTGTG 
TCTCTTTCAC CCAGTCTGGT AATCCTGTCG TTTCTTTTCC AAAAATGAAG 
TGTCATGATT TGAAAAATCA AAATCACTAT AAGTCTTTT T ACCAAATTTT 
AATACTCGCC ATTTGTGACT TCAAAAAATG CTTCAATACT ATCATGATAC 
CAAATTCCCA ATAATCTAAA CCGGCTCTTT TTAACATTTT ATCATCAGTT 
GAGGTTTAAT TAAATGTAAA TGTGTGTTTG TACCTGCACA CGTACGgCAA 
ATTAGCTGGG ATTTCTGGTT GATATAAAAC GATATGATTf GTCATATTAC 
TTGTGTCTAA TCCTTTTATC ATTTCATTCT GAACTTCTGC ATCCTCTTGA 
CATTGATAAA ATCTCTTGCT TCTTCCCCAA GAATTTGACC AATGGCCCAA 
CTCGAATCAA CGGTCTTTCA TCTGTTGTTG CAACTTTTTT CAATTCTGGA 
CTTCATTAAA ATGCGCCAAT GCTAAAATAG CATTTCGTTG TATCGGCTTT 
AAGCACCTGC AAGgTGACCA TATGTTTGTT TGAATTCTTT ATTAGACATA 
GTACTAATCT TGGCTTTAAA ATTTCTGGTT CCAAAATGAT GTCATCTTGT 
TACCTCTATT TTTCGGACAA ACTTGTTGAC ACGTATCGCA ACCATATAAT 
TTTTATAACG ATATTGGTCA GGCATATAGC CTTTTGTTTG CGTTAAAAAA 
TCTGACTATT TAATTGGCCA TTTCCAACTA ATGCACTTGT TGGACAACGA 
ITGTACAATG CACCACAGCT ATCTAATAAT GGATCATCAG GTTCC 
(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1302 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 
GCCCTGTTGG AGAAATCACC TTTATACGAT GGTTGAAAAA GCATCATTAG GTACAATCGT 
GGCAATTATA GTTATTTTGC TGTTTTTAAG AAACATTCGT ACGACGGCAA TTTCTATTAT 
ATCGATTCCG TTATCACTTC TTATGGCGCT TATTGCTCTG AAATTGAGTG ATGTTTCATT 



TTTAACATGC 
AAATTTTAAT 
GCAGTATTTG 
TCTTGATATT 
TAAAAATCTT 
GTTAATAAGT 
GTAATATTCA 
CTAAATCCAA 
TGTTACCAGT 
TATTCTCTCC 
TCATAATTAG 
TAAGCAGTTG 
ATTGCATCCA 
TTACCACGCC 
CGTAGtAAAG 
TCGGTATTAA 
CTATTCCCAA 
CTAATGCATT 
TCAACACAAA 



6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7965 



60 
120 
180 
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^ ;^CAGATTCA GAAGAACAAC TAAAAGGTGA 

--^ ^^^^^ 

.ccrrrroc T.«=cr.™ 

,c«a«^ rr;>aoTcrro rr«T.c«c TTar.Trr« 
.^.^ — 

m — — 
-rr =: — r r 
— -CO.. — — - - _ 

„0=CCT«T T™.T«.= «=C«=.T« c«T«=AaOT 

.cc^ — ™^ 

r.c^^ -c».c« ^^^^^ ^^^^^^ 

T«;CC«TOT CA«TC^ cM^" 

««=^^c 

,^cr=rr a.T.T.«« tc».c«« t«o»«=. 

,2, IHFORMATIOH FOR SEQ «> "O' • 

m TYPE: nucleic acxd 
C) SrMTOEDNESS: double 
(D) TOPOLOGY: linear 

40 

.eo.«.c. OBSCK.F.XOH, - 

„™ 2.^^ 

„.CT««. TCmCTOT « ^^^^^ 

ACAAAGTACC ATCGTTCACn ^ 



300 
360 
420 
480 

540 
600 
660 
720 
7S0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1302 
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„=ATOTA MOWATWT ATOKCATCT rr«1AGCTT 

r:^- c.«.^ — 

C^»™. «-««.T CC«C^^ 

— = ^„ 
.^.=..= TO^- 

— — ^ -c.™.c 

— " ~„ ^^rr.. 

c™»T«x. 

.^SCATtCT ACTTACCCC TCa.«T«T 

^"-"^^ 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1003 



,2, lOTORMATIOB FOR SEO IB NO 

Bl TYPE-- nucleic acid 

(C) STRWTOEDNESS: double 

(D) TOPOLOGY: linear 



45 



SEOUE^CK OESC.XPTXO.= SEO XO HO- 32* = ^ ^^^^^ 
— 
— — — 

X;.TC^CAGGT GCATTATCTA TTATGACAAA TG ^TTACGATC 

^J^TTTAT TTGGTATGAC AACATACTTT ATCATGCTJ^ 
^rrAmATC ACrrATTTAT TT ^cCGTAA TTTTAATGGC ACGCCCATTA 

- ^^^iirSCAAT AACaAAAAAG cmAACCGTAA 
K.T4caAGT GACATGCAAT AA agcCATGTAA AAACGAGATT 

^ T-rrTATATC AATATTTAAA AtTAAACCtA Aet«^ 

"l^^ TC^-' --^^^ ^''^ 

rr^-r^r. ™- 
.^«c ^ ^ 

* ^^^rr^TATG ACTTGGATTA CTCTTAiuiu. 

CATTTGAG CACCGATAT^ ^ ^^^*TT CATCGCATAG 

' ^r^TC rrrAATGTAA ATACGTCTCG CTTGACCATT CATO^ 

^TATGCA CATTTTCATC TTTAM^^^ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



CGTTTACCTT TATCGATTTC GATTTCTACT 
GTAGGCGTAT CAAGTAACGA TAAGTTTCCG 
^ TATACTTTTG GATATAGTAC ATAACTAATA 

TCTTCTTCAA GCAACTCACG GACTTTTTCA 
CGAGCTGTTA GTGCTTCTTG GCCTTTTAAA 

w 

TTTACAGGTT GTCCTATTTC ACCTTTGAAG 
TTATAGCCAT CTGTAATCAC GGATTGTTCA 
ATATCACCAA CTACTTTAGA CGATGGTGTT 
ACTCTGCGAT ACATATCTTT GACTTCATCA 
TGTTGACTTA AATTCGAATA CTGTCCACCA 
20 GGTGATTTGA TATCACTTTc AAAGTCTGAA 

CT7VAGTGACT CCATACCTTC AATATCAGTT 
TAATATAACG AATTGGCGCT TGGCTGACTT 

25 

TCAATGATAT CGACACCAGC ATCTATTGCT 
GTATCATGAG TGTGAAGATG AATTGGTAAA 
TCGTAAGCGG CTTTAGGTTT TAATAAGCCT 

30 

CCTTCACGTT CTAACTCTTT AGCTAGTTTG 
CGCTCAGGAT TTAAAATGTC ACCTGTATAA 
35 TCTTGTACTG CTTCATTGGC AACTTTCATT 

CTAAAGACAT CTATGCCTGC TTTAGCACTT 

GGATAGTTTT TATAACCAAC TGCGTTTGAA 

40 

GGAATAGCTT TACGTAGACG TTCAAGTCGT 

GCCACATCAA ATGTAGCACC GCCCCACATT 

GCTGTTTTGG ATGCGATATT AATCATATCC 

45 

TGGTGTGCAT CTCTAAAGGT . TGTATCTGTT 
TCAGCTACAC CTTTTGGACC TACTTCATCA 
so ATTTTACTTG AAGATACAGT TGGAATTGAT 

TCAACATTTG GGAAACCATT AATTGTTACA 
CTATCTAGAG ACGGCTGAAT GTCGAAAAGT 
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GTTTCACCAT TACGCATTCC AAAGAAGAAT 840 

TATTGATTTC TAGTTTGAAT ATATTGTTCA 900 

ATATCTTGCT CCGTAACAGG ACCTTGTTGC 960 

AAATCAACTG GCTCTAGATA TTCACCTGGA 1020 

ATAACCGCTT GTAAATCTTT ATTAAAACCA 1080 

AACGACACTA CTGATTCTGG GAAATCTAAT 114 0 

TCAAGATCAT TTTGTACCAT ATAAAGTGCC 1200 

ACTTTTACGA TATCACCAAA TAAGAAATTC 126 0 

AATCTTTCGC CTAAACCTAA ACTTTTAGCT 1320 

GGCATTTCAT GTTGATAAAT TTCAGTATTC 13 80 

TAATAAGTAC GTACAGTTGA CCAATAATGA 144 0 

CTAAGGTGGG GTGGGAAGCC ATTTAATGCA 1500 

GTTAAACCAC TCATTGAAGC AACAGCAGTA 1560 

TGTTTGTATG TTAATAAACC ATTACCACTT 1620 

TCTACAGCTG ATTTTAACTC ACCAATCAAT 1680 

GCCATATCTT TAATCGCTAA AATATGGAAA 174 0 

ACATAATACT CTAAAGTATA AATGTTTGAT 18 00 

CAAATAGTAC CTTCTGAGAT TTTGCCCGCT 186 0 

TGATCTACCC AGTTTAATGA ATCGAAAATT 1920 

TCTTGTACGA ATTTATGAAT AACATTATCA 1980 

GCACGTAACA ACATTTGGAA TAATACATTT 2040 

TCCCATGGGT TTTCCTTCAA GAAATTATAT 2100 

TCTAGTGAGA AACCATCTTT AAATACGTCC 2160 

TTAGTTCTAA CTCGTGTAGC TAATAATGAT 2220 

AGTAAGACAT CATCCTGCTT TTTAACCCAT 2280 

AGCAATTGTT TCGTACCACT AAATGAAGCG 234 0 

GCTAATTCAT AGTCTGGTTT CGGACGTTTC 24 00 

TTACCTATAT ATTCTAATGT TTTAGTACCT 24 6 0 

TCTGGTGTTT CTTCAATAAA TTTAGTTGTG 2520 
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TTAACACCAC GAATACX5CAT TTCTCGTAAT GAGCGTACCA TTTTTTCTTC TGCTTGTTTa" 2640 

AATGATATCG CGTGTGTAGA TAATTTTACG AGTAATGAAT CATAATAAGG TGATATCTCA 2700 

GCACCTTGGA AACCATCTCC AGCATCAAGA CGTACACCAA AGCCCCCGCT TGAACGATAA 2760 

GCAATGATTG TTCCAGTATC CGGCATGAAA TCATTTAACG GATCTTCTGT TGTAATACX3A 2820 

CATTGGATGG CATAGCCTAA TGTTGTAATA TCTTTTTGTT GCGGCATATT AATCTCTTCA 2880 

CCAAATAAAT CGGCACCTGC TGCAACTAAA ATTTGTGTCT TAACAATATC AATTCCTGTT 2940 

ACCATCTCTG TAATTGTATG CTCTACTTGT ACACGAGGGT TAACTTCTAT AAAGAAGAAT 3000 

TCGTCACCAG ATACTAGAAA TTCAACAGTA CCTGCATTGA CATATTTAAT ATTTTCCATC 3060 

AATTGAATTG CAGCATCACA AATACGTTGA CGTAATGTTG ATGATAATCC AACTGATGGT 3120 

GCAACTTCTA CAACTTTTTG ATGACGACGT TGTACTGAAC AATCACGTTC AAATAAGTGT 3180 

20 ACGATATTTC CATGTTCGTC ACCTATGACT TGTACTTCAA TATGCTTTGG ATTATCAATG 324 0 

TATCTTTCTA TGTAAACTTC ACTATTACCA AATGATTTTT CAGCTTCTGA TTTTGCTCTA 33 00 

TGGAAAGCAT CTTCTAATTC ACTTTCTTCA CGAACGATTC TCATACCTTT ACCGCCCSCCA 33 60 

CCACTTGTGG CTTTAATCAT TAGCGGGAAA CCAGGTTCTT CTGCAAATTC TTTTGCTAAT 34 20 

TCATATGATT TAATTGGACC GTCTGTACCA GGAATAACTG GTAAATCTGC CTTGATAGCC 34 80 

GTTGTACGAG CTTTAACTTT ATCTCCAAAC ATATCTAAAT GTTCTAAATG AGGACCAATA 354 0 

AATTTAATTC CTTCTTCTGC ACAACGACGC GCAAATTGTT CATTTTCACT TAAAAATCCA 36 00 

TAGCCAGGAT GAATCGCATC CACATTCGCT TGTTTTGCTA CATCAATGAT ACGCTCAATA 3 660 

3S TTTAAATAAC TTTCAGCAGG ACCTAAATCA CTTCCAACTA AATAGQATTC ATCTGCTTTA 3720 

TATCTATGTA ATGAA C TTTT GTCTTCATTC GAATAAATTG CAACTGTGCT GATGTCTAAT 3780 

TCTGCCGCCG CTCTGAATAT ACGAATTGCA ATTTCTCCAC GGTTAGCAAC AAGTAACTTT 3840 

TTTATTTGTT TCAATAGCGA TACACTCCTC AAACTATTAG AATTTTCTAA CTAATTAGAT 3900 

AATAAAATTT TATCTTAAAG CGCTCTGTTT TGCTATAGTt mTGTTTCmAA TTTTCAAAaT 3960 

TTaACATyCT tGAGACAATT AAAaCCyCCG CTTCmGaAAT AATAATTTCA AAAATGACTA 4020 

TGCAACAACA GGTAGTTCCA CGTTTTTGTT GTGAAACATT TTCGATTTCT ACAACTCTAA 4080 

AAAATTAAAA ATAAAATTGC AAAACATCAA CATTTATTAT CAATAGCGAT AACTTTATCT 4140 

TATCATCATG ATTCTAATTT CGCCACCACA TTTAGTAATT TTTAGTCATA AAATTTAGTT 4200 

ATAATTATAC GTTGTTTTGT TTATAAAATT TGATAATCak GAGTAATCtC GTAATATCAA 4 260 

AACaAAAAGG AAGTTAAGCG TTGTTTGGTT GcCTAACTTC CGTTATTGAA CTCATCcAGT 4320 
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TCTCGTACTA AATATTGGCT AGTATTTTTT TAATTAAATT GTCTTCTTAT ATCAACTTTT " 4440 

TGTTGTTGTT TCTTTCGTTG CTGGTCTACT TTGATTTGTT TACCTACAAT CAGAAGTAAA 4 500 

CCCATAGCAA TACTTAAACT AATCATTGAT GATCCACCAA AGCTGATAAA TGGCAATGGC 4560 

ACACCAGTTA ATGGAATTGT TGCCGAAATA CCGCCAATGT TTACAAACGT TTGACTTCCA 4 620 

AAGTATGTGG CAATCCCAAC ACACACAAGT TTATAAAAAT ATGATGATGT TTTATTTGCA 4 680 

AACTGGAAGG CACGATATAC AATAAAGAAC TCTAAAGTAA TAACTAGCAA TCCTCCGATT 474 0 

AAACCTAATT CTTCGCAAAT AATTGCAAAA ATAAAATCTG TATGTGGTTC TGGTAAATAG 4 800 

CCCAATTTCA TTGCACTATT TCCTAATCCT TTTCCAAATA CGCCACCGTT ACCTATCGCA 4 86 0 

AGCAATGAAT TGGAAATATG GTATCCAGTT CCTGATTCGA ATTGGAATGG ATCTGTTAGC 4 920 

GTACTAAATC TGGCAGTTAA ATAACTTGGT AACCAACCAG CCATTAATGC AATGACAAAT 4 980 

20 ACTACTAAGA ATCCTAGCAC TGCTGGTATA CCAAATCTTA GGACTTTGTT 5030 
(2) INFORMATION FOR SEQ ID NO: 325; 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

CTTGTTAATC CGAAAATTAT TAGTCAATCA AATGAAACAA TAACAGACTT nGAAGGTTCA 60 

ATTACATTGC CAGATGTTTA CGGCGAAGTG ACAAGAAGTA AAATGATAGT TGTCGAAAGT 120 

TATGACGTCA ATGGGAACAA AGTTGaACTA ACTGCACATG aAGATGTAGC AAGAATGATT 180 

TTGCATATTA TAGATCAAAT GAACGGTaTC CCTTTTACAG AACGTGCGGA CCGTATTTTA 24 0 

ACAGATAAAG AAGTGGAGGC ATATTTTATA AATGACTAAA ATAATATTTA TGGGTACACC 300 

AGACTTTTCA ACAACTGTTT TAGAAATGCT TATTGCAGAA CATGATGTCA TTGCAGTCGT 360 

AACGCAACCA GATCGACCTG TTGGACGTAA ACGTGTTATG ACACCACCAC CAGTTAAAAA 420 

AGTTGCAATG AAATATGATT TACCTGTATA TCAACCTGAA AAATTAAGTG GATCAGAAGA 4 80 

ATTAGAACAA TTGCTTCAAT TAGATGTAGA TTTAATTGTA ACTGCTGCTT TTGGACAATT 54 0 

ATTACCTGAA TCATTGTTGG CATTACCAAA TCTTGGGGCA ATTAATGTAC ATGCATCATT 600 

GTTACCGAAG TATAGAGGTG GTGCACCAAT TCATCAGGCA ATTATCGATG GTGAACAAGA 660 

AACCGGCATA ACAATTATGT ATATGGTTAA AAAATTAGAT GCGGGTAATA TTATTTCGCA 720 
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ATTAGGGGCA GATTTATTAA AAGAAACTTT ACCATCTATT ATAGAGGGCA CAAATGAAAG" B40 

CCTACCTCAA GATGATACGC AAGCAACATT TGCTTCCAAT ATTCGACGCG AAGATGAGCG 900 

AATTAGCTGG AATAAACCAG GAAGACAAGT GTTTAATCAA ATTCGTGGAT TATCACCATG 950 

GCCAGTTGCT TATACAACTA TGGATGACAC TAACTTGAAA ATATACGATG CTGAACTCGT 1020 

TGAGACTAAT AAGATAAACG AGCCTGGAAC CATTATAGAA ACGACTAAAA AAGCCATTAT 1080 

TGTTGCTACA AATGATAATG AAGCTGTTGC AATTAAAGAT ATGCAATTAG CTGGGAAAAA 114 0 

GAGAATGTTA GCTGCCAATT ATTTAAGTGG TGCGCAAAAC ACACTAGTAG GGAAGAAACT 1200 

TATATGATAG AAAACGTGAG AAGTCTTGCT TTTGACACGA TTCAAGATAT ATTAAATGAA 1260 

GGTGCGTATA GTAACTTGCG TATCAATGAA GTGTTGTCAG AAAATGAATT AAATGCAATG 1320 

GATAAGGCTT TATTTACAGA AATTGTCTAC GGAACCGTTA AAAGAAAATA TACGTTAGAT 1380 

20 TTTTATTTA 1389 
(2) INFORMATION FOR SEQ ID NO: 326: 



IS 



25 



30 



40 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2746 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

TTTGCTAATA ACAATAAAGC CAAAGCCGAT TCACACTCTA AACAGCTAGA AATTAATGTT 60 

35 AAGAGTGACA AAGTACCTCA AAAAGTAAAA GATCTAGCAC AACAACAATT TGCTGGTTAT 120 

GCTAAAGCAT TAGATAAACA AAGTAATGCA AAAACTGGTA AATATGAACT TGGCGAaCTT 180 

TTaAAATTTA TAAATTTAAT GGTGAAQAAG ATAATAGTTA CTATTATCCA GTTATAAAAG 24 0 

ACGGTAAAAT TGTTTATACT TTAACACTTA GTCCTAAAAA TAAAGATGAT TTAAACAAAT 300 

CAAAAGAAGA CATGAATTAC AGTGTTAAAA TTTCAAACTT CATCGCTAAA GATTTAGACC 360 

AAATTAAAGA TAAAnATTCA AATATCACTG TTCTTACTGA TGrAAAAGGG kTTTATTTTG 420 

aAGAAGATGG ctnAAGTTAGA TTAGTAAAAG CTACGCCTCT ACCTGGTAAT GTAAAAGAnA 4 80 

AAGAAAGTGC TAAAACAGTT TCAGCAAAAT TGAaACAAGA GTTaAAAAAT ACAGTAACAC 54 0 

5^ CTACTAAAGT TGAAGAAAAC GAaGCGATrC AAGAAGATCA AGTTCAATAT GAAAATACAT 600 

TAAAAAACTT CAAAATTwGA GaACAACAAT TCGATAACTC ATGGTGTGCA GGATTCAGTA 660 

TGGCAGCATT ATTAAATGCa ACTAAAAATa CAGACACTTA TAATGCACAT GATATTATGC 720 
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AAATGATTGA ATACGGTAAA TCACAAGGCA 
CATATGAACA AGTTGATCAA CTTACAAAAG 
5 GTGTATCTCA AAACCCTAAT GACCCACATT 

CTAAAATTAA TGACCAAGAA AAACTTATTT 
TCCAAGATGC AGATTCAAGC CTATTACATT 

w 

GTTCAATGAT AGGTTACkAA AAAGTAATAT 
TATCAACTAC AATTTATAAA TTTAGTTTAC 
ACCAATATCA ATTTATTCAT TGGTAATTGG 
ATACGTCATG GCGATGATAC AAGTCACAAT 
CATCAACGCA TTAAATTTTC ATCTATTGAT 
^0 GATGATACAC AGGATATCCT CATGCAAACG 

CCAATCGTTT ATGAGCGCTT AGTATAACTA 

TGGCGAGAAT TCAATTTCTC ACCAGCTCGT 

25 

TATTCTTTCT CTATTAATTT TTCTCAAACT 

CCCTTTTAAA TTCTTATAAA ATAAAAAAGC 

GACTTAATAT CATATTCAAA ATAACTTATG 

30 

CACGTTTTTC TTTAAACGCA TCACGGCCTT 
ATGTTGCATC CCCAGCCATT TGTTGTAAAC 
35 CTGCTTTAAG GAATCGTAAC GCTGTTGGTG 

CAGTTTCATC TTCAACTTTC TCTAAAGGTA 
CTTCTTGTGC ATTGTATTGA CGACATAAGT 
CGATACGTGC TAAATATCCT GAACCATAAC 
TTTGTCCAAA AATAGCATTA TCAGCAGCAA 
TACCGCCACC TACAGCATAA CCTTTTACCA 

45 

TTAAACGCTG TAAATCTAAT ACATTTAAGC 
CATGTCCACG TTTCTTCTGG TCACCACCAG 
so ATACGATAAC TGAAACGTTT TGATCATCAC 

CAACTGTTTT AGGTGTAAAC GCATTGCGTA 
TCCCTTCGTA AAATTCATAT TTGATTTCAT 
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GAGATATTCA TTATCAAGAA GGCGTACCAT 840 
ATAATGTAGG AATTATGATC CTTGCACmAA 900 
TAGGACATGC GCTAGCAGTT GTTGGTAATG 960 

ACTGGAATCC TTGGGATACA GAaTTATCAA 1020 

TATCATTCAA TCGTGATTAT AACTGGTATG 1080 

AGATATTGAT TAAAGGCAGG TAAAACTATG 1140 

GACACAACCA AACTCACACA TCTAGAACAA 1200 

AGTAATCATC AATTACAAAA ATCAATTTGT 1260 

CAATATCATA TTCTTTTTAT AGATACGGCA 1320 

AATGAAGAAA TCATTTATAT TCTTGATTAT 13 80 

TCATCCAAAC AAGGTATTGG CACTTCGCGA 144 0 

ATTTAAATGA TTTCACTTCA TAAAGCGGGT 1500 

TTTTTCATTG TAATAATAAT CTTTAACATT 1560 

ATCTTATCTT TATGATAATT AATTAAAATG 1620 

CACCTATCGT CGCTAATAAA CGACGCAAGT 1680 

GGAATTTAGG GAATTGATCG AAGTCAGGAT 174 0 

CTTTCGCTTC ATCAGTTGTG TAATAAAGCA 1800 

CAGCTAAACC ATCTGTGTCA GCATTCATAG 1860 

AGTGTTTCAT AATCTCTTTA CACCATTGCA 1920 

CCACTGTATT TACTAGACCC ATATCTAAAG 1980 

ACCAAATTTC ACGTGCTTTC TTATGTCCAA 2040 

CCGCATCAAA TGAACCTACT TTAGGACCAG 2100 

TCGTTAAGTC ACAAACAACA TTTAGTACAT 2160 

TCGCGATAAC CGGTTTTGGA ATAATACGAA 2220 

GAGGGATTTG GTCTTCACCT ACATAACCAC 22 80 

AACAGAATGC TAAATCACCT TCACCAGTTA 2 34 0 

GTGCACGTGA AAATGCGTCA ATCATTTCAG 24 00 

CTTCAGGGCG ATTTATTGTT ACCTTAGCAA 24 60 

CATATTCTCT AAGTGTTTCC CATTGTCTGT 2 520 
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TCCACATGAA TTGTATGACC TGTAGCAGAA ATTAATTTAC ATTTACTATT AGGAATTAAA 

TTTGCCATTT TTTTCGCAAT CTGTACAAAT TTTTCATCAT ATTCTCCAGC TAATATTAAT 

^ GTTGGTACTT TAATTTCTnT CAGCGCGGCC ATAAGTTTGG CATTTG 

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 



20 



25 



30 



40 



TAATGTTTAG 


TTTATTAACA 


GTAAGTTCGT 


ATATGAATGT 


TTAGTGCTCC 


CCAAAATTGA 


o U 


AGTTTGAATT 


TTAAAAGCAT 


CTTGTAGAAT 


TTAGTTGTAT 


rrrriTCAAA 


GAAATTCATT 




TTGATTATTT 


TTGATAATOA 




AvjTAATACAT 


GTTTATAGTG 


TGTAGTATAT 


180 


GTCTATACTA 


GTAGTAACTA 


TATAGAGAAA 


GTAGGAATAA 


ACTATGTCAC 


AAGATGTAAA 


240 


TGAATTAAGT 


AAGCAACCAA 


CGCCAGATAA 


AGCAGAAGAT 


AACGCATTTT 


TCCCATCACC 


300 


ATATTCCCTT 


AGTCAATATA 


CAGCACCTAA 


AACAGATTTT 


GATGGTGTTG 


AACACAAAGG 


360 


TGCCTATAAA 


GATGGTAAAT 


GGAAAGTATT 


GATGATTGCT 


GCTGAAGAGC GATATGTATT 


420 


ATTGGAAAAT 


GGAAAAATGT 


TCTCTACGGG 


TAATCATCCT 


GTTGAAATGT 


TATTACCTTT 


480 


ACATCATTTA 


ATGGAAGCAG 


GTTTTGACGT 


TGATGTTGCG 


ACATTATCTG 


GTTATCCAGT 


540 


TAAATTAGAA 


TTATGGGCTA 


TGCCAACTGA 


AGACGAGGCA 


GTTATAAGTA 


CTTATAATAA 


600 


ATTGAAAGAA 


AAATTAAAAC 


AGCCAAAAAA 


ATTAGCAGAT 


GTGATTAAAA 


ATGAATTAGG 


660 


ACCTGATTCA 


GACTATTTAT 


CTGTCTTTAT 


CCCAGGCGGA 


CATGCTGCAG 


TTGTTGGTAT 


720 


TTCTGAAAGT 


GAGGACGTTC 


AACAAACATT 


AGATTGGGCA 


TTAGACAATG 


ACCGCTTTAT 


780 


AGTTACATTA 


TGTCATGGAC 


CAGCAGCACT 


ACTTTCAGCA 


GGGCTTAACA 


GAGAAAAATC 


840 


TCCATTAGAA 


GGATACTCTG 


TTTGTGTCTt 


CCCTGaCTCA 


TTAGATGAAG 


GTGCAAATAT 


900 



(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3642 base pairs 
so IB) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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ATCTGAnTnG AGATAGTGAT AATGTGTCAC CcAATTTTaA AcCtTTATTT TTaAGcGtTT go 

CaTCaGCTAA CaCtTCaTTa TCtTTAGtCG cTTTATGcCC CTCtATTAAA CtCXSGaACtA 120 

AAAATGaTGa CtTTTCAACa CCGAACaCTA AAACaTTGtC tTTTTGATGG CCaTTAGACA lao 

CAATTTCCCC TGTTTGCTTC AAAGTAGCTT GCTTCTTGTA TTTATTTTCA ATATCTTTCT 24 0 

TGTTAAAAAC AGATTGTTGC ACAGTTTGAT TGGCATCTTT ATTTAGAACA ATGGCATCTG 300 

CTTGCCACTT ATCAATGCCT TCTTTATTCA TATTGATAAG ACCATTCGCC AATCCAGATA 360 

ATAAAAATAG CAAGTAACTA ATCATCGTTA ACACACCAAT AATTAGTCCA AACTTCAATT 420 

TGTTGCGCCG TATTTCATTC CAAGCTAAAA ACATGCATTT CTCTCCCTAC TACTATGATT 4 80 

TAAACATTGT TTATATTCTT AGATGCACGT ACGTCGTGTT GCGCTCTGTA ATGTTATACA 540 

TACACTTATC CTTCATTATA CCCGAACTTT TTATATTAAA AGCAAATTTA TGGAAAATGC 600 

AATTAATTGT CTATTATTTT TGTACGGTAC ATTTAAAATT AAGGATCAAT TTAAAAAGGC 660 

CTACATATAC CTTTAAGTAC ATGTAGACGT CCAATTCATA TATTATTTAA CTTCGCCTGT 720 

TTTAGGATCG AATTGCTTAA TAGCATTTTT ACGTAATTTA TCTTTTGCTT TrTCACTTGC 7 80 

TTTATAGTTA TTGTTGTAAA TCGTAGCTTC CCAACTACCA TACATTGGGT TAGGGAAAAT 840 

GATATATTTC TTACCX3AAAT CGTCTTTATG TTTTTCAATT AATGCTTCAC GAGATTCAGC 900 

TGTAGCTTCT TTTGGATCTG TAAAGTCTAA TAAATTATCT CCAAATAGCA TGACAAGTTT 960 

ATGATC C TTT TGAACCATTT GTCTGCGTGA TTCTTTACTC TTATCATCTT TACCTTTTAG 1020 

TAAAATATGA CTCTTCTTAG CTTGAGGGAT ACCTTGTTGT TTTAAGTTCT TTTGTGTTGC 1080 

35 CTTTAAATCT TTTT CT TTAT CTCTATCAGA AATATAGTAG ATATCGACAC CrTTTTTGTC 1140 

AGCATATTTC AAGAATTCTT TTGCGCCATA GACAGGTTTA GCTTTAGCAG cTTGTACCCA 1200 

TTCATGCCAA CCTTCTGGGA AAGGTTTATT ATGTATTGAT GCATAGCCTT GATATGGAGA 12 60 

ATTATCTAAA ACTGTTTCAT CTAAATCCAA AGCAATAGCT AACTTATGTT TACCTTTATT 13 20 

CTTTTTAATC TCTTTATCTA ACTGTGTCTT TGCACTGTTA TAACCTTGTA AATATAATGC 13 80 

TTTTGCTTCA GCTGAATTTT GATACCAAGC CACTGCCATA ATATTTTGAT TACCAAGATT 14 4 0 

CGCC T TTT Q T GATGCTGGTA TAGAAGCTTG TTGCXfTTTGT TGAACTTCAG CAGAACTTTT 1500 

GGCAAACGCT GTAGAATTTG TCGTTTGTGG TGCTGAAACT GTAACCX5CTA CCGATAATGA 1560 

so TGCTATTGCA ATATACTTTG AAATTTTATT CATCTTATCA CCTCATGATT AATATTTAAA 1620 

ATACAGTTAA AATTATAAAT GCATTTATTT AATATTGCTA TACTATGAAA AGATATTTAA 16 80 

TATTATTTCT TGgAAAAGCT AACAAATATG TGAACATTTC ATATAAGCAT GATTTAATGG 174 0 
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- GTGACTAGGC 


CTTCCTATCA 


GACATATTCA 


CTCATCCACG 


TATCATTATG 


TGTACAGTRT 

X X r\K^xWJ X w X 


1860 




GCTATCTCTT 


ATTTACCTAT 


TGGAACAACC 


ATAAACTCAT 


CCATAGTTTA 


CCTTTTATAA 


1920 


5 


ATAGCAGTCC 


TCACTCATAC 


AATTTCTCAT 


AAAAATCACA 


ACGCTCCAAC 


GTATTTCCAA 


1 O O O 




CTTACTTTCA 


CCTATTTTAA 


TTCATAAAAA 


CGACACTTTA 


ATTGTCATTA 


TCCAATAATA 


^U4 0 


10 


GCAAGACGTT 


ATTATTGCAA 


TCTTTTTTAT 


AAAATAATAG 


AATCATAGTA 


X X X Va>r\ XXX £^ 


^ xuo 


AAGATAAAGT 


AAGAACGTTT 


TTATTTTTCA 


GAl-lTriTAA 


ATTATTATGA 


X A X w X X X 


Z X o U 




TTAGGAAGGA 


AATTACATTG 


AAAAAACAAG 


TTATTATTTC 


GGGCCTCATG 


X XaX X X XL.nL. 


*5 O A 


IS 


TArrrrn'GG 


AGCCGGAAAT 


TTAATATTCC 


CGCCCATGCT 


TGGCCATACA 




2280 




ATATGTGGAT 


TGGTATGCTA 


GGCTTTGCCC 


TTACAGGCAT 




X X lAL«l\j 


234 0 




TTATTGTTGT 


TGCATTTTAT 


GATGAAGGTG 


TTGAAAGTGT 


AGGCAATCGT 


AT* H O 2i TTT* 21 T 
Ax V.\Jnl 


24 00 


20 


GGTTCGGGTT 


TATTTTTGCT 


GTCGTGATTT 


ACATGTCTAT 


CGGAGCATTT 




*1 yt ^ A 

^4 b 0 




CACGTGCTGC 


AAATGTCGCG 


TACGAAATTG 


GTACAAGACA 


V—*^ X X X X /^^Vv X 


fSTrj r* n T* a a r* 




25 


AATGGACTTT 


AATTATATTC 


GCAGCAATCT 




%v w XXX X 


/\X xA'kJX X xAn 


O C Q A 


ATCCATCGAA 


AATCGTTGAT 


AATTTAGGTA 


AATTATTAAC 


accgttatta 


V.XAX X/v\XvJVT 


O £ A A 




TCGCTCTATT 


AAGTATTGCT 


GTCATTTTCA 


ACCCTGAATC 


TGCACTAAGT 




*> T A A 


30 


ATAAATATAT 


AACACATCCT 


TTCATTTCAG 


GAAGTTTGGA 


W X X X X X 


ACAATf^rJATY* 

AVnTVl X VJ\Jn X \« 


O C A 




TTGTTGCTGC 


GTTAGcrrrr 


TCCGTAGTCA 


TTGTCAATGG 


CTATAAGTTT 


AAAGGPPTT'A 


•5 Q *> A 

^ 0 ^ U 




CAGATCGCAT 


GAAAATTTTA 


AAATATGTCT 


GCTTTTCAGG 


TCTTATTGCA 


GCCATATTAC 


^ ft ft n 

^ 0 0 w 


35 


TTGGAATGAT 


TTACTTTGCA 


CTTGCATACX; 


TTGGGGCATC 


AACAGCTCCA 


GGAAAf'**m*A 

V7V3XWW» XXX X^ 


^ 27 *t U 




AAGATGGTAC 


AGATATATTG 


ACGTACAACT 


CATTACGATT 


ATTTGGXTC!G 

*• XXX WV7 X X 


X X WVJW X XVlWVv 


J V V u 




TCGTATTTGG 


AATGACGGTT 


ATCCTTGCAT 


GCCTAACAAC 


ATGTATAGGA 




0 \J KJ KJ 


40 


tTGCGCCACA 


TTTACTAAGA 


AACACGTACC 


TAAGTTTTCT 


TATAAAATAT 


TCGCAdTTAT 

X N^WXai^^w X XX% X 


J X ^ V 




TTTCt CTATC 


ATAGGGTTCT 


TATTTACAAC 


ACTTGGTTTA 


GAAATGATTT 


TAAAAATTGC 


3180 


45 


TGTCCCATTA 


TTGACTTTAA 


TATATCCCGT 


GTCGATTGCA 


CTTGTACTCA 


xx^xwnx X Xw^ 






TAACATGTTT 


AGCACATTCA 


GATTCAGTTG 


GGCCTATCGA 


CTCGCAACTG 


TTAT'TA ATT* 

X X^\X X X X 


J J w w 




GATTATTTCA 


ATTTTACAAA 


TACTAAATAG 


TTTCAACTTA 


TTACACGGTG 


TTATTTTGAA 


3360 


SO 


ATCGTTTATG 


ATGTTACCTT 


TAGCAGATAT 


CGATTTAGCT 


TGGCTTGTAC 


CATTCATGCT 


3420 




CTTTGCTATT 


ATCGGTTTCA 


TAATCGATGT 


ATTTATACGC 


CGTCCGAAAC 


AAGCGACAAC 


3480 




TTAATAAATG 


CTCACTGCCT 


AGTAATGATT 


GACCCATCGT 


TACTAgGCTT 


TTttATATGA 


3540 



55 
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TAAAAAATCC TAGCTGTTAT TCAAAAATAT TAGTTTTTAA AA 3642 

(2) INFORMATION FOR SEQ ID NO: 329: 

^ (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

,5 TTATTGATAT TGAAAATTCA AAAACTGCAA CACATCTTAT CACTAAAAAC CAACACTATC 60 

ATCAAACTGA CACTCATTTT GAACAGTATA AGAAATTTAT TTTAGATTCA GGTATATCAT 120 

CAACACAATT TGTATATAAT AACCTGTCTG TAAGCGGATT TAAATATACT AATGATGGTA 180 

AGAATCCAAT TCAATTATCT GACATAGTGT ATCACTTAAT CGCATTATTA GGATATGGCG 24 0 

GTGGCATTAG CTATCAATTA TTAGATGACC ATTCAAATTA TATTTCCTTG TACAACAAAT 300 

ATGGTAGCCC CCTACCATTA ATGCATTTAT ATAAAATGTT TAGACCTTTT GTTAATGAAG 3 60 

25 

ATATTGAAAT TACAAATAAT TATGTATTGA GTCGTAAAGA TAATAACTAC CATTTCTTAT 420 

TATTCAATAA AATTAATGAT CGATATATGT CAGACGTAAA ACAAGATTTC ATTTTCCATA 4 80 

^ ATGAATTACC TCAAGACTCT TTGATGATTA TTAAAACATT GAATCATGAA CATGGTTCAA 54 0 

TTCAACATTT GCTTCCAATA AGCGATCAAC TTGTTTATAT AGAAAAAGAA ATTTTAGATG 600 

AATTAGACAA AACGAATTAC CCTAAAACGG AGCTTGCAGT TCAAGAAGAA ACTGGTCGTA 660 

55 CATTTGAACT CAAGTTAAAT CACGACGAGG TTAAATATAT TTGCTTTAAA CCAAGCTAAA 720 

TACTAACAGT CCTCTTGTGT TTAGTTTCTT ACGTTAAAGG CTATTTATAT CATAAGGAGA 780 

TGATTTGTAA TGAGTAACTC ACAAGCAATT CAAGCAATTG AAAACGTGTT AGTAACGTCA 840 

40 

AAAGTTGGTG TATTATCAAC TGCATATAAT AATAAnCCTA ATAGTAGATA TATGGTCTTT 900 

TATAATGATG GTCTTACTTT ATATACTAAA ACGAATATCC ATTCTGCTAA GGTCAAAGAA 960 

ATTAAAGATA ATCCAGCAGC ATATGTTTTG TTAGGCTATA ACGACACAAC TAATCGTAGT 1020 

45 

TTCGTTGAGA TGGAAGCGAC AATCGAAATC GTTACAGAAC AAGAAGTGAT TGATTGGCTA 1080 

TGGGAAACAC AAGACAAAAG CTTTTTCAGT TCAAAAGAAG ACCCAGAGTT ATGTGTTTTA 1140 

so AGAGTAGTTC CGCAATCCAT TAAGCTAATG AATGATAAAT CATTAGATAC ACCTATCAAA 1200 

ATCGATTTAT AACACAAAGT GTATATAGGA AATAACTTTT ATGAATTCTA GATATAACAA 1260 

TGTTAA^TAC TTAAAATAAC TCGCTATAAT TAAAGTGTTT AATATGTTTA CAATTCAATT 1320 

55 
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5 



10 



15 



20 



25 



TTTAAATTAA 


TTTTATGTAA 


TATAAATACT 


GCATTTGCAA 


ACTGTTGCAC 


TTTTAGGTAT 


1 A A f\ 


AACAGAATTA 


ACTACATTTA 


AGGAGATTGA 


TGAACCATGA 


AAAAGAAAAA AGGTTTTGGT 




CTTGGTATTA 


GTTTAATCGC 


CATCATGTTA 


ATTGTATGTA 


TTGTATTAGT 


AATCATGATG 


X 3 O w 


ATGACTGGCG 


GAAAGAAAGA 


TACATACTAT 


GGAATTATGA 


AAGATAATAC 


TACTATTGAA 


X V ^ U 


AAAATGATTA 


GTGAAAAAGA 


TGAAAGTATT 


GAAAAAAATG 


TTAAATTACC 


TTCAGATTCA 


1 6 R 

X V O U 


GATGTTAAAG 


TTAAAAAAGG 


TGATTTTGTA 


ATTGTTTATA 


AATTAGCAGA 


TTCAGATAAA 


174 0 


ATTGTTAAAG 


TTAAAAAAGT 


TGACCATGAC 


GATGTACCAC 


ATGGTTTAAT 


GATGAAAATT 




CATGACATGG 


GCAAAATGCA 


CATGAAACAC 


TAATTGTAAT 


TTAAATTACA 


AATTTTAGTT 


1 fl fin 

X O D U 


GCCATCAAGG 


TATATACGAG 


TAAAAGCAGC 


GGTAAGTTGA 




GQAATf AT* 1 " f ' 
wann x ^.xlX X X 


X 7 ^ I,/ 


TACTGCTGCT 


TTTTATATTT 


GAAATACTTT 


CATATTGAAT 


AGCTCCACTT 


GCCGTTCGCC 


X 7 O U 


TgcgCTTTGC 


GCATG CATAA 


AAGCCCCTAA 


CAACCTGAGG 


TCACTGCGCT 


CCGGTTCGCC 


2040 


TGcgCTTTAG 


CGCATGCATA 


AAAGCCCCTA 


ACAACCTGAG 


GTCACTACGC 


TTCGGTTCGC 


2100 


CTGCGCTTTA 


GCGCATGCAT 


AAAAGTCCCT 


AACAACCTGA 


GGTCACTACG 


CTTCCGTTCG 


2160 


CCTGCGCTTT 


AnCATGGCCA 


TAAAAGC 








2187 



(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 
CCnCCTTTTT AAACCTGGnG AAATGtmCAG tTTTGATGGt ATTGGGtTCT TTAGTATTAT €0 

40 

GCTTAAGATA GAGTGTAATA CGCTCCTGTT GTTCTTTATA GTATATTGCT rm'CrrX ' rrr 120 

CTTTTTCGTC ATATTTCACT TTTAAATAAA TGACTGATGC AACTATACAT ATACATAAAA 180 

TGACACCAAT AATTATAAAA ACATGTTTTT TGTTCATTAA AGTAACTCCT AAAATGTGGT 240 

45 

GGAAAATGAA AAAATTTTTA TAATCTATAA TTATGAACAT TACAGATTAT AAACCATAAC 3 00 

ACTAACATCG TCGCCTTCAT TAACTTGATT GTTAAAATCA GCAATTACTG AGAATTGTAC 3 60 

so AAGGAATAAT TGCCTATTAT GCCCTCATGT AATTATTGCC TTACTAACAT TAACAAAATC 4 20 

GATAGCTATT ACATTAAATG CCTATACCCC AGACCTCAAA CACCTTTTTA TACAGGACGC 4 80 

ACTGTAATTT CATyyACGTT AACGTGCTTT GGTTGTGTTA ATGCATATAA TACTGCTTCT 54 0 

55 
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GTATCTACCA TTCCTGGAGA AATGCTTGTT ACTTTAACGC CTGTCTTTGC CAACTCTTTT 660 

TCTAATCCTT GAGTAATAGT GTGAACTGCT GCTTTCGTCG CACTATAAAT CGTACTACTT 720 

TTCGTTACTT CAAAGCCAGA AATAGATGcA ATGTTAATAA GATGGCCACT TGATTGTTCT 780 

AACATAGTTG GTAATGCAGC CTGTGCCGTA TATAAAGTGC CTTTGATATT CACATCAATC 84 0 

ATACTATCCC ACTCATCTAC TTGATAATCA GTAATCTTAG ACGACAACAT TTGCCCCGCA 900 

CTATTGATAA CAATATCCAA ACCACCGAAT GTTTGTTGTG CAATTTTTAT CAATTCATCG 960 

ACTTCTTCTT TATTCGTTAC ATCTGTTGGC ACTACCTTCA CACTATCTTG TGACAATTGA 1020 

IS TTCGCTACGT TTTGTAATTT TTCTTTATTT CTACCTGCTA AGACAACTTT TGCCCCTTCT 1080 

TCATGTAGTA ATGTTGCAAT TGCTTCTCCA ATACCACTAC CTGCACCTGT AACTACTGCT 1140 

ACTTTATCTG TTAATACTGT CATAATGATC GACTCCTTTG ATTCTTTTTA TTTTTTCAGG 120 0 

GTAAATCATA AATACATATT ACTTTTAAAA AGCGTATCAC AATTCATATA ACGGTCATAA 1260 

TAACTCGCTT CATTTTCATA GATAAATTAC ATTACAAGCC* ATTCGAAACA TACAATTAAT 1320 

CGTTGCTTAT ATTTTTTATT TTTAAAAATG TTGAAAAATC GTCACTXCTT TATTGTAAAA 1380 

ACATTATATT AGTAATAAAG TTAATACTGT GnATTTaTCA TTCGATTGAA TGATTAGAGG 1440 

GAGGAATAAA ACgTGACATA TCATGAGCX3T GTTTTAGCAT TAAGAGCAGA AAGTAAAAGA 1500 

ACCGCATTTG ATTTTCGATT CGAAGATTTA TTTAGCAAAG AAGAATGGcT AAGTATGTCT 1560 

CTTGCAGAAA GACAAAAAGC TGAAAAAGCA TTTCGACACG AgTTAAAAAT ATGGACGATG 1620 

TAAGAATGCC CTTCTCAAGT GTCCATGACG CCCAAGTAAA ATTATATAAT GTTGTATATT 16 80 

CTTATAACGG CaTTAAACGT AATTTTAAAC AAGTTGAAAA TGGAAGGATT CTAATATCAT 174 0 

TTCGTTTATA TATnGCAGAC CATGGATAGA ATTTTnTATG GTnAATCC 1788 
(2) INFORMATION FOR SEQ ID NO: 331: 



20 



25 



30 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 
TAAGCCAAAT CCATTGTTAC CAGAAGAAGT TCGCTTCATC TTAAAACAAA TGGGTCTTAA 60 
AGAAAAGACT ATCGATGTTG AACTCGAAGT TGGCGAGCAA GTTCGTATTA AATCAGGTCC 120 
ATTTGCGAAT CAAGTTGGTG AAGTTCAAGA AATTGAAACA GaTAAGTTTA AGCTAACAGT 180 

55 
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GCTTTAATTA ACAATTAAAG TTATTAAACT AACCAAAAGA TAAAAAAGAG TATTGATTTT 300 

TTAATTAGAA AAGTGTTAAA ATTATGTGGT cGcGCTTTTA GAGCGCCCAT TTCGTcACGA 3^0 

AATGTTAAGA GTGGGAGGGC AAAACTGAGC CCTGTGACCA CATCACGATA TCAAGGAGGT 42 0 

GCACATCGTG GCTAAAAAAG TAGATAAAGT TGTTAAATTA CAAATTCCTG CAGGTAAAGC 4 80 

GAATCCAGCA CCACCAGTTG GTCCAGCATT AGGTCAAGCA GGTGTGAACA TCATGGGATT 540 

CTGTAAAGAG TTCAATGCAC GTACTCAAGA TCAAGCAGGT TTAATTATTC CGGTAGAAAT 600 

CAGTGTTTAT GAAGATCGTT CATTTACATT TATTACaAAA ACTCCACCX3G CTCCaGTATT 66 0 

^5 ACTTAAAAAA GCAGCTGGTA TTGAAAAAGG TTCAGGCGAA CCAAACAAAA CTAAAGTTGC 720 

TACAGTAACT AAAGATCAAG TACGCGAAAT TGCTAACAGC AAAATGCAAG ACTTAAACGC 780 

TGCTGACGAA GAAGCAGCTA TGCGTATTAT CGAAGGTACT GCACGTAGTA TGGGTATCGT 840 

TGTAGAATAA TTTTACGAAT ATTAAATTTG ATTACATGAT TTAAACGATG AAGCAGATAA 900 

CAGAGATAAT AATGATGAAT TATAAATATA ATCTGAATGA CTAGATTAAT GATTGATTTA 960 

TTCATAAGAT TAATTCTTCT GTTGTCTGCy CTTAACTTGC ATATAGCAAG TAATGTGGGA 1020 

GGAAATTCCG CTAAAACCAC TAAAGGAGGA ACTATAAATG GCTAAAAAAG GTAAAAAGTA 1080 

TCAAGAAGCA GCTAGTAAAG TTGACCGTAC TCAGCACTAC AGTGTTGAAG AAGCAATTAA 1140 

ATTAGCTAAA GAAACAAGCA TTGCTAACTT TGACGCTTCT GTTGAAGTTG CATTCCGTTT 120 0 

AGGAATTGAT ACACGTAAAA ATGACCAACA AATCCGTGGT GCAGTTGTAT TACCAAACGG 126 0 

AACTGGTAAA TCACAAAGTG TATTAGTATT CGCTAAAGGT GACAAAATTG CTGAAGCTGA 1320 

3S AGCAGCAGGT GCTGACTATG T 1341 
(2) INFORMATION FOR SEQ ID NO: 332: 



25 



30 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

CTCTAAATCT TCAATAGGTA ACTTCATTTT AATAATTCCC ATTTGAACAA TTGCTTCTTC 60 

SO KTCKTTQKQK ATTTCTAAAA CTTCACCCTT TTGACCGTAA GAkAATACTT TkACTTCATC 120 

ACCAGCAACA ATTTTATCGT ATTTTTGCTT TTGTACATTT TGCTTTATAG ATTTCGCTTC 180 

ATAATGATCA TCTAATCGTT TCTTCTTATC AATCAATTCA TGTTCTTTAA CATCTGCACC 24 0 
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CTTAATCTTC TGATTTGCTT TkTCTTTCGC TTCCTCTATT AGAGACTTTT CATAATTTTG 360 
GAATTGTTGA TACTGCTTAG ATAAATCATC ATGCACTTGt CCGCkTskTT TACAAGACGG 420 
TCCAGkTCTA ACCTCTGTGT CTCTACACGT TTGTAATTAC GCTCTAATGA TTCAATCATT 480 
TCATTTATTT CTTTTTCATC AGTACCAATC ATCGTCTTAG CCTTATTAAT AATATTCAAA 540 
CTAAGACCTA ACTTTTTAGA AATGTCAAAA GCATTTGAAC GACCCGGCAC ACCCATTAAT 600 
AACTTATACG TTGGACTCAA AGTATCTACA TCAAATTCTA CACTCGCATT CATAACGCCT 660 
TCTCGATTAT AACTATATGC TTTAAGTTCA GGATAGTGCG TCGTTGCCAT TACTAGAGAA 720 
CCAATTTTTC TAACATGATC TAAAATGCTC ATTGCTAATG CAGCACCTTC ACTCXX3ATCT 780 
GTACCTGCAC CTAATTCATC AAATAAAACT AAACTATGTT TGTCTGCATG CTTTAAAATT 84 0 

TCAACTATAT TCGTCATATG AGATGAAAAA GTTGATAATG ATTGTTCTAT TGATTGTTCA 900 
20 TCTCCGATAT CGCAATATAC ATTTTTAAAT ACACTTAACT GACTACCATC AAGTGTGOGA 960 

ATCAACAATG CTGATTGAGC CATAACAATA ATTAAACCTA ATGTTTTTAA TGTTACAGTT 1020 

TTACCACCTG TATTCGGTCC TGTAATAATT ACCGTTTCAA TATCTTCCAT AAATTCGATG 10 80 

GTATTAGCTA CAACAGTCTC ACGATTTAAT AATGGATGGT ATGCTTTAGG TAAATATACA 114 0 

GTACGGTCCT CTTTAAATAT CGGCTTTGTT CCTTTAATAC TTCTACTATA TCTCGCTTTT 1200 

GCGATTAAAA AATCTAACTG ACCCATGACT TGTTCTGCCA CAAGTAGTGC ATCTTTGTCC 12 60 

GCAGCCACAT AACCAGTTAG TTGCGTTAAA ATGCGTTCTT TTTCAATTGC TTCGTCATGA 13 2 0 

CGTAATCGAC TAATTTGATT ATTCATTTCA ACAACTGATG ATGGCTCAAT ATACAATGTT 13 80 

35 TGTCCTGAAG CAGATTGATC ATGTACAATC CCATTAAAAT CTTGTCGATA TTCAGCTTTG 1440 

ACAGGTATAA CGTTTCTTTC ATTCCTAACT GTTACAATAG CATCTGATAA TTTTTTCTGA 1500 

TTTGCTTGGC TTTTAACAAT ACGGTCCAAA TTTTGTCTAA TACGTTGATT CGTGCTAGAA 1560 

ATTTTACTTC TAATCCCTTG CAATTCATAA CTCGCATTAT CATATAAATC ATACGTATCG 1620 

CATGTTTCAT TTATTTGTTG AAAAAGATCA GTTAACACAG GTAATTGATT CATCTTGTCA 1680 

TCTAATATTG GGTATTTAAC ACCTTCATCT TCTTCAACCA ATTGATTATA AAATGTCTTG 1740 

AATTGATTTT GTACTTGAAT TAATCTTTTT ATCAAGTTAA GCTCTGATAC ATTTAAAACG 1800 

CCGCCAATAT CAGCGCGATG AATGAATGCT GATACTTTAG ATAAGCCACT CAAGCTTGGT 1860 

50 AAACGATGCT TATTATAGAT TTGAGCAATC TCATCCGTTT CTTCCATTTG AAAAACAACC 1920 

GTTTCAAAAT TAGTAGCTGG CATCATTTGA TTGACCTTTT CCAAGCCTAA GTCACTAATA 1980 

GTTTCATTGG CAACGAGTGA TTTTATTTTT TCAAATTCTA AGACGTCTAA TGTTTTTTGT 204 0 
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TCACGCGATA ATGCGTTAAT 


CACT CTATCT 


TTTGTtACAA 


ATCCTTTTTG 


CGCAGTTGtA 


2160 


5 


CGCCATAATT CATAAAATCT 


AAATGATTTG 


TATGATGCGC 


ATCAGTGTTA 


ATAGTTAATT 


2220 




TCACATTTGG ATATTTACGA 


At- e ATATCAG 


CGCTCAGATC 


CAGTCGATGT 


GGATTGGCAT 


2280 




TAATTTCTAA 


TACTGTATTC 


GTTTCTT CAG 


CTAATG CCAT 


TAATTGTTCA 


ATATTCGGTT 


2340 


10 


TATAACCATC 


TCTTCTACCT 


A 1 AATACGCC 


CTGTTGGATG 


CGCTATATGT 


CGCACGTATG 


2400 




GATTGCGACA 


TGCATTAGCT 


AAl 1 i CCA 


TAATTTGTTC 


TTCTGATTGG 


TTAAAGCTTT 


2460 




GATGAATAGC 


TCCAATTACA 


TAATCAAGTT 


GTGCTAAAAT 


TTCATCATCA 


TAATCCAGCG 


2520 


IS 


AGCCATCAGG 


TAATATATCC 


ATTTCTGTAC 


CTGAATAAAT 


ATCAATTTCA 


CTATATTCTT 


2580 




TATCTAAAGC 


CTTAATTTCT 


TCGTTTTGTC 


TTAAAAGTCT 


TTCTACTTGT 


AAGCCATTAG 


2640 


20 


CAACACGTAA 


ACTTTGTGAA 


TGATCAGTAA 


TTACCATGAA 


TTTATAACCT 


TTTGCGATAT 


2700 


TTGCTTCTAC 


CATGTCTCGA 


ATAGAAAACG 


CACCATCACT 


ATACGTTGTA 


TGCATATGAA 


2760 




TATCACCATT 


AATATCATCT 


ATTGTAATGA 


TATTACTTAQ ATCTTTATCA AATTCGCTAC 


2820 


25 


CATCITCTCG 


CATAGCAGGT 


GGTATAAAAT 


iVkMnCAi lAAA 


A i GTTCATAT 
• 


ATCTTGGCTT 


2880 




CACTATCATA 


TTGAATTAAT 


GTACCATCAG 


CTTGTTCAAT 


TCCATATTCA 


CTTACTTTTT 


2940 




CATCACGTGC 


TTTAGCAAGT 


TGTCGAATTC 


TTATATTATG 


TTCTTTTGAC 


CCAGTAAAAT 


3000 


30 


GCTGCAATGT 


ATGATAAAAA 


GCACTTGGTT 


CAATTAATCG 


AAAATCGACA 


CCAATCGTTT 


3060 




CATCATCATA 


CGCTAATTCT 


AATGAAACTT 


TTGTGTTCCC 


CACTGCAACT 


TCTTTTACTT 


3120 




TATTGGGAAT 


ATTTAATAAT 


TGCTGCTGCA 


CTGCTTTTGG 


GTTATCGGTA 


CTTATTATGA 


3180 


35 


AATCTAAATC 


TTTGCTCATT 


TCTTTAAAAC 


GACGGAAGCT 


TCCTGCAGAT 


GAATATTGAT 


3240 




CGATATAATT 


TAATGTATCT 


ATATAATCAA 


TGATTTCTTG 


ATTAAGTCTT 


CTCATTTGAT 


3300 


40 


CAATTGGATA 


TCTATCTTTC 


TTAGCACCAA 


GTTGTTTCAC 


AGCTTCTAAT 


ATGTTTTGTT 


3360 


CCGTTTTCTT 


AGCAAATCCX3 


CTTAATTCAC 


TAACTTTTCC 


ATTTTCACAA 


GCAACTTGAA 


3420 




GTGACGCTTT ATCAACAATA 


TTCAACTCTT 


TATATAGCTT 


AGCAATTTTC 


TTGCTTCCAA 


3480 


4S 


GTCCTTGAAT 


TTTCAAAAGT 


GGAATAAGAC 


CTTCCGGAAC 


TTCTTCCTGT 


AATTGCTGTA 


3540 




AATACTGAGA 


TTCACCGGTC 


TCACGGTAAT 


CATTGATTAC 


TTCTGCAACA 


CCTTTACCAA 


3600 




TGCCTTTtAA 


CTCCGTtACA 


TCAGATATTT 


CATCTAATGG 


TCGTTCATCT 


AATTCAAGAC 


3660 


SO 


TTTGAGCTGC 


TTTTCGaTAC 


GCTGaTATTT 


TAAAAGTATT 


TTCCCCTTTT 


AATTCCATAT 


3720 




AAGTAGCAAT 


TTGTTCTAAT 


AGTTTGATAA 


CATCTTTTTT 


TGTCATAATA 


ACACTCCATA 


3780 




AAAAGAAGAC 


CAGGACGTAT 


CATTAATATA 


TACCTTTGTC 


CTGACCTCTT 


ATGATAATTT 


3840 
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5 



10 



25 



30 



35 



40 



TAGATATTTT 


AAGCTGATGT 


TGTAATGCTT 


CGTTAGGATA 


TAATGCCAAT 


AGATATAACG 


3960 


TAAAGTGTAA 


GACAATTATC 


GTCATAAACA 


CACCAACTAT 


CATTCCCATT 


GCACGACTGA 


4020 


AAATATGAAT 


GTTTTGATAC 


GCTATTATTT 


TATCAAAAGT 


TACGATAATT 


AGATATAAAA 


4080 


TGAACTTACA 


AAACAATGTA 


ATCATTAAAA 


AAGCTACAAT 


CGCTTCAAAT 


CGATTTTGTA 


4140 


GATGATTAAA 


ATGAAACXSCA 


AAAGTTGTAT 


TAAATGCTGT 


TGTTTTAGGA 


TATGGAATAA 


4200 


ATACAATTAA 


TCTTTCTACA 


ATAGATTTGT 


AAAATTGACT 


GGCAATCCAC 


AATGATACAA 


4260 


nCGTTGCACT 


CAAATGTATC 


ATAGATAACC 


AAAAACCTCG 


TCTGAATCCA 


ACGATGACAA 


4320 


AATACACAAA 


GAAAATGATT 


ATGATAAAAT 


CAATGACCAT 


TTATTGCTCA 


CGCTGCTGCA 


4380 


ATTrGTGAAT 


TTGTTGTTTC 


AAACGTCGAT 


TTTTTTTCTT 


CTAGTAGTAC 


n-l"I"i'CATGC 


4440 


ATAATATTCA 


CAGCAGTTAG 


TATTGCTTTT 


CTTGAAGTAT 


CTAAACCTGC 


TGCTTTATAC 


4500 


CCTAATTCTT 


TTATTTTATC 


ATCAACTAAA 


TGTGCTACAT 


ATCGTATGTG 


CTCTGGGTTA 


4560 


TCTTCCCCAA 


CAATTGTAAA 


AAGCTGATCA 


TTAATTGATA 


CATTTACCTT 


GTTTTTAAAC 


4620 


TGTGTCATTT 


ATAATTTCTC 


CTGATCCTTT 


TTTTAAAATC 


TAAATTCACG 


TTATAAAATA 


4680 


TGACTGGATA 


GTTTGTCTGA 


ATTTGATACT 


AATATTGTTA 


TATTGCAATT 


ATGATAAAAC 


4740 


AACAACACAA 


TCTCTATAGA 


TGACTTAATG 


TTCTTTTTAT 


AATGAAATAA 


TGTAAAGAAT 


4800 


TTTCTATTCA 


ATACTTTATC 


ATGTTTAAAT 


TGTGTCACTA 


TAACATTTTC 


ATAAACATTA 


4860 


TACATGACCA 


CTATGTATTT 


TGTAAGTATC 


CGCAATTAAT 


TCTTTACAAC 


ATACATAAAT 


4920 


GTTTCTGACG 


TTATTATCAT 


TTATGATATG 


ATTATTTTTG 


CTAAAGACAA 


TGAAATTTTA 


4980 


TGAAAGGATT 


TACACAAATG 


GCGAATATCG 


TTTTTAAATT 


GTCGGATAAA 


GACATAACGA 


5040 


CATTAATGTC 


ACGCATTTCT 


TTTGATACTG 


AGAATTTACC 


TCAAGGnATG 


AnAGCACGTQ 


5100 


CAAAGTATCC 


AAAATACAAC 


TGTAAATATT 


TACCAT 






5136 


<2) INFORMATION FOR SEQ ID NO: 333: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4239 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



so (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

GGCCAAAATT GCACCTCCAC TTTCCTTTGA ACAAATCTAT TTTAAACGCA CATTCCCATT 60 
ATGACAAATT AAATGTGAGT nACATTTGTT TGTATTTTAA CATGACTACT AACGCAAACT 120 
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GGTATCTTTC 


AAAGATAAAA 


TCTTAATAAT 




W X\mXXXX^X 


CTAGATTTAT 


240 




CACAATATTA 


TATAGACCTA 


TTTTATTTTG 


ACGTAAGTTG 


err ACTA T r'T'T 
w X n\j X *vx ^ X X 


^&&Zk.^AAAA& 


300 


5 


ACCTTTATAA 


AAATTCATAC 


CTTTATGCTA 


TCGCTGTAGG 


^ X ^.n X X rVrt V. x 


xOl LJ\\.J\Li\X 


360 




AATTCTTAAC 


TATCCTTTGA 


TGATTGTTTT 


ATT7\GATTGT 


X X X X \3A X 


vxnl-A^ X 1 X 


420 


10 


CGAATTTCTA 


TAGTTCAATG 


CTACTAAAAA 


AACAGCCCAA 


AACTTTAATT 


TGTTTTGGAC 


480 




TGTTTTATAA 


TTATGCTTGC 


GATGGTGTTT 


TAGTTTCTGA 


AGTTTGTTCA 


GCAATGTCAT 


540 




ATTTAAACTC 


TTTACCATCA 


TGATCTACTG 


XX X X X 


ACCTTCAATT 


TGATTACCAT 


600 


IS 


CTAATATTAA 


TTCACTTAAA 


TTATCTTCGA 


* ^ VJ X X X X X X u 


TATCGCTCTA ATTAATGGTC 


660 




TTGCACCATA 


TTCTGGATCA 


TATCCTTCTT 


f' '"'T*t^<**f^ AT*T*n "I' 

w X VJ^VJIf^X XXX 


GTCTTTCGCT 


TTATCAGTTA 


720 




CAATAATATT 


TATGTTTTGT 


TCAGATAATC 


GATTTnTTAA 

Vm^X X XV3X XXV\ 


TTTATTAACC 


ATCATTGTTA 


780 


20 


CAATTTCTTT 


TAATTCTTCT 


TTTGTTAG TT 

AAA A A X X 


TATGG AAT A P 


AATGATATCA 


TCTACACGGT 


840 




TTAAAAATTC 


TGGACGGAAT 


GAATTTTTTA 


ATTf^T*T*T*T A A 

X X ^ X X X XJ^J% 


CATCGTTTTT 


CGAATTGTTT 


900 


2S 


CATAATCTTG 


TCCATCACTT 


GAACCACCGA 


ATCCIAG CLAAA 


TCGTTGATCT 


TGTAATTCTT 


960 


GTGCCCCAAC 


GTTTGATGTC 


ATTATGATAA 


X X \J X #^X X X w X 


GAAATCAACT 


GTACGTCCTT 


1020 




TTGTATCTGT 


CAAATGTCCA 


TCATCTAAAA 


CTTGTAATAG 


AATATTAAAT 


ACATCTGGAT 


1080 


30 


GAGCTTTTTC 


AATTTCATCA 


AATAAAATTA 

A AW^A^f^ A A«^ 


CAGAATATGG 


TTTACGTCTA 


ACTTTTl'CAG 


1140 




TTAATTGTCC 


ACCATCATCA 


TGACCAACAT 


ATCCTGGAGG 


AGCACCAACT 


AATCGGCTCA 


1200 




CTGCGTGTTT 


TTCCATAAAT 


TCACTCATGT 


CTACACGGAT 


CATCGCATCA 


TCATCGCCAA 


1260 


3S 


ACATTGATTC 


AGCTAAAGCT 


CTAGCTAATT 


CAGTTTTA C C! 

X X X X ^ 


AACACCAGTT 


GGTCCAAGGA 


1320 




AGATAAAGCT 


ACCAATTGGT 


CGTTTAGGAT 


CTTTTAACCr 


TGCACGGGCA 


CGTCTAACCG 


1380 




CTTTACTGAT 


TGAATTAACA 


GCATCTTTTT 


GCCCAATAAC 


TCTCTCATGT 


AATGTATCTT 


144 O 


40 


CTAGACTAAG 


AAGTTTTTCA 


GATTCTGTTT 


CATTGATTTT 


AGTTAATGGG 


ATACCTGTCC 


T c. r\ f\ 




ATCCTGCAAT 


AACTTCAGCA 


ATATCTTCTT 


CTGACAATGA 


AGTTGaCATG 


CCATTTTGTG 


T c c rt 

1 bo □ 




CATTCTTCCA 


TTCATTTTTA 


GCTTCTTCAT 


ATTGCTTTTC 


AAGTTTTGTT 


TGTTTATCAC 


T ^ 


4S 


GCAGgTTAGC AGCATTTTCA AACTCTTGAG 


CATGTACTGC 


GGCATCTTTT 


TCATTTTTAA 


•\ Q r\ 

looO 




CTTTTTCAAT 


TTCTTGTTCA 


ATTTCTTTTA 


AATTATTAGG 


TGTCGTATGA 


CTCTTAAGTC 


1740 


SO 


TTACTTTAGA 


ACTTGCTTCA 


TCAATTAAAT 


CAATTGCTTT 


ATCTGGTAAG 


AAACGATCTG 


1800 




AAACGTATCT 


GTTACTTAAT 


TTAACAGCTG 


CTTCAATAGC 


TTCGTCTGAA ATATTAATAC 


1860 




GATGGTGTGC 


TTCGTAACGA 


TCTCTTAATC 


CTTTTAAAAT 


AGCAACTGTA 


TCTACTACTG 


1920 
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TTTTGCGATA TTCATCTAAT GTAGTAGCAC CAATACATTG TAATTCACCA CGTGCTAATG 204 0 

CCX3GCTTCAA AATATTCGAA GCATCGATAG CACCTTCAGC ACCACCAGCA CCAACTAAAG 2100 

TATGCAACTC ATCAATAAAT AGGATGACAT TACCTGCTTG TTGGATTTCT TCCATAACCT 216 0 

TTTTCAGACG CTCTTCAAAT TCACCACGAT ATTTAGTACC TGCAACTACT GTTCCCATAT 2220 

CTAAAGACAT AACACGCTTA TCTTTTAATG TCTCTGGTAC CTCATTATTC ACTAtGGCtT 2280 

GCGCTAAACC TTCAGCAATA GCAGTTTTAC CAACACCTGG CTCTCCAATA AGCACAGGAT 234 0 

TGTTTTTCGT ACGTCTACTT AATACTTCAA TTACACGTGT AATTTCTTTA TCACGTCCTA 24 00 

TAACAGGATC TAATGTACCG TCTTTGGCAA TGACTGTTAA GTCACGAGCT AAACTATCTA 24 6 0 

AAGTTGGAGT ATTATTTGAC TTACTAGCTT GTGCATTTTT ATTACTCATT TCAGGGTTTC 252 0 

CTAAAGCTTT CACAACTTGT GCACGTGCTT TAGTAATATT TAA^TCTAGA TTTGCAAAAA 258 0 

20 CTCTTGCTGC AACACCTTCA TTTTCACGAA TCAAGCCTAA TAAAATATGT TCCGTTCCAA 2 64 0 

CAAAATTGTG ATGTAATTTT CTAGCTTCAT CCATCGATAA TTCAATGACT TTTTTAGCTC 270 0 

TAGGTGTATA ATGCAATGTA CCAACATGAT CTTGACCATG TCCGATTAAT TTTTCAACTT 2760 

CTTCAATTAC TTTATCTTCA GTGATATTAA AACTTTCTAA TACTTTTGCA GCAATTCCTT 2820 

CAGGTTCTTT CATTAACCCC AATAATAGGT GTTCTGTTCC TATATTTGAA TGATTTAAAC 2880 

GAATTGCTTC TTCTTGGGCA TGTGCTAATA CGCGCTGTGC ACGCTCAGTT AATCTACCAA 294 0 

ATAACATAAA TAATGACCTC CTACTTTATA TGTTCTCTTA GTATATCTGC TCGTTTTTCT 3000 

TTTACAGATT TGTCATCTTC TTCATCTAAT AAAAATGGTG ACTGTATAGC TACCATCAAT 3 06 0 

TCATTAAATT TAAAGTTTTG TAATTCAATG TAATTTAAAT CTATACCAAG TTTAACTcGC 3120 

TTAATCTATA AGAAGCCTCT TCCATAGTTA TCATTCTACA GTTTTGTAAA ATACCTAGCG 3180 

AGCGAAAAAC ACGGTCTTGT GTTTCTAATT GATTATAAGT GTCTAACTTT TGTCGTATTT 324 0 

40 GTTTTTCTTC ATGAATGATT TGATTAACAA CTTCTGTTAA TGTTTCTATG ATTTCTAACT 3300 

CAGATTTACC AAGTGTAAGT TGGTTGGATA CTTGATAAGT ATGTCCATAA ACTTGCGAAC 3360 

CTTCACCGTA AATACCTCTG ATTGTATATC CAAAACGATT AATGGTTTGA GCAATCCGTG 342 0 

TCATTCTTTT CATAATAGAT AGACCTGGCA AATGTAGCAT CACGCTTGCT CTCATACCAG 34 80 

TACCTATATT GGTAGGACAT GTAGTTAAAT AACCAAGTTG TTCATCATAA CTTATATCAA 354 0 

GGCTTCGATC TAATTCATCA TCAATTGATG AAGCTTGATT ATATAAAGCC TGTAATGTCG 3 600 

TGTCAGTTCC CATAGCTTGA ATACGAATAT GGTCCTCTTC ATTTATCATG ACACTTAAAG 366 0 

ATTCATCGTC ATTCACTAAT ACTGCGGCTG CTGGTTGTTT TATTAGTTCA GGACTAATCA 3720 
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CAAAGTTTGG 


CAAGGCATCT 


TGTACCTCAT 


TTATAACTCT 


AAATCCCATC 


ATTTTCAGTA 


3840 


GCATACATTA 


GTGGATGCAC 


ATGATTTTCT 


AAAtTACGCG 


CTAACCGAAT 


TCTAGAAGAC 


3900 


ATAACAATTG 


GTGTTTCTTC 


ATTACTTTTC ATCCATTGGC 


TGATATTATC 


ATGAATATTA 


3960 


TGCGTCATCA 


TGTTGCACCT 


CACTCTCAGC 


TTTTAGTGCT 


TTAATTTCAT 


CTCTAACAAT 


4020 


GGCTGCTTCC 


TCAAAATCTT 


GGATTTCAAT 


AAGTTTTTTC 


AAATATTCAT 


TCTTTTCTTC 


4080 


GATTTTTCGC 


TTTAAAGCTA 


TCTTTTTATG 


TGAAGAATGT 


GGTGTCTTTC 


CAACGTGCTC 


4140 


AAATTGTCCA 


CCTTGAACTC 


TGCGGACGAT 


ATCAATGATG 


TCATCTTTAA 


ATGTTGCATA 


4200 


ACAATTAGCA 


CACCCAAATT 


TACCAACATG 


TGCAATATC 






4239 


(2) INFORMATION FOR SEQ ID NO: 334: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 






CAAAACTTTG 


ATTATGTCGT 


GTCAGTTATT 


AATGCATACA 


GTAATGCATC 


GTTGTTATTT 


60 


TTTATTTATT 


TACTATTTAT 


TATGTACCAA 


ATCAGCCTTt 


CAGTAAAATA 


GGTCAAATTA 


120 


CTGATTTTCT 


AAATATAAAA 


TGCCTCCTAA 


TAACATACTA 


TTAGTACATC 


ATTAAGAGGC 


180 


TCTTGTGTTA 


TTTGCATACT 


AAGCGCTCAA 


ATTTAAATTT 


AAGATGAAGA 


TTCTTGCAAT 


240 


AATATTTCTA 


TATTTGTCGA 


TATATCCAAT 


GGATCTTCCA 


TTGGCAAGAA 


TCGATTTACA 


300 


ACATTTCCTT 


GTOGATCGAT 


TATAAATTTT 


GTGAAATTCC 


ATTTTATTGG 


GGaCCCAAAG 


360 


ATTCCTGGTT 


GTTCaTTCTT 


TAAATGCGtA 


AATAACGGAT 


GTTCATCGTT 


CCCGTTCACA 


420 


GATATTTTAG 


CTAACACTGG 


AAATGTTACA 


CCAAATTTCT 


CACGACTAAT 


TTTCAAGATT 


480 


TCTTCATTAG 


AACCTGGTTG 


TCGATTGTCA 


AAATTATTAT 


TCGGAAAACT 


CAACACTACA 


540 


AACCCACGAT 


CCTTATATTT 


TTGAAATAAA 


GTCTCTAGTT 


TTTTCAATTG 


TTCGCTATAT 


600 


ATACATTCTG 


TTGCAGTATT 


AACAACTAAA 


ATCACTTTAC 


CTCTAAATGC 


TTCTAATTTA 


660 


TAAGTTAAGC 


CTTTATAATC 


ACTTACTTCG 


ATATCATACA 


CATTTCTATT 


ATTCATAAGA 


720 


CACCCCTACA 


CAGCCTTTTT 


TATATTGAAT 


ATGTTCTTTT 


TAGAATGTTC 


TGATAAAATA 


780 


AGTGCOCGTT 


TACACCGTGA 


ACACACATTA 


TATAGCGTGA 


TACATTTTTC 


GAGCACACGA 


840 


TAAATAATGT 


TCGAGTTTAT 


GTTGTTGCTC 


AACCTATCCG 


ATTTACCGTC 


rm-iTCACC 


900 
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CCGTTATAAC CCCTCATTAT AATCATCCTT ATTTTCTATT TTTAAAAAGA CAATTAGACC 1020 

GCTCTTTAAA CTATAGaTTA ATACTTAAGk TAAACTCATA CATACTGATA CCATACGTTA 1080 

GATCTAACAA TTTAAAATTC GTTATAACTA TGGATTAAAG AGCTGCCCAA CTCATATAAT 1140 

CCTTAAAAAC TTCACATGTG ATTGTkTATT AAGCCCTCCT TTATCnTATT AAATATCCTT 1200 

ATAACCCTTT TAAAATTAAA CTGACACACT CATACATTGT TACAC 1245 



10 

(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1399 base pairs 
,5 <B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 
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CTTGTAAAGG 


TAGTTGTTCA 


TTTAAATTAA 


AACAGTATGC 


TAAGTTnGTG 


CTTATATTTT 


60 


GCAACTTCAA 


TTCGACAGGC 


TCATCTTGAT 


ACATGAATGC 


CTCAATTTCA 


TCATGTGATA 


120 


ACTTTTTCAA 


AATATCAATT 


AAATGrATAC 


TAAAAACGAT 


AAATAACATA 


TCCCAATTTG 


180 


AATTCAGCCC 


TAGCGATTTT 


AATTTGTTTA 


AAATCTTATC 


T-rXTrGAAAA 


ATTCGATTCC 


240 


TAATGTCTTG 


TATATCGTTA 


TCAGTCAAAG 


TTTCCCAATC 


TATATGTGAA 


TGAAGACCTA 


300 


AATAACACTT 


ATCCATTAAT 


AATTCATATA 


CCGTTAATGC 


AGAGACATCG 


AAACAACGTT 


360 


CTTCACTTAr 


AAAAACGCCA 


TCAACATCAA 


ATAAAATTTT 


CTTCACAATC 


CCCACTCCAT 


420 


TTCTGaAAAT 


TCAGaTATAA 


ATCATTCTAC 


TATTTGACTA 


AAAAAAGCGC 


AAACCCTATT 


480 


GAAGTaGATT 


TGCGCTTTAG 


CTGTTnAAAT 


TTTATAAATG 


TnTTTCAATT 


TCATCAGCAA 


540 


CCTGCTGTAC 


GTGTGTACCG 


ACAATAACTT 


GAGTTGAATG 


TTTGCCATTA 


ACAGTAACAC 


600 


CAACTGCACC 


GGCGTTTTTA 


ATCTTCTGTT 


TATCAATAAT 


AGATGTGTCT 


TTTAACTCTA 


660 


GACGCAACCT 


TGTTGCACAA 


TTGGTTAAAT 


TAACAATATT 


CTCTTGACCG 


CCTAAACCTT 


720 


CTAATATTTG 


TATAGCATGT 


TGATGATATT 


TACTTTGTTT 


AATATCATTT 


TCACCAGGAG 


780 


CAATATTATC 


TTTTACAACT 


GTTGGGTCAA 


CTAATTCATT 


TTCACCTCTA 


CCAATCGTAT 


840 


TCAAGTTAAA 


TACTTGGATT 


ACTACACGGA 


AAATCACATA 


GTATAAGATG 


AAAAATACAA 


900 


CACCTTGAAC 


AAGCAACATC 


AATGGATGAT 


TTGATACTGG 


ATTAATTAGT 


GATAACACAT 


960 


AATCTATCAA 


ACCTGCACTA 


AATGAAAATC 


CAGCTGTCCA 


ATGGAATGTA 


GCTGCGATAA 


1020 


ATAAAGATAA 


TCCTGTTAAT 


AACGCATGAA 


CAACATATAA 


GATTGGCACA 


ACAAACATAA 


1080 
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ACCAACCGTA AACTTGTTTT TTCTGAGTAG TTTTAGCTGT ATGATACATT GcTAACGCAG " 12 00 

CCGCTGGAAT ACCGAACATC ATGATTGGGA AGAATCCCGC TTGATAGCGT CCTGTAATAC 12 60 

CTTTTATAGC ATCTTTGCCA CTTTGGAATT TACCAATATC ATTAATACCA ATCGTATCAA 1320 

ACCAGAACAC ACTATTCAGT GCATGATGTA ATCCTGTAGG AATTAATAAT CTATTGGCAA 1330 
CACCATATAT GAAAGCTCC 

(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



1399 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

TATAGTTnTA TTATTTAGCG AAGCATTAAT ACTACCACCG GTTATAACAA ATGTATTTTG 60 

CGTATTAAAT TGAATC3GTAG GACCAATCAA AGTATATTCA ATCGCTGGAC CATCATTGTT 120 

AATTAATGAC TGCGCAACCT TAAAACTAAA TTGATCCATG GCACCTGCGC CTGAAAATCC 180 

AATATGTTCA TAACCTATTC TTCCTAGATC TTGTACCGTT GAAAAGAGAC CTGGTTGTAA 24 0 

AATCTTAATT GACATTTTCA ATCACCACCC AGTCATCAAC ATTAAAGTTG CCATCTGATA 300 

TATCTCTTTC GATTTGTATA AATTTCTGTT CATCTATTGC ATAAAATTGT ATCCATTCTC 360 

CTGCTTCGTA CATTGACATT GGTTCACGCT CGCTGCTAAA TACTTTTAAc GGTGTGCGTC 420 

CAATAATTTG CCATCCGCCA GGAGAATCTG ATGGATATAG TCCTGTTTGA TTATTCGCAA 4 80 

TACCTACAGA ACCTGCATGA ATTTTTAACC TTGGCTGATT ACGTCTAGGT GTATGTAGTT 540 

GTTCATCAAG TcCGCCTAAG TATGGAAATC CTGGCATAAA TCCTAGCATA TATATTAAAT 600 

40 AAGGTTTACT TGTATGTTTT TCAATAACTT GCTCAACAGT TATTCGATTA TGCTTTGCTA 660 

CTTCTTCAAT ATCTGGTCCA TATGTACCAC CATATTGAAC AGGTATTTTA ATAATACGAT 720 

TGGTTTGATT CACAGCATGA ACATTTTTTT CATTAAATTT GTTAAGTTCT AAATTTTCAA 780 

TTAATTTAGA AGATGTTATA GCTTGTTCAT CAAAATATAT TAGAACTGCT CGATACGAAG 840 

GGACAATATC TTGAATTTCT AATATTTCTT TTTCTCGTAT CCACCGTACC ATTGCTGTGA 900 

CATTACGATA TGTCTCTTCG GATATTTTAT TTTCAAAATA AATCATAATT GTCTGCTCGT 960 

TAATAAATCT TACATCCACT TTAAATCCCC CTTTGTATTG CAATAAACCA GTATTGAATA 1020 

CCTTTTCATT GTATCATTGA GAAGCACAAG TTGTTTAATA AGTAATTCAA ATCGCATATA 1080 
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TTAATATTGT AACTCTTACA CTAATTTAGG TTCTGCTATC ATTCGGTCTG ATGGAAAATT " 1200 

TTTACTTTTC ATCTGTCCGA TTTTTTGATT TTGAATATAA AAAAGCACGA CCGAAGTATC 1260 

ATTAACACAC TTCAATCGCG CAATTAAATA ATCTATTTGA TCATTTATTG GATATTAACA 1320 

ATTTTTACG 1329 
(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

GAATAGTGCC ATTTGGAAAG ACATAAAGAA TCCAGAAGCA CCTTTAGCGA ATATGCCGTG 60 

TTCATATAGA TTTGTGAAAC TGGCATGCCC GAATTGAGTT TTAAATGCAA ATAGAATCAT 12 0 

GACGAAACCA ACTACTATTA AACCAATAAT TGTCGCTATT TTAATGATAG AGAACCAAAA 180 

25 TTCTAATTCT CCGAAAAGTC TTGCGCTAAG TAGGTTGAAT GACATTAATA ACAGTACACA 24 0 

AAATAGTGCA CTTATCCAGT TTGGAATTTC TGGGAACCAA AAGCTAACAT ATTTTGCCAC 300 

AGCCGTTACT TCAGCCATAC CTGTAATAAT CCAACAGAAC CAGTATGTCC ATCCGGTAAC 3 60 

AAATCCTGCA AAAGGCCCAA TATATGTATT GGTTACATCT GCGAAAGATT TAAATTCAGT 420 

ATTCTGTATA ATGATTTCTC CTAAACCTCG CATAAACATA AATAACATAA ATCCTATAAT 4 80 

GATGTATGTT AATAGAATTG AAGGGCCGGT TAATGCAATC GTTTGACCAG CACCTAAGAA 54 0 

TAAGCCTGTA CCAATTGCAC CGCCAATTGC AATTAATTGT ATGTGGCGAT TGCTCAGTTC 600 

CCTTTGTAAT TTTTCAGCCA TAATACATCT CCCTTAAATA TAGATATGTT TATTATGCAC 66 0 

40 TTATATTGAG ATATATACAA TTATTTTCGG TAAAAATGTG TAAAATTCCA TGTTAATATA 720 

CTTTGGTTTT TATAATCATA TATAATAACC AATTGAAAAT TTAATTCTAT TGTAAAATTC 7 80 

ATGGATTATT CACATCTTGA AAAAGCTTTA ATGGTGCTAT TTGTGGCTAT TCTGTGACAT 84 0 

TTACATAGAT TTACAAAAAA ATTGTTGCAC ATATAATGCC AGTtTTTATA TTTCACAAAC 90 0 

GAAATGCGTT TAcTATAATA TTAGTTGAAA GCCATTTCAT AAAGAAACAG TAAAGGGGAA 960 

ATTtATCaTA GCmGaATTAC AAAGAGGTTT AGAAGGGGTT ATCGCmGCGG AGACTAAAAT 102 0 

AAGTTCAATT ATTGAAAGTC AATTGACTTA TGCCGGCTAT GATATTGATG ATCTAGCTGA 1080 

AAATGCGCAA TTTGAAGAAG TTATTTTCCT ATTATGGAAC TATAGATTGC CAAACGAAGA 114 0 
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TACACATTTT 


GAGGAGTATG 


TTACAGATCA 


CGTGCATCCA 


ATGACAGCAT 


TACGTACGTC 


1260 




ATTATCATAT 


ATTGCACATT 


TCGATCCTGA 


TGCTGAAAAT 


GAATCAGATG 


AAAATCGTTA 


1320 


5 


TGAAAGAGCA ATGCGTATAC 


AGGCTAAAGT 


AGCATCATTA 


GTTACAGCGT 


TTGCTCGAGT 


1380 




AAGACAAGAT 


AAAGAACCAC 


TTAAGCCTAA 


TCCTGACTTA 


AGTTATGCGG 


CAAACTTCCT 


1440 


10 


ATATATGTTA 


CGTGGGGAAT 


TACCAACAGA 


TATAGAAGTA 


GAAGCCTTCA 


ATAAAGCACT 


1500 


TATTTTACAC 


GCTGATCATG 


AGTTGAACGC 


ATCTGCATTT 


ACGGCACGTT 


GTGCGGTATC 


1560 




ATCATTGTCA 


GATATGTACT 


CAGGTATTGT 


AGCAGCCGTA 


GtTCTCTGAA 


AGGGCCATTA 


1620 


15 


CATGGTGGTG 


CAAACGAACA 


AGTTATGACG 


ATGTTATCTG 


aGATTGGGTC 


AaTTGAAAAT 


1680 




GTTGATGCTT 


ACTTAGATGA 


AAAATTTGCT 


AATAAAGrTA 


AAGTAATGGG 


CTTCGGTCAT 


1740 




CGTGTATATA 


AAGATGGTGm 


tCCTAGaGCG 


AAaTATTTaA 


GaGAAaTGAG 


CCGTCAAaTT 


1800 


20 


tnCGAAAGACG 


CTGGTCGTGA 


AGAATTATTT 


GAAaTGTCAG 


TGAAAaTGGA 


AAAmCGTATG 


1860 




GCAGAAGAAA 


AAGGATTAAT 


TCCTAATGTT 


GATTTTTATA 


GTGCGAGTGT 


TTATCACTGT 


1920 




ATGGAAATAC 


CTCATGACTT 


ATTCACGCCA 


ATCTTTGCTG 


TAAGTCGTTC 


TGCAGGATGG 


1980 


25 


ATTGCTCATA 


TTTTAGAACA 


ATATAAAGAT 


AATAGAATTA 


TGCGTCCTAG 


AGCGAAATAT 


2040 




ATTGGCGAAA 


CGAATCGTAA 


GTATATCCCG 


CTTGrAGaAA 


GAAAmTAATC 


AATACAAATT 


2100 


30 


AAAAATGAAG 


ATGTAAAATT 


TGGAGGTAAA 


ATAACTATGA 


CTGCAGAAAA 


AATTACTCAA 


2160 


GGAACTGAAG 


GATTAAACGT 


ACCTAATGAA 


CCAATTATCC 


CATTTATTAT 


CGGTGATGGA 


2220 




ATTGGACCGG 


ATATTTGGAA 


GGCAGCAAGC 


CGAGTTATAG 


ATGCTGCtGT 


TGAGAAAsCC 


2280 


35 


TATAATGGCG 


AAAAACGCaT 


TGAATGGAAA 


GAAGTGCTAG 


CTGGCCAAAA 


AGCATTTGAT 


2340 




ACAACTGGTG 


AATGGTTACC 


TCAAGAAACA 


CTTGATACT^ 


TTAAAGAATA 


TTTAATTGCT 


2400 




GTTAAAGGAC 


CTTTAACAAC 


ACCAATTGGT 


GGTGGTATTA 


GATCATTAAA 


TGTGGCTTTA 


2460 


40 


CGCCAAGAAT 


TAGATTTATT 


TACTTGCTTA 


AGACCGGTAC 


GTTGGTTTAA 


AGGAGTACCA 


2520 




TCACCTGTTA 


AACGTCCACA 


AGATGTTGAT 


ATGGTTATTT 


TCCGTGAAAA 


TACTGAAGAC 


2580 




ATTTATGCTG 


GTATTGAATT 


TAAAGAAGGT 


ACAACAGAAG 


TTAAAAAGGT 


AATTGACTTC 


2640 


45 


TTACAAAACG 


AAATGGGTGC 


GACAAACATT 


CGATTCCCAG 


AAACTTCAGG 


TATTGGTATT 


2700 




AAACCAGTTT 


CTAAAGAAGG 


AACTGAGCGA 


TTAGTTAGAG 


CAGCTATACA 


ATATGCTATC 


2760 


SO 


GATAATAACC 


GTAAATCAGT 


TACTTTAGTT 


CATAAAGGTA 


ATATTATGAA 


ATTTACAGAA 


2820 


GGCTCATTTA 


AGCAGTGGGG 


TTACGATTTA 


GCATTATCTG 


AATTTGGTGA 


TCAAGTATTC 


2880 




ACTTGGCAAC 


AATATGACGA 


AATTGTTGAA 


AATGAAGGCA 


GAGATGCTGC 


TAATGCTGCT 


2940 
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TTACAACAAA TTTTAACTCG TCCAGCTGAG CATGATGTTG TAGCAACTAT GAACTTGAAT ' 3060 

GGTGACTATA TTTCAGATGC TTTAGCTGCA CAAGTTGGTG GTATTGGTAT TGCGCCAGGT 3120 

GCAAACATTA ATTATGAAAC AGGTCATGCT ATTTTTGAAG CAACACATGG TACAGCTCCA 3180 

AAATATGCAG GTTTAAATAA AGTGAATCCA TCTTCAGTAA TTTTAAGTTC TGTATTAATG 324 0 

TTAGAACATT TAGGATGGCA AGAAGCGGCA GATAAGATTA CAGATTCAAT TGAAGATACA 3 3 00 

ATTGCTTCAA AAGTTGTTAC TTATGACTTT GCCCGTTTAA TGGaTGGtGC TGAAGAAGTT 3 360 

TCTACATCAG CATTTGCAGA TGAATTGATT GnAAATTTAA AATAAGCAGA ATAGAATTAG 3420 

G 3421 



75 

(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

CCCTnGATAC CCAAACTGCC CTAATACCAC TGGCTAAACC TAATGGATAG TACCATTTAT 60 

TTTCCAATAA ATAAnCCAAC TGCAATnGCT ATaACTCTAA ATATAATAGA GATAATCmcA 120 

30 

TTAATTGGAT TAAtACGCCA AATACTAGTA ATAATAGGCT AGATAATAAT CCACCTAAAA 180 

AGTA C TTTTT AATTCCAAAG AAAGCTAATA TCAATAATGC TGCCGGTGCA GATAATTGAA 24 0 

AATCTAATCC TGGTATAATG GACGGTATTT TCAAAACTGC CAAAATGGTT AAAATCGCAG 300 

35 

CAATGACACT AATTTGAGTA ATATCTTTTG ATGTCATACT AAAACCCCTA TACCGTTTCA 36 0 

TAAACAACTT GCTTCGGTGT GCTTTCTAAA AATGATATGT AATGATTTAA ATCAATACAA 420 

40 TCGTCCACAA ATATTATTCT GCCTCCATAT CTCGTATTAA CTGGTTTAAT ATCAAATAAT 4 80 

CGATGGTAAC CAATTTTAGC AGCGGCAAAA TAACCTGTCG TATACGTTAA GTCATCGQAC 540 

ACGCAAAGTT CTCCTTTGAC ATACGGATGC GCATTGATAC AACTAGCAAT TGCTAAGGCA 600 

'^^ TCAGTCACTC TTTCATTAAG ATCACCTTTA TTATTTATAT CTTCAAACGA AAAATGTGTT 660 

GCCCTAATCC CCCTTTGTCC AAATGAATCT AAACGTTTAC CAGATATAGC AGATAGAATA 720 

ATAGCTCCTG TATAAACCGT TTCATTTTTA ATATATGTCA TCCCTTGATT TAGCGCTTGT 7 80 

SO 

TCAGTGACAC CACATTCTTG TGTTAAATGT TGGAGATTTG CTTTATCATC CTCAATAATT 84 0 

TGTAATGCTT TTATTTGTTG AATCGGTTCC ATGATTTTTT GTATTTTTAT ATTGAGAAAA 900 

55 
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ATTTTGTCAA. ACTCACAAAT 


' CGTTTCAGCA 


CCACTAATAT GAACATCTTG 


ATTGCTAGAA 


1020 




CGCATTTTTA TACTATACAT 


GACGATCACC 


TcAATCTTCT TGaTGCAAAA 


TTTCAAACAA 


1080 


5 


CCTATCTATA TCTTGTTCAG 


TATGAAAATA 


CGACAATGAT ATTCTTAACA 




1140 




CACAGTtGGA TACCTTAAAT 


AACTTGTAAA 


CACATGATGC TTTAATAATG 


TTTGATGAAT 


1200 


10 


GTTCTCAGCC GCTTCTATGT 


CATCAAACTC 


AATAAACTTA ATCGGCGAGT 


TTGaACTATT 


X2b 0 


ATAATkAACA TTGAGTGCTT 


TTAACTTTTG 


GTTAAAATAT TTACTCAAAC 


xn XXX /v\X X X 


1320 




AGTGCGTCTA TCATCAGCAT 


TTATTAACTT 


TTCAATGTTT CTTlTrATAA 




13 8 0 


15 


ATAAATTGGC AAACTACTTG 


AGTAGATGAG 


TGGTCTACCG TGATTAATTA 




144 0 




CaCATCaTTT GaACTkaAAA 


TcACACCCCC 


GTATGCACCA CATGCTTTAG 


ATAAAr'^Ar'A 


1500 




AGTGAGTATA TCTACACCTT 


GATAATTCGA 


GTAAtTCTCT ATtCCAAAAC 


X^ X \9 X\J£V\J\\^ 


1560 


20 


ATCGAGTATC AGTGTTGCGT 


TAnATTTATG 


CTTTAATGAG ACTAATTGAC 




1620 




AACGTCGCCA TTCGTTGAAA 


ATACACTATC 


AGATATGATT ATTTTTGGTA 


■Xf\X X X X\3J\X X 


16 80 




AGGGTATTTC TCTAACCTTT 


TTTCTAAATC 


AGCAATATCT AAATGCTTAT 


ATATr'Ar^TT 
■r\Xe\X V_r\^ X 1 1 


174 0 


25 


TTCTAAACCA CTTAACTTAA 


TACCGTCAAT 




CZAyCTYZZ. AAA 


1800 




CACGACACAA TTTGTATTTT 


TGAAAATATT 


AAATAACGCC AAATTAGCAT 


PATAAl^CAr"^ 


1 Q ^ 


30 


ATTTAAGATA GTACATGcAC 


TATATCCGAG 


CCAACCTGCT AACATTGTTT 


ym-r%*\x X X\r X x\^ 


T Q T r\ 


ATAAGCTGTC GAACTTCCAC 


TAATTAATCT 


TGAACTTGAT AAGTGATAAC 


TATACTTP Pf^ 


1 o Q r\ 




CATAAATCTT TCGAAATCAT 


CCTTATCAAA 


CGCTATTTGA CCTAATCCTA 


AATAATCATT 

t^*^ X fy^\ X X X 


o rt j4 rs 


35 


AGATGTATAG TTCGTACATC 


TCTTATTTTC 


TACTTCAATA TACTGTCTAT 


CTATATACrr 






TACCGATTTA AGCGACCGAT 


ATAACCCTTT 


CTGTTGTAAT AAATCAATTT 








CTTCATTCTT GTTTTCCTTA 


TTTTCACAAG 


TGTCATAATC AATTTCAAAG 


CCTAAATPAT 


^ o o 


40 


TAATCATATC GTAGTCTAAT 


TGGTTCGGTT 


GCCCACCAGT AATTAGATAA 


TnCACCGACA 


^ ^ fi A 




AATATTGAAT TCGCCGCTTT 


TAATGCTAAT 


GGCTGTAACG AACGTAAGTT 


GACCTCT CTT 

X ^ X ^ X X 






CCTCCAGCAA TACGAATTTC 


TTTCGTAGGA 


TTGATTAATC GGAATAATGC 


X X X w X X 




45 


AAACATTTCA TTGGTGTTAA 


ATCATCCATG 


CTTCCAAACT TTGTGCCTTT 


G ATTVS^* ATY^ r* 
vj^x x\j\jtnxV9\. 


O A A 




AAAAAATTAA TCGGAATACT 


GTCGGCATCC 




CATATCAACA 


2520 


SO 


ATATCTTGAT TAGATTCTCC 


CATACCACAA 


ATCACGCCAG AACATGGTGA 


TATATTATTC 


2580 


GCTTTCATTA GTTCTATCGT 


ATCTGTTCTA 


TCTTTATAAC TATGCX3TTGT 


CACGACGTTA 


2640 




TCATGGTAAT TTTCACTTGT 


ATTAATATTG 


TGGTTATATC TGTCTACACC 


AGCTGACTTA 


2700 



55 
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TGTTGAGATT TAATCGTTCT TACAGTATTA CTAATATGAT CAACTTCTTT ATCGCTCGGT 2820 

CCTCTACCAC TCATAACAAT ACAATATGTT CCAATATGAT TATCATGTGC CACCTTTGCT 2880 

CCATCGATAA nTGTTCCTC TGGAATTAAA GCATATCGCT GTTTTTGTTT AATATCTCGT 2940 

GATTGTCCAC AGTACCCACA ATTTTCAGGA CATATACCAC TTTTAGCATT TAAAATCATG 3 000 

TTTAATTTTA CTTTTTTACC AAAATAATGT TTTCTTAAAA TGTACGCCTC ATTTAATAAA 3060 

TCTAAGGTAT CAATATTAGT ATCCTCATAA ATtTTCAATA CAGTCTCTTT TGTtAATTGT 3120 

tCCCCTTGTA ATATGCGTTT AGCCAAATTC ATATTAACAC TTCCTATCTA AAA 3173 
,5 (2) INFORMATION FOR SEQ ID NO: 33 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
iD) TOPOLOGY: linear 



10 



2S 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 
CGATTATCCA TTAATACAAC CCTAAGTAAA TGTATAAAAA TTATCTTCCA CAAACTTCAA 60 

CAAAAGCCTA AATAAATTAC AGCAATTTAT CAAATATTGC TTACTTTGAT TTTATGAAAT 120 

nACTTAATTC TAACACATAC TAAATCATCA TATACTAATT CGAAATCAAA TGCATTTAGA 180 

GATAATCaAA ATGCGGAAAC ATCTCCcLATA ATCAATAATC TATTCCCAAT AAATATGAAT 24 0 

GTTCTCAACA ATACATTATT TATATCTCTT TACACTGTCA TCGACAAAAA CTAAATCTTT 3 00 

CACTTTCAAT TTCGAACGTG GTTCTACGAC ATTTGCTGCT ATATCATTTA ATGGGATTAA 3 60 

AACAAATGCA CGTTCATTCA TTCTCGGATG TGGCACCGAC AGTTTTGGTA AATCTATCAT 4 20 

TTCTTCTCCA TACAACAAAA TATCCACATC TAAAGTTCTA GGACCCCATC GTTCCTTTCT 4 80 

40 AATACGGTGT AAACATTCTT CTGTCkTCAA ACAACATTCC AACAGTTGTA ATACTGTOAG 54 0 

TGTTGTTTSA ATTTCAACAC ACAAATTTAA AAAGTTAGGT TGCTCAGTAT ACCCAACTGG 600 

TGCTGTTTCA TAAATCGGAG AAATAtTAGA TACGTTAATA CCATCATATT CATTCaAAAT 660 

CTTyATAGCA TCGTTTAACt GGCTTTCTCT ATCACCAaTa TTACTACCTA AyCCTAAGTA 720 

TGCTTGAATC ATyTATTCTC CCTCACTATT TCGATACCTA CTCCATCATA ATGACCCGGA 780 

ATCGGTGGGT TTTCTTTAGT GATTCTCACT TTCGTTTCCA TTACACGATT ATATTGTGAA 84 0 

TTTATACGAT TTGCAATACG TTCAGCTAGA TGCTCAAGTA AATTAACGGC CTTACCTTCC 900 

ATAATTGATT TAACCTCTTc GAACACTTCA CCATAATGAA CTGTATCAAT AACATTATCA 960 

55 



45 
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ATTTCATTTT CAGCTGATAA AGCACCATGA TATCCATAAA AGCGCATACC TTTAAGAAAG 

ATTGTGTCTT GCATTTTCAT TCTCCTTTAA AAAATCTATA CCTTTAGCTA ATTTAGCATT 

5 CAACTCGACA TTATGAACGC GTACTGCTCT AACGCCTTTC ATAATACCAT ATGCAGTCGT 

AGCTGCAGTT ACTTCATCTC TTTCAACCGG TGTTGTATCA TAACCCATCA TCTCTTTAGT 

GAAACGTTTC CGGCTTGTCG CTAATAAAAC TGGATATTCT GTTGCAACAA GTTCATCCAG 

10 

TCTTGCCATA ACTTCGGCTT CTTCATTTCT AGTTTTAGCG AAACCTATAC CTGGATCTAG 

CCAAATTTTA TTTGAAGGTA TACCAGCTAT TTTAGCTTGA TGTGCTTGTG CTAACAAAGA 
TGTTAACATT TCTTCGACAA CCGGTTCATC ACGATTACCA TTTCCATTAT GCATTAAAAT 
AATTTCCGCG TCATAtTTAG CTACAATTTG GAACATACGA TGATCATACA GACCgCCcAT 
tGATCATTAA TCATATCAAC GGCTAATTTT AAACATGCTT CAGCAACCTC ACTTCGAAAT 
20 GTATCGACTG AAATTTTTTA CATCAnAACC GACAATAGCT TCAnCAACAG TAATACTCTG 

TTCCATCTCT TCTG 

(2) INFORMATION FOR SEQ ID NO: 340: 



. <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



30 



45 



SO 



60 
120 
180 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 
AGCATTTCTT TTCTATAAAC ATTTAATTGA ACATTATTAA GTACACTATT ACTATAGTCA 

35 

CTATATTGAA CaCATACCTC ATTTAATTCT AATAGCGGTT CAGATTTGTA CTTATTATCA 
TTATTTGCAG ATGTTTCATC TATCCATTTT TTCACTTTAA ATTTAACATG TTCACTCATA 
40 CAAACGTCAC <3TAAATTCGC TAAGTTATCA ATGGATTCGA CATCTACTTC TCCATATITA 24 0 

AGCGCTGTAC AGTATAATGG TTCACGTATG CCTGCTTCTT TAAGCTTAGA TGATTTTAGC 300 
AAATCACTAG GCGTTGTATT AGCGATGATT TTTCCATCTT TAAAAAGAAG AACTCTATCA 360 
AACGTATCAT CTAATGATTC TTCTAATCGA TGTTCGACAA TAATCATCGT OXSACTTTGTT 
TCTTCATGAA TATTGTTTAA CAATCTCAGC GTTTCATGTC CTGTCGCAGG ATCTAAATTG 
GCCAGTGGCT CATCCAATAT TAAAATAGGC GTACGATGGA TTAATATACC ACCTAATGAA 
ACGCGTTGTT TTTGACCTCC AGATAAATCT TGCGGTCGGT GATTTAAATG TTCTATCATG 
CCAACTTTTT CAGCCCAATA ACTTACATTT TTCTTCATAT CATCTTGTTC AACACAATTA 



420 
480 
540 
600 
660 
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TCTTGTAAAA CTGTACCAAC AACATTAGAT CTATCATGTA AACAACTAAC GGTTGCATCT 7 80 

TGATTATTTA TATATAGTTC CCCAGTTATG TTACCTTTAG TTTTAAATGG AATTAATCCG 84 0 

TTTATGCAAT TTGCAAAAGT CGATTTACCA CTACCCGAAG CACCAACTAC TAATACTTTT 900 

TCTCCTGGAT AAATATCAAC ATTTATATTC TGTAATGTAG GTGTTGCTTG ACTATGATAT 960 

TGAAAACTAA AGTCTTTGAA CGAGATAATT GGTTCAGTCA TGATATATCA TTACCTTTCT 1020 

ATATTCATTT ACATATCTGA TTCAACAAAA TAACTATTCC TTACGTAAAC TACCTTTTTT 1080 

AATTTGAGAT GAaGCATATG CTTTTAATAA TATTGTCCCA ATAATGCCAA CTGAAATAAT 114 0 

ATTTAATACT GCAGAGATAA CACCTTGTX3T ATAAACCTTG TTAGCTGGTT CGTTATAAAT 1200 

CAAAATATCT AATGTTGGTG CAATAAGTGC CCAGCAAATA ATATTCGCAA TAATTTGACC 1260 

GATATTAAAA TAAACCATCG ATTTCCTAGA AAATCGGCCT GAAGAAAGAT TTAATTTTAG 1320 

TCCA-ATCCAG CCATATA.AAC AGCCTATAAT TCCCGAGC 13 58 
(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

TAGAAGAATT GGAGAAAATG CTAATTCAAT TGTCAACATT CCATAGTTAT CATGATTTAG 60 

AGTTTCTATT TGTGACACGT GAAGATGAAG TTGAAACATT GAAATGGGCA CGTTGGTTGC 120 

CACATATGAC ATTGAGAnGG CAAAACATTA GAGGATTTGT TTACAATCAA CGAACGCGTG 180 

ACCAAATTTT AACGTCAATT TATAGCATGA TTAAAGAACG TATCCAAGCT GTGCtGaACG 24 0 

CAGCAGAAGT AATGAGCAAA TTATTTTCAC ACCGCAATTA GTGTTTGTCA TTACAGATAT 3 00 

GTCATTAATT ATTGATCATG TCATTTTAGA ATATGTAAAC CAAO^TTTAT CAGAATATGG 360 

TATTTCATTA AT C TTTGTTG AAGATGtGAT TGaAAGTTTG CCAGAGCATG TAGATACCAT 4 20 

TATTGATATC AAGTCTCGTA CTGAAGGCGA ACTGATTACG AAAGAAAAAG AATTAGTTCA 4 80 

ATTGAAATTT ACACCTGAAA ATATTGrTAA CGTCGATAAA GAATATATCG CGCGACGTTT 54 0 

GGCGAATTTG ATACACGTCG AACATTTGAA AAATGCAATT CCTGATAGTA TTACATTTTT 60 0 

SO 

AGAGATGTAT AACGTGAAAG AAGTAGATCA GCTTGATGTG GTTAATCGAT GGAGACAAAA 66 0 

CGAAACATAC AAAACGATGG CAGTACCTTT AGGTGTAAGA GGTAAAGATG ATATTTTATC 72 0 

55 
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AGGGAAATCT 


GAGATTATCC 


AATCATACAT TTTATCTTTA GCTATTAATT 


TTCACCCn CA 


840 




TGAAGTTGCA 


TTCCTATTGA 


TTGACTATAA AGGTGGGGGT 


ATGGCGAACT 




900 


5 


TTTAGTCCAT 


TTAGTTGGTA 


CGATTACAAA CTTAGATGGC 


GATGAAGCX3A 


1 oLaj X vice iTV 


960 




AACATCAATC 


AAAGCCGAAT 


TGAGAAAACG TCAACGTTTA 


TTCGGAGAGC 


Ai ljA 1 Lj nrAA 


1020 


10 


CCATATTAAT 


CAATACCATA 


AGTTATTTAA AGAAGGTATT 


GCGACAGAAC 


V—HA 1 vj l_A UA 


1080 


TTTATTCATT 


ATTTCCGATG 


AGTTTGCCGA ATTAAAATCA 


GAACAACCTG 


Ail TTATGAA 


1140 




AGAACTTGTA 


TCAACGGCAC 


GTATTGGACG TTCX3TTAGGT 


ATTCATTTAA 


TACTTTGCGAC 


1200 


1$ 


ACAAAAACCA 


TCGGGTGTTG 


TTGaTGACCA AATTTGGTCT 


AACTCTAAAT 


TTAAGTTGGC 


1260 




ATTAAAAGTA 


CAAGATAGAC 


AAGACAGTAA TGAAATTTTA AAAACACCAG 


ATGCAGCAGA 


1320 




CATTACmTTA 


CCaGgTCGTG 


CGTATTTACA AGTTGGTAaT 


AATGAmATTT 


ATGAATTATt 


1380 


20 


CCAATCTGCA 


TGGAGTGGTG 


CAACATATGA CATCGAAGGC 


GATAAATTAG 


AAGTTGAAGA 


1440 




TAAGACGATT 


TACATGATTA 


ATGACTATGG TCAACTTCAA 


GCAATCAAGA 


AAGACTTGAG 


1500 




TGGACTTGAA 


GATGAAGAAA 


CGAAAGAAAA TCAAACTGAG 


TTAGAAGCX5G 


TCATAQATGA 


1560 


2S 


TATCGAATCT 


ATTACAACAC 


GATTAGAAAT CGAAGAAGTT 


AAGCGTCCAT 


GGCTACCACC 


1620 




ATTGCCAGAA 


AATGTATATC 


ArGAAGATTT AGTAGAAACa 


GATTTCAGAA AATTATGGTC 


1680 


30 


AGATGATGCA 


AAAGAAGTGG 


AATTAACATT AGGACTTAAA 


GACGTACCAG 


AAGAACAATA 


1740 


TCAAGGACCG 


ATGGTATTGC 


AATTGAAAAA AGCTGGGCAC 


ATCGCGTTAA 


TCGGAAGTCC 


1800 




AGGATATGGT 


AGAACAACGT 


TCTTACACAA CATTATTTTC 


GATGTTGCAA 


GACACCATCG 


1860 


35 


TCCTGATCAA 


GCACACATGT 


ACTTGTTCGA TTTCGGTACC AATGGTTTGA 


TGCCAGTTAC 


1920 




AGACATACCA 


CATGTCGCTG 


ATTACTTTAC AGTAGATCAA 


GAAGACAAGA 


TTGCTAAGGC 


1980 




GATACGTATA 


TTTAATGATG 


AAATTGATCG TCGTAAGAAG ATTTTAAGTC 


AGTATCGTGT 


2040 


40 


CACTAGTATT 


TCTGAATATC 


GAAAATTAAG TGGTGAAACA ATTCCGCATG 


TCTTTATTCT 


2100 




TATTGATAAC 


TTTGACGCAG 


TAAAAGATTC ACCTTTCCAA GAAGTTTTTG 


AAAATATGAT 


2160 




GATTAAAATG 


ACGCGTGAAG 


GGCTAGCATT AGACATGCAA GTAACCTTAA 


CTGCTTCAAG 


2220 


45 


AGCTAACGCT ATGAAAACAC 


CAATGTACAT TAATATGAAA 


ACGCGTATCG 


CAATGTTTTT 


2280 




ATATGATAAA 


TCAGAGGTGT 


CGAACGTAGT AGGACAGCAA 


AAATTTGCGG 


TTAAAGATGT 


2340 




TGTGGGTCGA 


GCATTGTTAA 


GTAGTGATGA CAACGTATCA 


TTCCATATTG 


GCCAACCATT 


2400 


SO 


TAAACATGAT 


GAGACCAAAT 


CATATAATGA TCAAATTAAT 


GATGAAGTAT 


CGGCGATGAC 


2460 




AGAATTTTAT 


AAAGGTGAAc 


ACCAAATGAT ATtCCTATGA 


TGCCAGATGA 


AATTAAATAT 


2520 
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GGATTAGATT 


ATGAAGGTGT 


TACACTACAA 


AAAATTAAAT 


TAACTGAACC 


AGCAATGATT 


2640 




TCATCAGAAA 


ATCCGAGAGA 


AATTGCGCAT 


ATTGCTGAAA 


TTATGATGAA 


AGAAATTGAC 


2700 


5 


ATATTAAATG 


AAAAATATGC 


GATTTGTATC 


GCAGACTCAA 


GTGGAGAGTT 


TAAAGCTTAT 


2760 




AGGCATCAAG 


TGGCTAACTT 


TGCCGAAGAA 


AGAGAAGACA 


TTAAAGCGAT 


TCATCAACTA 


2820 


10 


ATGATTGAAG 


ACTTAAAGCA 


AAGAGAAATG 


GACGGCCCAT 


TTGAAAAAGA 


TTCACTTTAT 


2880 


ATTATCAATG 


ATTTTAAAAC 


ATTTATTGAT 


TGCACGTATA 


TTCCGGAAGA 


TGATGTTAAA 


2940 




AAGCTTATTA 


CAAAAGGACC 


AGAACTTGGC 


TTGAACATTT 


TATTTGTCGG 


CATTCATAAA 


3000 


IS 


GAATTAAtAG 


ATGCTTATGA 


TAAACAGATT 


GATGTTGCAC 


GTAAAATGAT 


TAACCAATTT 


3060 


AGTATAGGTA 


TTCGTATTTC 


AGACCAACAA 


TTCTTTAAAT 


TTAGATTTAT 


TCAACGAGAA 


3120 




CCTGTTATTA 


AAGAAAATGA 


AGCATATATG 


GTCGCAAACC 


AAGCTTATCA 


AAAGATTAGA 


3180 


20 


TGGTTTAAAT 


AGCAATGAAT 


TAAATAGGAG 


GGAGGTATGT 


TATGAATTTT 


AATGATATTG 


3240 




AAACAATGGT 


TAAGTCGAAA 


TTTAAAGATA 


TTAAAAAGCA 


TGCTGAAGAG 


ATTGCGCATG 


3300 




AAATTGAAGT 


TCGTTCTGGA 


TATTTAAGAA 


AAGCTGAACA 


ATATAAGCGA 


TTAGAATTTA 


3360" 


2S 


ATTTGAGTTT 


TGCACTAGAT 


GATATTGAAA 


GCACAGCAAA 


GGACGTACAA 


ACTGCAAAAT 


3420 




CTAGTGCTAA 


TAAGGACAGT 


GTAACTGTTA 


AGGGAAAGGC 


GCCCAATACG 


TTATATATTG 


3480 




AAAAAAGAAA 


TTTGATGAAA 


CAAAAGCTTG 


AAATGTTGGG 


TGAAGATATC 


GATAAAAATA 


3540 


30 


AAGAATCCCT 


CCAAAAAGCT 


AAGGAAATTG 


CTGGCGAAAA 


GGCAAGTGAA 


TATTTTAATA 


3600 




AAGCAATGAA 


TTAATATTGA 


GGTGAAGATA 


TGGGTGGATA 


TAAAGGTATT 


AAAGCAGATG 


3660 


3S 


GTGGCAAGGT 


TGATCAAGCG 


AAACAATTAG 


CGGCAAAAAC 


AGCTAAAGAT 


ATTGAAGCAT 


3720 


GTCAAAAGCA 


AACGCAACAG 


CTCGCTGAGT 


ATATCGAAGG 


TAGTGATTGG 


GAAGGACAGT 


3780 




TCGCCAATAA 


GGTGAAAGAT 


GTGTTACTCA 


TTATGGCAAA 


GTTTCAAGAA 


GAATTAGTAC 


3840 


40 


AACCGATGGC 


TGACCATCAA 


AAAGCAATTG 


ATAACTTAAG 


TCAAAATCTA 


GCGAAATACG 


3900 




ATACATTATC 


A^TTAAGCAA 


GGGCTTGATA 


GGGTGAACCC 


ATGATGAAAG 


ATGTTAAGCG 


3960 




AATAGATTAT 


rrrixrrrACG 


AAGAATTAAC 


AATTTTAGGT 


GGTAGTAAAT 


TGCCTCTCGT 


4020 


45 


AAATTTTGAA 


TTGTTTGATC 


CATCAAATTT 


TGAAGAAGCT 


AAAGCTGCTT 


TAATTGAAAA 


4080 




GGAATTAGTA 


ACAGAGAATG 


ACAAGTTAAC 


TGATGCAGGT 


TTTAAAGTGG 


CTACATTAGT 


4140 




CAGAGAGTAT 


ATTAGCGCCA 


TTGTAAATAT 


TCGAATTAAT 


GATATGTATT 


TTGCACCATT 


4200 


SO 


TAGCTATGAA 


AAAGATGAAT 


ATATTTTGTT 


AAGCCGGTTT 


AAAAATAATG 


GGTTTCAAAT 


4260 




ACX3AATTATC 


AATAAAGACA 


TTGCATGGTG 


GTCGATTGTA 


CAATCATATC 


CTTTATTGAT 


4320 
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CTTAAATAAT GAAAGTATCG ATACGATTGG GCGTGTTTTA GAAATTXSAAA TATACAATCA 
TCAAGGTGAC CCTCAACAAA GTTTATATAA CATTTATGAA CAAAATGATT TGTTATTCAT 

5 

TCGATACCCA TTAAAAGATA AAGTGCTGAA TGTTCATATT GGTGTCATTA ATACATT 
(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 3 931 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

IS 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 
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TTGAGTGACT 


TTATTGAAGC 


GCGTGTAGAA 


GAAATATTCT 


TCGAAGTATT 


TGATGTTTTA 


60 


CAAGATTTAG 


GATTAACAAA 


AGTAAATGGT 


GGGTTTATTG 


TAACTGGTGG 


ATCTGCAAAC 


120 


TTACTTGGCG 


TAAAAGAATT 


ATTATCAGAT 


ATGGTAAGTG 


AAAAAGTTAG 


AATTCACACG 


180 


CCATCACAAA 


TGGGAATTAG 


AAAACCTGAA 


TTTTCTTCAG 


CAATTTCTAC 


AATTTCTAGT 


24 0 


AGTATCGCTT 


TTGATGAGTT 


ATTAGATTAT 


GTTACAATTA 


ATTATCATGa 


TAATGAAGAA 


300 


AtTTGAAGAAG 


ATGTTATTGA 


TGTGAAAGAC 


AAAGATAACG 


AATCTAAATT 


AGGCGGaTTT 


360 


GaTTGGTTTA 


AACGTAAAAC 


AAACAAAAAA 


GATACTCATG 


aAAATGAAGT 


AGAGTCAACA 


420 


GATGAAGAAA 


TTTATCAATC 


AGAAGATAAT 


CATCAGGAAC 


ATAAACAGAA 


TCATGaACAT 


480 


GTTCAAGACA 


AAGATAAAGA 


TAAAGAAGAA 


AGTAAATTCA 


AAAAACTAAT 


GAAATCTCTA 


540 


TTTGAATGAT 


TATTGGCCAA 


TAAAACTAGG 


AGGAAATTTA 


AATGTTAGAA 


TTTGAACAAG 


600 


GATTTAATCA 


TTTAGCGACT 


TTAAAGGTCA 


TTGGTGTAGG 


TGGTGGCGGT 


AACAACGCCG 


660 


TAAACCGAAT 


GATTGACCAC 


GGAATGAATA 


ATGTTGAATT 


TATCGCTATC 


AACACAGACG 


720 


GTCAAGCTTT 


AAACTTATCT 


AAAGCTGAAT 


CTAAAATCCA AATCGGTGAA 


AAATTAACAC 


780 


GTGGl'lTAGG 


AGCAGGAGCT 


AATCCTGAAA 


TCGGTAAAAA AGCTGCAGAG 


GAATCTCGTG 


840 


AACAAATTGA 


AGATGCAATC 


CAAGGTGCAG 


ACATGGTATT 


TGTTACTTCT 


GGTATGGGTG 


900 


GCGGAACTGG 


TACTGGTGCA 


GCACCAGTCG 


TTGCTAAAAT 


TGCAAAAGAA 


ATGGGCGCAT 


960 


TAACTGTTGG 


TGTTGTAACT 


CGTCCATTTA 


GTTTTGAAGG 


ACGTAAACGT 


CAAACTCAAG 


1020 


CTGCTGCTGG 


AGTAGAAGCT 


ATGAAAGCTG 


CAGTAGATAC 


ATTAATCGTT 


ATACCAAATG 


1080 


ACCGTTTATT 


AGATATCGTT 


GACAAATCTA 


CGCCAATGAT 


GGAAGCATTT 


AAAGAAGCTG 


1140 


ACAACGTGTT 


ACGCCAAGGT 


GTACAAGGTA 


TCTCAGACTT 


AATCGCTGTT 


TCTGGTGAAG 


1200 
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GTATTGGTGT TTCTTCTGGT GAAAATAGAG CGGTAGAAGC TGCTAAAAAA GCAATCTCTT 1320 

CTCCATTACT TGAAACATCT ATCGTTGGTG CACAAGGTGT GCTTATGAAT ATTACTGGTG 1380 

GCGAGTCATT GTCATTATTT GAAGCACAAG AGGCTGCTGA TATTGTCCAA GATGCTGCAG 1440 

ATGAAGACGT TAATATGATT TTCGGTACAG TTATTAATCC TGAATTACAA GATGAGATTG 1500 

TTGTAACAGT TATTGCAACT GGTTTTGATG ACAAACCAAC ATCACATGGT CGTAAATCTG 1560 

GTAGCACTGG ATTCGGAACA AGCGTAAATA CTTCTAGCAA TGCAACTTCT AAAGATGAAT 1620 

CATTCACTTC AAATTCATCA AATGCACAAG CAACTGATAG TGTAAGTGAA AGAACACATA 1680 

CAACTAAAGA AGATGATATT CCTAGCTTCA TTAGAAATAG AGAAGAAAGA CGTTCAAGAA 1740 

GAACAAGACG TTAATCGGTT AATATATATA CACAAATAAT TCAACACAAA TCATCAGATA 1800 

ACATATCTGA TGATTTTTTT ACTAATTTTT AGaACATGTA GAAGGACATT TAAGTTTTTC 1860 

20 aAAGTTATTA AAAGTGTTTA AGTATCGTGT GAAAATTAAG TCaAAAATTA TTTGCGCAAC 1920 

ATTTTAACTT TAAACATAAA TGTTATATTA TATAATTATT AACTTTGTAC AGTTAGACGA 1980 

AGATAATTTA AATGAAATGA TGGTGACGAT CGAGTGAATG ATAATTTTAA AAAGCAACCG 2 04 0 

25 CATCATTTAA TATATGAAGA GTTATTACAA CAAGGTATTA CTCTAGGTAT TACAACTAGA 2100 

GGAGATGGTT TAAGTGACTA TCCTAAAAAT GCTTTTAATA TGGCGAGATA TATTGATGAT 2160 

(TGCCCATATA ATATTACTCA ACATCAATTG CAATTAGCTG AAGAAATTGC GTTTGATAGA 2220 

AAAAATTGGG TGTTTCCCAT TCAAACACAT GAAAATAAAG TCGCTTGTAT TACAAAGGAT 22 80 

GATATAGGCA CAAATATAGA CACTTTAACT GATGCGCTTC ATGGTATTGA TGCGATGTAC 2340 

ACATATGATA GTAATGTCTT ATTAACGATG TGTTATGCAG ACTGTGTACC AGTATATTTT 24 00 

TATAGTACAA AACATCATTT TATTGCATTG GCGCATGCAG GTTGGCGTGG TACCTATACT 24 60 

GAAATTGTAA AAGAAGTGCT AAAACATGTG AACTTTGATT TGAAAGACTT ACATGTCGTT 2520 

ATTGGACCAT CTACATCATC AAGTTATGAA ATTAATGATG ATATTAAAAA TAAATTTGAA 2580 

ACATTGCCAA TTGATAGTGC CAACTATATT GAAACTAGAG GACGAGATCG TCATGGTATT 2640 

GATTTGAAAA AAGCCAATGC TGCATTATTA ATTTATTATG GTGTTCCTAA AGAAAATATT 2700 

45 TATACGACAG CGTATGCTAC ATCTGAACAT TTAGAATTAT TTTTCTCTTA TCGATTAGAA 2760 

AAAGGTCAAA CAGGACGCAT GTTAGCATTC ATTGGTCAAC AGTAAACAAG GAGGAGATAT 2820 

GTTTGCGTGT GAAAGATAAT TTACAACAAA TCTCAACACA AATTAATGAC AAAAGTGAAA 2880 

AAAATAATTT TTCAACAAAA CCAAACGTGA TTGCAGTTAC AAAATATGTT ACAATAGAGC 2940 

GAGCTAAAGA AGCGTATGAG GCTGGAATAA GACATTTTGG TGAGAATAGA TTGGAAGGCT 3000 
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AATCTCGAAA AGTTAAGGAC GTTATAAACG ACGTAGATTA TTTCCATGCT TTAGATCGAT 
TGAGCTTAGC CAAAGAAATT AACAAACGTG CAGAACATAA AATTAAATGT TTCTTGCAAG 
^ TGAACGTTTC GGGAGAAGCT TCTAAACATG GTATTGCTTT AGAAGATGTT QATCAGTTTA 

TAGATGATCT TAAAAAATAT GACAAAATCG AAATIGTAGG TTTAATGACG ATGGCACCAT 33 qq 

TGACAGATGA TGAAGCATAT ATTAGATCGT TATTTAAACA GTTACGTTTG AAAAAAGAAG 3360 

10 

AAATACAACG ACTCAATTTA GAATATGCGC CTTGTGATGA ATTATCAATG GGAATGAGTA 
ATGACTATCT TATTGCAGTT GAAGAAGGTG CGACGTTTGT TAGAATTGGG ACTAAACTTG 
TAGGAGAAGA GGAGTGAGCC ACTTGGCTTT AAAAGATTTA TTTAGTGGAT TTTTTGTAAT 3 54 0 

AGATGATGAA GAGGAAGTAG AAGTACCTGA CAAACAACAA CAGGTAAATG AAGCGCCAGC 3600 
AAAAGAGCAG TCACAACAAA CAACAAAACA AAACGCAATC AAATCAGTCC CTCAAAAATC 
20 TGCATCAAGA TATACAACAA CGTCAGAAGA AAGGAATAAC CGTATGTCTA ATTATTCAAA 

AAATAATTCA CGTAATGTTG TAACTATGAA CAATGCTACA CCAAACAATG CATCACAAGA 
AAGTTCAAAA ATGTGTTTAT TCGAACCACG TGTTTTTTCA GATACACAAG ATATTGCTGA 



3420 
3480 



3660 
3720 
3780 



3840 



3931 



TGAGCTTAAA AACCGCCX3TG CGACACTTGT CAATTTACAA CGTaTTGATA AAGTATCAGC 3 900 

GAAAAGAATT ATTGATTTTT TAAGCGGTAC T 
(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3150 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

AATTGTCGGG GGACTCTTAG GTTTTGTCAT GCAAAGAACA AGATTTTGTT TAACAGGTGG 60 

CTTTCGAGAT ATGTATGTGC AAAAGAATAA TAAGATGTTC TATGCATTAT TAATCGCTAT 120 

TACTATTCAA AGTATAGGAT TATTGATTTT GACGGCAACA GATATTTTAC AAATTCCTGC 180 

45 ACATAGTTTT CCAATATTGG GAACAATTAT AGGTTCTTTT ATTTTTGGAA TTGGAATAGT 24 0 

ATTGGCTGGA GGATGTGCAA CAGGtACTTG GTATCGCGCT GGTGAAGGGC TAATTGGTAG 3 00 

TTGGATTGCA TTAGTATTAT ATGCTGTTAC TGCAGCAATC ACTAAAACAG GGATTTTAAA 3 60 
GCCAGTAATG GATAAAATTA ATCAACCAAC GAATGTAAAT AGTGATATGT CTCAAACAAC 



420 



TGGCATTCCG TTTTGGGGAT TAGTCGTTAT ATTAACTATA ATCACCATTT TTCTAGTTGT 4 80 
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AGGTATTAGA TATTACCTTT TCGAAAAACG ATACCATCCA TTTATTGCAG CAATTGTAAT 600 

TGGACTTATC GCACTCTTAG CTTGGCCAAT GAGTGCATCA ACTGGAAGAA ATGACGGTTT 660 

AGGTATAACA ACGCCTTCAG CAAATTTAGT ACACTTTTTG ATTACAGGTG AAACTAAATT 720 

TATTGATTGG GGTGTCTTTT TAGTTCTAGG AATTTTCATT GGTTCATATA TTGCAGCTAG 780 

AGGATCAAGA GAATTTAAAT GGCGATTGCC AGACAAGATT ACAATACGAA ACAGTGCCAT 84 0 

TGGTGGCATA TGTATGGGAT TTGGTGCGTC AGTTGCTGGT GGTTGTTCTA TCGGTAACGG 900 

TTTGGTTGAA ACGGCAACGA TGACTTGGCA AGGATGGATT GCGCTAgCAT gCGATGATAG 960 

TTGGTGTATG GACAATGAGT CATTTTATCT TTGTTCGTCC AATGAAAAAA GTACACCAAC 1020 

AATCTGCAAA GGTTAAACAG CAAACGCAAA TAGTATAGAA GATTATTATG CAAATGATGT 1080 

TGATCAAATA AAAGTGATTG GAAAAGGAGA AATAATTATG ATACACGAAT TAGGTACAGT 114 0 

AGGAATGGTA TGTCCATTTC CGTTAATTGA AGCGCAAAAG AAAATGGCAA CATTGCAATC 1200 

TGGAGATGAA TTAAAAATTG ATTTTGATTG CACGCAAGCG ACGGAAGCCA TTCCAAATTG 1260 

GGCTGCAGAA AATGGTTATC CTGTAACAAA CTATGAACAA ATTGATAATG CTTCATGGAC 1320 

AATTACAATT CAAAAAGTTT AACGTTATCA TTTTAACAAT AAAATAGATA TTAGATTCTA 13 80 

TGGCTACTTC CGCTAATTTA AAAGTGAGTA AGTAGTCTTT TTTTTTTTAG TTCATGAAAT 1440 

CATTTTTATA TAGTGTGGCA CATTTTATTC CAAAAGATGT AATAAAACTT AACGCATTTT 1500 

TGCTTTTTAT AAATTGTCAG ATTATTATGA AAAAAAGGGA GTGGTAAGTA TGAATCTTAA 1560 

CGATACGATA TTTATGTTTT TGTGTACATT ATTAGTTTGG TTAATGACAC CAGGATTAAG 1620 

TTTATTTTAT GGTGGGTTAG TTCAATCTAA AAATGCGCTT AATACTGTCA TGCAAAGTAT 1680 

GGCAGCAATT GTGCTTGTTA CATTTGTATG GATAACAGTT GGTTTTACAA TTAGTTTTGG 1740 

GAATGGGAAT TTATGGTTCG GAAATTGGGA ATATACTTTT CTTAATCATG TAGGTTTTGC 1800 

GACTCAAGAA GATATTAGCC CACATATTCC TTTCGCTTTG TTTATGTTAT TTCAAATGAT 1860 

GTTTTGTACG ATTGCAATTT CTATTTTATC TGGTTCAATC GCTGAGAAAA TGAAGTTTAT 1920 

TCCTTATTTA TTATTCGTAG TAATATGGAC TGCTCTTGTA TACAGTCCAG TAGCACATTG 1980 

GGTTTGGGGC GGCGGTTGGA TTAACAAACT CGGTGTATTA GATTTCGCTG GAGGTACGGT 204 0 

TGTTCATATT ACATCAGGTG TTTCTGGTTT AGTATTAGCT ATTATGATTG GAAAAGGAAA 2100 

CAAACATTCT GAATCAACAC CACATAATCT TATGArtACG TTGAITUGCG GTATATTCGT 2160 

GTGGATTGGT TGGTATGGAT TTAATGTAGG TAGTGCTTTT ACATTTGATA ATATTGCGAT 2220 

GCTTGCATTT ACAAATACTG TCATTTCAGC CAGTGCAGGT GCTATAGGTT GGTTAATTTT 2280 
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Ai rACTCCTG 


CAGCAGGATA 


TGTAACATAT 


CTTAGTGCAA 


CAATAATGGC 




vjVj i A i uTGTT 


GTTATATTGT 


CATTAATTAC 


ATCAAGGTAA 


AACTAAAATA 




1 iAvvAlOtfCAT 


TTGGTATTCA 


TGGTGTTGGT 


GGTATTATTG 


GTGCTGTTTT 




1 1 CCAAAGTA 


^ ^ ^ m «k ^^^^^ ^ 

AAAAAGCCAA 


TCCTGACATT 


GAGAATGGCT 


TTATTTATAC 




UATATTATAC 


TTGTACAAAT 


ATTATGTGTA 


ACAGCAGTTG 


TAATTTTTAG 


1 A 1 A!L> I LlAX G 


ACGTTTATTA 


TTGCGAAAGT 


AATTAAATTa 


ATTACACCAT 


TATCTGTTAC 




ACGAATATAG 


GATTAGACAA 


GATTGTTCaC 


GGTGAACATG 


CTTACTTTGA 


AGGxGAGCTA 


AATAGATTCA 


ATAAACATAT 


TCGATATTAG 


AATATATTTA 


CATAGAATAT 


TCATTGTCCT 


GACATTTAAC 


TAAAGGTTGA 


TGTTGGGACA 


TTTTGTTATA 


CAAAAGTTTT 


ATTTTGAAAT 


tj'i'-i i-i-iATGA 


AAGAAGCAGA 


AATATTATTT 


AAAGCGGTTA 


CACATATGCT 


AAAATAAGGC 


TAAGTGTCAC 


AAATAATGAT 


AGGTGAATAA 


GTATGAAAAA 


TATATCTGAT 


ATTGCCAAAT 


TGGCAGGCGT 


TTCAAAAAGT 


ACAGTATCTA 


GATTTTTAAA 


TAATGGATCT 


GTCAGTAAAA 


AAACAAGTGA 


AAAATTAACA 


AGAATTATAG 


CAGAACATGA 


CTATCAACCG 


AATCAATTTG 


CTCAAAGTTT 


AAGAGCGAGA 




• 





(2) IKFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 
GTTATAGTGA AATTGACTCA TCACATTTCA CAGACCGTGA CAAACGCGTT ATTAGACGTG 
ATCATGTTAA AGAAGCACAA AGCTTAGTAG AGAACTATAA AGATACACAA AGTGCTGATG 
CTAGGATGAA AGCCAAACAA AAAGTTAACA CATTAAGCAA ACCGCATCAA AACTATTTCA 
ATAAACAAAT TGATAAGGTT TATAATGGAT TACAACGCTA ATCCAAAGTA AATTATAAGT 
TATACATCTC GTTTTTAAAT GACAATTTAT CCCCGTAAAT ATTATAAATA ATCTTTTCAA 
ATTCCACATA GATATAGAGA CACTAATAAA CCTCTTTGTC TCGATATGAT AGTCTGCAAC 
GATTCATGTT GTAGGCTTTT TAATTTTACA AATAAGGCTA AATATATAAG TTCTGGCACC 
TAAAATATAG AAAATACATA AAAGTAAGTA TAGTTATTTT ATTATAATTA TTAAATTTTT 
ATTAATTAAT TGTAAAAATG TATAATTATA ATTAATTAAC GTTTAATATT AAAATTAACT 



.0786519A2_I_> 
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ATCGTTTCAA TATTACTTAT AGGGATGGCT ATCAGTAATG TTTCGAAAGG GCAATACGCA 6 60 

AAGAGGTTTT TCTATTTCGC TACTAGTTGT TTAGTGTTAa CTTTAGTTGT AGTTTCAAGT 720 

CTAAGTAGCT CAGCAAATGC ATCACAAACA GATAATGGCG TAAATAGAAG TGGTTCTGAA 780 

GATCCAACAG TATATAGTGC AcTTnCAACT AAAAAATTAC ATAAAGAACC TGCGACATTA 840 

ATTAAAGCGA TTGATGGTGA TACGGTTAAA TTAATGTACA AAGGTCAACC AATGACATTC 900 

AGACTATTAT TGGTTGATAC ACCTGAAACA AAGCATCCTA AAAAAGGTGT AGAGAAATAT 960 

GGTCCTGAAG CAAGTGCATT TACGAAAAAA ATGGTAGAAA ATGCAAAGAA AATTGAAGTC 102 0 

75 GAGTTTGACA AAGGTCAAAG AACTGATAAA TATGGACGTG GCTTAGCGTA TATTTATGCT 1080 

GATGGAAAAA TGGTAAACGA AGCTTTAGTT CGTCAAGGCT TGGCTAAAGT TGCTTATGTT 114 0 

TATAAACCTA ACAATACACA TGAACAACTT TTAAGAAAAA GTGAAGCACA AGCAAAAAAA 1200 

GAGAAATTAA ATATTTGGAG GGAAGAGAAC GCTGATTGAG GTGAATAATG CTeATTGTAA 1260 

AAGTGTCACT GCTGCTAGTG GCACTTTTAT AATTTTTAGA TCACGATATG ATTTATTATC 1320 

AATTCAGAAT TAAAAAAGTA AATAGTATCA AAAGTAAGTG TATTTAATAT TAGAAAATAA 13 80 

AAATTTTAAA TTTAGTATTA AAATGGAATG TTACTATATA GTTCAATGTG TATTATCACA 144 0 

GAAAATAAAA TAATGCTTTA CTTCTATATT TAAAAGTGTA TAATGAAAGT TAAGTAATAA 1500 

AGAGCGTGAA GAAAAATGTG AGTTATTTAT ATAGAATATT CTCCTTTTCA TTTATGAATT 1560 

TGTTACAAAA TATTTAGTGC AAAAGCACGA cGGAGGTATT CAATATGaAT AACX5GTACAG 1620 

TTAAATGGTT TAATGCAGAA AAAGGTTTTG GTTTCATCGa AAGAGAAGAT GGTAGCGACG 16 80 

35 TATTCgTACA CTTCtCAGCA ATCGCTGAAG ATGGATACAA ATCATTAGAA GAAGGCCAAA 174 0 

AAGTTGAATT CGACATCGTT GAAGGCGACC GTGGCGAGCA AGCTGCAAAC GTAGTTAAAA 1800 

TGTAATTTTA ACTTATTCAA ACAGTCCTTA CTATAGGGCT GTTTTTTTAT GCTTTAAATC 1860 

^0 GATAACAGTT GGTGTGGTAA AAGCACTAGC CGTTATTTTT TTGTCCAATA AATTTAGTTG 1920 

GAGATTTAAC AATATATAAT GGTTCTAAAA TAAATCGAAC TGATGGAAAA GTTTTTTACT 1980 

TTTCATCTGT CCGACTTTTG ATTTTGAATA TAAAAAAGCG CCAATACAGA ACTTTAATAA 2 04 0 

TGACGAGAAT TAAAGTCTGT ATATGGCGAT AACAAGAAGT AATGTTAAAC ACTCAAAATG 2100 

TTTAACAATA ATAGGATACC ACATCGCATA ATATCTTACT ACTTAATTAA TAATTTAACT 2160 

AATCAACTTT TTGTTAATTT TTTATTAAGA CTGATTAATT ATTGAGAATA TTTATTGTTT 2 220 

TTAAAATCTC ATAATAATTC AGTAATCTTG TTTTCATTTA AAAGGCGAAA CATTAAAATA 2 2 80 

ATTAAATAAA AATATTGCGT TTAATTTACA GCGTCAAATA TACTTATTTC TAATGCTTTG 2 34 0 
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ATTTCTGTGy^. 


GTATTTGGAA 


GCTACCATTA 


GGCAACGGTT 


TAACAATAGA 






TCCGCTTGTT 


GTATTAAAAA 


AGGTTTTGTA 


GATTGATTAT 


TAATATGCCA 


TTCACTCATG 


C ^ 


TATOTTTTTC 


ACTCCTGCTT 


TAAAATAGGG 


TTAGAAAGTT 


TATAGTTGAG 


ACATTCATGT 


O c a n 


TCAACCAAAA 


TTTTGTTCGA 


ATTCAATAAA 


TGTCTTGTTT 


AAAATAGAAA 


TATTGTAAAT 


^ c A n 


GTTATCGTCC 


AAAACTTCAC 


CAGTTAAGTA 


TTTGTTTTGA 


ATTAAAATTT 


GGCAGTTAGT 


^ / u u 


TAAGAAGTCT 


TGATAATCAC 


GATCGCAAAA 


ATAGTTTTCA 


CGTG CAT CTT 


TAGCATCGCC 


^ / o u 


AAAAAAGTTA 


GCGACTGTTT 


CTGTTTCTCC 


TTTATTCGAA 


CGTTCAATAT 


ATAATTTGTA 


^ o ^ u 


AAATTTAGCT 


ATTGTATACT 


TTTGTTCTTT 


AGTTAGTTCA 


TTCAAAATAT 




^ o o u 


GAAATATCAT 


TTGTAATCTA 


TACCCAATTT 


ATTGCAAAAC 


AAAAACTAAT 


TTAACTATTT* 




GATGAAACTG 


TGTTAATAAg 


CTTTAACAAG 


CCTTAGTTTG 


TATGGATCTA 


TAAAATTATC 


J w u u 


TTTAATTGCA 


TAGGGTGAAA 


TAATATGTAG 


TCCATAACTT 


TTAACTGATT 




n A n 


ACCAAATTTA 


TAAGCTTGGT 


AGATAATTTT 


AGTACAATAC 


GTAAATTTTT 


TGCTGTTCAA 


O X ^ w 


ATTTAATGTA 


ACTAGATAAC 


GATGATTTGT 


ATTCTCATAG 


TTTTTCTTAA 


CCCATTCAGC 


11 ft o 
J X o 


CGCTTTTTTA 


CCTGCACCAG 


GATAGCTGCA 


ACGATAAACT 


TTCATCCAAT 


CATTTTTGCC 




ACTTGCATAA 


TTATATTTAA 


AAGATTCGAA 


GGATTGTGTA 


GTTGGTTTGT 




n on 

J J u u 


CTCAATTTGT 


AAAATCGTTT 


TATCATCAAT 


CGCGATACTA 


CAATGACCAA 


AAAATCs c CA 


3360 


CATGACAGGG 


CCTTTTGTAA 


CAATTVATATC 


ACCAGGTTGT 


AATTGGAATT 


TGTCATCTTG 


3420 


AATTTCTGAA 


TACTTATTAT 


CTGCAATTGT 


TTTTGGTGAG 


TTTATTGGGG 


ATACGACAAC 


3480 


^ TV TV n^lV TV *T'1V ^ IV 

GAATAATATA 


AGTTIAAATTA 


TCGTTCGTTT 


AATATAGTTC 


ACTTAAT^GC 


TCCTTGTTGA 


3540 


AGAAATATAT 


GTAAATAGTC 


TTAAATTAGA 


ATTGTAATCT 


TTAATAAGCT 


TGtAAGACTA 


3600 


AAACATATCT 


TAAATATTAA 


AGTATGAGAG 


TGTGAAATGT 


CTATTAAGAA 


TnAAAAACAG 


3660 


TCTGAAACAT 


CATTGAGACG 


TTCCAGACTG 


GATATAAAAT 


GAATTTCATT 


TATAGCACA 


3719 



(2) INFORMATION FOR SEQ ID NO: 345: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base pairs 
4S (B) TYPE: nucleic acid 

<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 
TTGCGTTGCC GCACCAAGAT ATTGAATGCC TAGCGATTCC GAGTATGCAA ACTGAACGCA 60 
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CTCTTTAATA CGCGTTATCG CTTTTTGTAA ATCTGCATCA TGATAACAAA TCATAACGCC ' 180 

ATAGCCACCT GCTGTCGGAA GATCATCTCC CGGCTTAATT ACTAACX3GGA ATTCCCAATT 24 0 

CTTAATCTCG TTTTCGAATT GCTCAATTTT TACAACTTTT C TTTTTGGTA AAAACTTCCC 3oo 

ATTTGTCCAT TCAGGTATTC TTGCTTTATT ATTTAAAGCA ACAAATAACG TTTTATCTAA 360 

• TGCATAATAT TGCTGATTCA AGATTGTTTC ATCATGAATA TATTGAAAAT AAATCTTTTT 420 

ATTTTCCTTA TGTGCCAATT GTTTGATCAA GTTTTCGTAA GATTGCTGAT TGTTAAATGT 4 80 

ATAAATTGAG TTCGGtACTT CCTTACCAAT AACTTGAAAT AGCTGATGCA ATTTGTCTGT 54 0 

CGCACTAGCT TCGTGAACAA TAACAGGTAA TTGATTTGCT ATTAATAACT CCCTACCAGT 600 

TAAAAAATTA GATTGATGTT CGTCCGGTTT CAACCATGGA TTCGATATAT ACGAAGGTCT 660 

TGACGTATAG ACAACATCTT TGTCATATAA ATCACTTAAC GTTAAGTTCG GCTCATTACC 72 0 

20 ATTATTTGTC ATTACTTCCC ATTCCCTTTC AAATGCGCAT GCTCTTCAAT AATGTCTTGA 7 80 

TAAACGTCTT GATTTGTAAT TAACTCTAAC CCCATCAACG CCATTATTTT AGCGCCTTTA 84 0 

ATTAATGCTT CATCACCATG TACACTCGCA GCCGCTTCTC TAAATCTATG CGTATGTCCT 900 

ACTAAATTAC GTGATCCTAT TTTAATATGA GGATGTATTG TTGGCACAaC ATgaCTTACG 960 

TTCCCTGTAT CCGTAGAGCC ATAACCAAAA TCATCATCAA TAACTGCTTC ACCAACTTCT 1020 

TCAGCATATT TAGCAAATAA ATCATCTAAT TTCGGCGTTT TAATGAATTC ATTCACACCG 1080 

TTTTGAATTC GACCAAATTC ATAATCACAA CCAGTCTGTA TCGCAGCTCC ACGTGCGATT 114 0 

TGATTTACTT TTTCTGTTAA TATATCCAAT TCTTTACGCG TCATTGCTCT AGTATAAAAA 1200 

35 CGAGCATGTG TATAGTCTGG AATAATATTA GCTGCTTTCC CGCCATCTAA AATCACACCA 1260 

TGCACACGTT GATCTTTTTT AATATGTTGT CGTAGTTGTG CTACACCATT AAAATAACTA 1320 

ATCATAGCOT CTAATGCATT TAACGCTTCA TCTGCATTTT CAGAGGCATG AGCACTTTTT 13 80 

CCGTAAAATT TAACATCTAA AACATCCACT GCCAAAGTAT CAATCGTTTT ATAAGTTTCA 144 0 

TTTCCCGGAT GAATCATTAA GGCAATGTCT ATTTGATCAA TCACACCAGC CTTGACATAA 1500 

GAAGCTTTAG CGCTACCATT TTCCCCACCT TCTTCAGCTG GACATCCAAG AACGACTACT 1560 

TTACCACCAA TTTGGTCAAT CACTTGCTTC AAACCAATTG CACCAAGAAC ACTTGCAGTT 1620 

CCAATGATAT TATGACCACA AGCATGACCC AATCCTGGCA AAGCATCGTA TTCTGC 1676 

so <2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 1294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

5 





TACAGTAGGA 


ATCATAAAAC 


CTAATACAAC 


AAATAGAAAA 


PP ATT^ra a nr; 


CAlAACl AAA 


60 




TGTGTTCCAA 


ATTTGATGGT 


AATTCATTTG 


TAGTTf^CGTT 


TGTGPTr**t*a a 

X X O W X w X AM 


TTa a ft /^fu^T*/^ 
1 lAAACUOlC 


120 


10 


GCGTTCTAAA 


CCATGGATTA 


GACCTG CG AT 


TAP A A r*TGP A 


ATG a T a r*t^T*r^ 


AAvjCATGAAC 


180 




TTCTTCTGCT 


AAAAAGTATA 


CGACAAAAGG 


'Wj X X / X nj\ X 


TGa aTa a a a/^ 


'1 "I'ft TV Z"*/'**!* TV 1 mil 

i IAAvjGTATT 


240 




GTTATCTTTT 


AAAC cm 'AT 


TAGCGGTTAA 


GTPTATAPGT 

Vj X W X/^X/\Ww X 


a 'FTPT* a a r'l^ a 


caacgaatcc 


300 


IS 


AATAATTGCA 


CCAA.TAAGTA 


CACC*TAGTAT 


TGTTGaAATG 


A X AAA X i\a X X 


v-AACAGCTTG 


360 




GAATAATGAA 


AAGGTACr*Ar! 


TT Ar^ A ATGP 




ATTTTAAATG 


7\ TV TV ^n/^ IV rr^ tv 

AAATGATACC 


420 


20 


TGCTGGATCA 




n X X ^ X X ^ 




A i ACjAAwCI 1 


TAGGTAATAA 


480 


TTTTCCGCGT 


GTAATAGCAf; 


ATA r**rGr"ra r* 

^ X tW^ X VJ\. X AW 


J. oV^^l WAV? lA 


^If^ R ^ T\ 7\ TV T\ 

ooACATAAAA 


TTGCTGCTAT 


540 




TGCAAAAGCG 


GCTGGCATTG 


GTAAGGCAGG 


PP A a aTPr»a a 


lAj X A 1 AAAAT 


TV ^ /^/^ TV Tk ^ ^^^^ 

AGCCAACACC 


600 


2S 


GACTACAGTT 


GCAAACAPTA 


GTGPr*ATTY5A 


pa.T*ra aTaGT 

^..AX lAAlAWX 


A A ^^^^^^^^ ^ 

A 1 AV7\j 1 11 AC 


GATATTCTAA 


660 




TAATTTTGTT 


CGAGAGACGT 


X Aw V. X X W 


papaa aa a/^T» 


At» IXaGCvjuGA 


TAACGGCAAA 


720 




CATAAATACT 


TCAGAATTGA 


ATTGGAAATG 


W X X \ J X n X X 


GG a a T* a aiv^ a 

^J^iSAA X AA 1 VjcA 


a ft ft ft ft ft 
AAATAACGAC 


780 


30 


ACCTAATGCA 


ATTTGAATAA 


AGGCAGTAGG 




a a TPG a 'pTa 'p 

AA 1 A 1 1 A 1 


TY^ft^ft TV f^r^^^ TV 

1 \jATAACCGA 


840 




ACTAATAATC 


ACAGCAAAAA 


TAAAAATTAA 


aaa.T'Gpn*' ppt* 

'^n/^xvjwx XV.X 


a ATaGTY^ ppa 

AAX Au Xvj^VmA 


T a o a a Ta r*pr» 
1 ACAA x^ACX C 


900 




CTCAAAATTT 


TAATAGTTAA 


TATTTTATrTA 

X «^ XXX X^\ X 


PTT'ITAAGGP 
w X X X X n/\ wvyw 


aT'a a^PGapaT* 

AX AA XAjAU.~A X 


a ^ a T'ft ^ ft 

Alj A 1 A 1 ATT G 


960 


3S 


ATAAAATGAA 


GTTATTTTCA 

X A 4^ X X X X ^.r^ 


A A A A A A I "^It '"X" 


AGTATPGGTT 
X A X W X X 


G a a r'T^ a T a 


nr» ft ^ ft ^ e^f^ x ft 


1020 




TGTTTAAATT 


ATTGATTGTC 


ATATCTGAAA 


TGACCGCTGT 


CATTTTGTCG 


TTGTTCATAC 


1080 


40 


GCGAGCTTTT 


CAGCATTCGT 


TTTGTATTTT 


TtATAAAAGa 


AAAATAaAAA 


TATnAACCaG 


1140 




AATGGCGAAA 


TATAAATAGC 


TGCTCTTGtT 


TCGTCACTAA 


AGAATAATAA 


AATGAATACA 


1200 




AAGAAGAAGA 


ACGCTAGAAT 


AATGTAAGCA 


ATAGGCTTAC 


CACCAATCAA 


CTTAAATTTA 


1260 


4S 


CTGTTTTTAT 


GTGCCTCAGG 


ATGCTTTTTC 


AAAT 






1294 




(2) INFORMATION FOR SEQ ID NO: 347: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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ACATGATAAT GATGACGCTA TTAAAACACG TTTTTTATTT TTCATTGTTA TAACCTTCTT 60 

TCGTATGATT GATATTTGTT GATATGTATC GACATGTGAA TAATATCACA AAAACAGAGA 120 

ATATATATTT AACTATTTAT TAAATGATTT TGTTAATATT ATTAAATACT TTATCCTCTT 180 

TAAAAATAAT GTGTGTACAA AGTCATTAAT TTAGCAAATA TTTTTATTTA GTAGTTAATA 24 0 

ACCATCGATT TGAAATTTAT ATATAATTAT TAGCTAAATA ATATCCTGCA TCTTTCTCAT 300 

ACAATTTACT ATAAAtTAgC ATATCCGATA TCAGCGTTAA TAAGATCGTT GATACTAGmC 360 

AGTTAATTTC ATAGAACGAA ATCAAATAAC ACACTACTTT CTGCATTTTA AATTATGTTT 420 

,5 AAGAATCAnA ATTATGTTTA nATAAATATA TATACTACTT TGAAAGGTGT GAGCTTAATG 4 80 

ACAACTTTTA GTGAAAAAGA AAAAATTCAA TTACTAGCAG ATATTGTTGA ACTACAAACT 54 0 

GAAAATAATA ATGAAATAGA CGTTTGTAAT TATTTAACAG ATTTATTCGA CAAGTACGAT 600 

ATTAAATCTG AAATTTTGAA AGTTAATGAA CACCGCGCCA ATATCGTTGC AGA/IATCGGT 660 

AACGGCTCAC CTATACTCGC ATTGAGTGGT CATATGGATG TTGTTGATGC AGGAAATCAA 720 

GATAATTGGT CATATCCCCC TTTTCAACTG ACAGAAAAAG ATGGCAAATT ATATGGCCGA 7 80 

GGCACTACAG ATATGAAAGG CGGTTTAATG GCTTTGGTCG TATCTCTAAT CGAATTAAAA 84 0 

GAACAAAATG AATTGCCTCA TGGAACGATT AGATTACTGG CTACTGCTGG CGAAGAGAAA 900 

GAACAAGAAG GTGCCAAATT ATTAGCTGAT AAAGGCTATT TAGACGATGT CGATGGCTTA 960 

ATTATTGCTG AACCAACTGG ATCTGGAATT TATTATGCAC ATAAGGGGTC TATGTCATGT 1020 

AAAGTAACTG CAACTGGTAA AGCTGTCCAT AGCTCAGTTC CATTTATTGG TGACAATGCA 1080 

3S ATTGATACAC TGCTTGAATT TTATAATCTA TTTAAAGAAA AATATTCAGA GCTTAAACAA 114 0 

CAAGATACTA AACATGAATT AGATGTTGCG CCTATGTTCA AATCATTGAT TGGAAAAGAA 1200 

ATTTCTGAAG AGGATGCAAA TTATGCATCT GGTCTTACAG CTGTATGTTC GATTATAAAT 1260 

GGCGGcAAAC AATTTAACTC TQTACCAGAT GAAGCTTCAC TTGAATTTAA CGTAAGACCA 1320 

GTTCCTGAGT ATGATAACGA CTTTATAGAA TCGTTTTTCC AAAATATCAT TAATGATGTG 13 80 

GATAGCAATA AGCTTTCACT CX3ATATTCCA AGCAATCACC GACCTGTAAC AAGCGATAAA 1440 

AATAGCAAAT TAATTACTAC GATTAAAGAT GTAGCTTCTA GTTATGTAGA ACAAGACGAA 1500 

ATATTTGTTT CAGCGCTTGT AGGCGCAACA GATGCCTCTA GTTTCTTAGG AGATAATAAG 1560 

so GACAATGTTG ATTTAGCCAT TTTTGGACCA GGTAATCCAT TAATGGCACA TCAAATCGAT 1620 

GAATATATTG AAAAAGATAT GTATCTGAAA TATATTGATA TTTTTAAAGA GGCTTCCATT 16 80 

CAATATTTAA AAGAAAAATA AGAACGATGC TGTCAGCTGC CCTATTCGCG TGCTGGCAGT 1740 
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TCAAATATCA ACAAGCACAT TTTCATTGAT TAAGTGATGT AAAACTGAAA TTATTGTGCT* 
GATTTGTCAT ACATATATTG ACTAATGGGC ATATAAAAAG ATAGCCTCTA ATAGTnACAT 
^ AAACTCGTAA AAnCC 

(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

CCTTTnCCTA AACAATTTTT AGATTTAGAC PACAPACCGA TTTTAATCCA TACATTAGAA 60 

AnATTTATTT TAATTAATGA TTTTGAAAAA ATTATTATCG CGACGCCACn ACAATGGATG 120 

ACGCATACGA AAGATACACT TAGAAAATTC AAAATTTCTG ATGAAAGAAT TGAAGTCATT 180 

CAAGGTGGTA GCGATCGTAA CGATACAATT ATGAATATCG TTAAACATAT TGAATCAACA 240 

AATGGTATTA ACGATGACGA TGTCATTGTG ACACATGATG CAGTTAGACC ATTTTTAACG 3 00 

CATCGTATTA TTAAAGAAAA TATTCAAGCT GCTTTAGAGT ACGGTGCAGT AGATACAGTG 360 

ATTGATGCTA TAGATACGAT TGTTACATCT AAAGATAATC AAACGATTGA TGCAATTCCA 420 

GtGCGTAATG AAATGTACCA AGGTCAAACA CCTCAATCGT TTAATATTaA TTTATTAAAA 4 80 

GAaAGCTATG CACAGTTGAG TGATGAGCAA AAGAGTATTT TATCTGATGC TTGTAAGATT 54 0 

35 ATTGTAGAAA CAAACAAACC GGTTCGACTT GTAAAAGGTG AGTTATATAA CATTAAAGTA 600 

ACAACACCTT ACGATTTAAA AGTAGCGAAT GCTATTATTC GAGGTGGTAT TGCCGATGAT 660 

TAATCAAGTA TATCAATTAG TTGCACCTAG ACAATTTGAA GTTACGTATA ACAACGTAGA 72 0 

TATTTACAGT GACTATGTCA TTGTACGTCC TTTATATATG TCAATTTGTG CTGCCGATCA 780 

AAGATAXTAT ACTGGTAGCC GTGATGAGAA TGTCTTATCT CAGAAATTGC CAATGTCTTT 84 0 

AATTCATGAA GGTGTTGGTG AGGTCGTATT TGACAGTAAA GGTGTGTTTA ATAAAGGTAC 900 

AAAAGTAGTT ATGGTACCGA ATACGCCGAC AGAAAAAGAC GATGTCATTG CTGAAAACTA 96 0 

TTTAAAATCG AGCTACTTCA GATCAAGTGG ACATGATGGG TTTATGCAAG ATTTTGTGTT 1020 

so GCTAAATCAT GATAGAGCTG TACCACTACC TGATGATATT GATTTAAGTA TTATTTCATA 10 BO 

TACAGAGCTT GTAACAGTAA GTTTGCATGC TATTCGTCGT TTTGAAAAGA AATCTATTTC 114 0 

AAATAaAAAT ACATTTGGTA TTTGGGGTGA TGGTAACTTA ggTTACATTA CAGCCATTTT 1200 
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GAGTCACTTC TCATTTGTTG ATGATGTCTT CTTTATTAAT AAAATACCTG AAGGCTTAAC 13 20 

ATTTGATCAT GCATTTGAGT GTGTGGGTGG T I351 
(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

TCATCAAGTC TACGATAAAT TAAGTCCATA TCTAAAGGCT CGGGGTCGAC AGTTTGTAAA 60 

GTATAACCAA CTGCACAGTG GCTACAACGC ATATTACAAA GATTTGTAGT TGTAAATTCG 120 

20 

ATGTTACTTA AAGTTAATTG GCCATGTTCT TTAACATCGT TATATGCTTC CCATGGGTCG 180 

TTTTGAATAC TTATTTTAGG CTTGTTATTA CGCATTTTAT* AAACTCCTTA ATTGTTATTT 240 

GATACCAATT TGATACCGTT TAATCAAATA TGCTCATAGC TTGATGTTTT TTATCAGTAT 300 
25 • 

ATAAATGAGA GTACGTTTGA ATTGTTTCTG TAATGTTAGA ATGCCTCATT AATTCCATTA 360 
ATAAATACAT ATCTACACCA TTATTAATTA AaTAGCTAGC GTACGAGTGT G 411 
30 (2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

40 

TCATTTTCAT AGGTTATTAC GCAGATCAGC ATAATAATCC ATTCCATATG AGTcCTTATT 60 

TTGGTTATGC AGCACGTCTA TTGGCAACAA GTGGCATTGA CTATACOTAT GTAAGAATGG 120 

45 CAATGTACAT GGATCCACTT AAACCATATT TACCAGAATT GATGnATATG CATAAACTGA 180 

TTTATCCnGC TGGCGATGGT CGTATTAATT ATATTACTAG AAATGATATT GCTAGAGGTG 24 0 

TCATTGCTAT TATTAAAAAT CCAGATACTT GGGGCAAACG CTACTTATTA TCAGGCTACA 3 00 

GTTATGATAT GAAAGAACTT GCTGCAATTT TATCTGAGGC ATCaGGCACA GAAATTAAAT 3 60 

ATGAGCCCGT TTCATTAGAG ACATTTGCAG AAATGTATGA TGAACCTAAA GGCTTTGGTG 420 

CATTATTGGC ATCAATGTAC GACGCAGGAG CAAGAGGACT ATTAGACCAA GAaTCCAATk 4 80 
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TTAATAATAA AGGAGCGTTA TAGTGAATAT CATCTCAACA ATTtTAATCA TATTTGTGGc' 600 

ATTAGAGTTT TTCTATATTA TGTACCTTGA AACGATTGCT ACAACTTCCA AAAAGACTAG 660 

CGAGACATTT AATATAAGCG TCGATAAATT GAAAGACAAA AATATTAACC TACTTTTGAA 720 

GAACCAAGGC GTATATAACG GTTTAATCGG AGTTTTGCTA ATATACGGTT TGTTTATCAG 780 

CAGTAATCCA AAAGAAATAT GCGCAGCTAT TTTAGTGTAT ATCATTGGCG TTGCTATTTA 840 

TGGTGGCCTT TCAAGCAATA TTAGTATCTT TTTCAAACAA GGCACATTGC CAGTATTGGC 900 

ACTCATATCA ATGCTTTGGT AAGTATTGGT GTTTGGGGGG GTGGAGATGT AGTCGGAGGT 960 

,5 TTGGAGGATT TGAGCGAATT GTGTGTGGAC TTTAGACTCA GAGTATTTCA TCCTAATTAT 1020 

TTCAAGCAGA GGTGACAGTA GCGTTGCCTC TGTTTCCTTA TAAAAAAATT ATTTAATGAA 1080 

GAAAACCCAT ATCTGATTTA ATTTTCAGCT GATAAATACT CCATATATTA GAATGGCTAC 114 0 

TTTATCTATT GCATCAATCC TTTAAAACAA AAAACCCATG ATTTCGAAAT TCCCGTATGA 1200 

TGGGGTTCCT ACTCTCATGG ATCAGTTAAA TAAATATTAT CACTATCAGT TTATTATTTC 1260 

AATATTATTA ACAATATATG TAGTCGTAAA AGGAAAGAGG ACATGAGAAC TTCGGTGTTG 1320 

ATTGGCATTA CATAACGCTT CCAAACATAT TATTTGGTAA CAATAAGAAA CTATTTACAC 13 80 

AATATATTTT GTATAGTAAA ATTATTTTAT AATATTTAAA TCCAATTGCA CAAGGAGTGA 144 0 

TTATCaTGGT ACCAGAAGAA AAAGGTTCTA TTACTTTGTC AAAAGAAGCA GCTATCATAT 1500 

TTGCAATCGC AAAATTCAAA CCATTTAAGA ACAGAATTAA AAATAACCCA CAAAAAACAA 1560 

ATCCATTTCT TAAATTACAT GAAAACaAAA AATCTTAATC ACTTTtATTT ATAGcATTTC 1620 

35 TAATCTCAGA AATGCTATA 163 9 

(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: IB 16 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3S1: 

AAAATCGCAT ATAGTAATAT GAATAACCAG ATTGTATCTA CAAAAAAGTA TATTGAAAAA 60 

50 CCAAGCGCAC CCATTAATAA TGCGAGAATA ATAATAATTT TTCTATTAAA GTGATGCGTA 120 

TCCGAAAATC TAGCAATAAT TGAATTTACT GTAAACTGGC TAATCGCTGC AGATGCTAGA 180 

AGTAATCCAT ACTGATTTGT TGTCATACCT AAATCTTTAG TTGCAAAAAG AACAAGATAT 24 0 
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TTCTyTATTT 


GTAaTAACGC 


TGCAAACATA 


TCCATAACCC 


CGCTTCTTAG 


AGCCCCTTTT 


360 




AATTnATnAA 


TTAGGGGCTC 


TTATGCAGTT 


GGTGCATTAG 


CAACCAACTG 


TATTCCTTTG 


420 


5 


TCCCl-ri i~i"i. A 


ATTTATTaAT 


TAGGCjGC i I- i 




GTGCATTAGC 


AACCAACTAC 


480 




GTTCAATTTA 


ACCGAATAGT 


TTAAAATTAA 


ATACAAACCT 


TAAATTAGTC 


TAAAACTACG 


540 


10 


CCTTTGGTTG 


TTCAACAAAG 


CTCGCCATGA 


GATTTACAAA 


AGAATCAACT 


TGTGGCAATT 


600 


GCAACATGCT 


CGGATCATAA 


ctcataaatg 


TCGAACGAAT 


CAGCGGTTCA 
CATCATAATT 


TTATCAATTT 
TCTGGCAAGA 


660 
720 




CTACnTTTC 


AAACTCAAAT 


TGTTCTTTGC 


TGATATTTTT 


IS 


TTGTAACACC 


TACACCACTA 


atcaacattt 


CTTTGCAAGT 


TGCTACTTGA 


TCCACTGTAA 


780 




TAGTTGCATG 


GTAATCTTGT 


TCTAAATTAT 


CGTTATACCA 


TTCTTTTATT 


TGATTTATAT 


840 




AAATCGGATC 


AGCTTGAAAC 


tctataaatg 


GTAACTTTGT 


AACATCATCT 


CGTCTATTTT 


900 


20 


TTGGAAAAAT 


AAAATAATGA 


TCATCATTAA 


ATAAATGTGT 


GTTAGCTAAA 


TTCATTACCT 


360 




TATTTCCACG 


AGTTATCATA 


acatgataat 


CTCTATGATT 


TGCTTTAATT 


TGTTCAGTTG 


1020 




AACCAACTTG 


CACTTGTATT 


TCAACATTAG 


GAAATTGGGC 


ATTATATAGG 


CTCAAAACTT 


1080 


CAGGAAGTAA 


GGTTTGTCCA 


ATCAAAGAAG 


AACACCCGAT 


TGATATTGTT 


CCATTCACTT 


1140 




CACCAATATG 


TGCCTGCATT 


TTGTCAAAAA 


ATAATCGCTC 


TCTTTTCAAC 


ATGTCACGAG 


1200 


30 


CATGCTCAAT 


AATCATTGTT 


CCTTCAGTTG 


TTGTAATCAA 


TTGTTTTTTT 


GTTCTGATAA 


1260 




AAATATCTAC 


TCCAAAAGCA 


TTTTCAATAG 


CmTAGTCT 


TTGTGTAACA 


GCAGGTTGAG 


1320 




ATATATATAA 


AATTTCAGCC 


GCTTTACGTA 


ACGTTTTCGT 


TTCGTCTAAT 




1380 


35 


AACGATAGTC 


TTCAATCTTC 


ATAATTTCCC 


CCCATAAATT 


ATTCAATTAT 


TGAACTTTCA 


1440 




TGGCTACAAG 


CATTCATGAG 


TTCATTACTA 


ACGAATAATT 


TCACCAATTT 


TATTGGTATG 


1500 


40 


GCTGCAGCTT 


GAATTACTTA GTTTTTCTTT 


TGTTGTTGGT 


GATTTTTAGT 
TATTTTGTGG 


TTGATTATAT 


1560 
1620 




TGCTTAGGCT 


TTATTTGTTT 


GcrrrrrixjA 


ATATTAGTTT 


CTTTTGATGA 




TTniTTGAG 


CCTTTGCATT 


AATTTTATTA 


AAGCAGTACA 


TGATTTTCTT 


TTGGAATCCT 


1680 


45 


TTAAAATCAT 


TTTCTAACTC 


TGCCATAATT 


TGATGTGCAA 


. TCATATATGC 


TTCATGAAAT 


1740 




TGCTTTTTTG 


TAATTTGCTC 


ACTTTCTAAT 


GCAAACATTA AATCATCTTC 


; ATCTACCAAC 


1800 




TCATaTcACC 


ACTTGG 










1816 


SO 


(2) INFORMATION FOR SEQ ID NO: 352: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

5 GCGTTGTCGT CGATGATTAA TAAAGTATGG GTATACCATT AAGAATAACG CTACCCAAAT go 

gAkTGCTAGT GACGTGCCGC CAATGACATC TGAAAAGTAA TGTGCATGAA AATAAAGGCG 120 

ACAAAATAAT ATGCTAAGCC ATAATATTCC CATAACCAAC GCACTCAACA CTTTTGTTAT 180 

10 TGTCTTAGCA GCAAGTGAAA TAATAATGAT CATTAAGGCG AAATATAATA ATGTGCTGGC 24 0 

GTTGGAATGT CCACTCGGAA ATGAAAAGCC TGTATCAACG GCTAAATGAT TATATGGTCT 300 

TGGACGTAAT ACAGTATCTT TAATTAATTT GTTCATGATG ACACCTGAAA CCAAATATGT 360 

CACAAACCAA ACCGCTAAAT GCCTCTGTTT AATAAACAGT ATGATTGTGA CGATAATGGA 420 

AATCAACACG ACACCTTTGA CATCTCCAAT TTCCGCACTA AACGTCATAT AGTAATTAAA 4 80 

CAAATTGTTA ACATACTGAC GTTGTGGCTC ACCGAAATAA TCTGTAAACC ATGTTAATGA 540 

TCCCATATCT ATATTTTTAA GCCATTCTTG ATTTGTCACT ACACTGTAAA ACATACCTAT 600 

AAATACAATC AGCGCGATTA AAAATAAAGG CACTGTcATT TTCGGTGATG TTAATTTTTT 660 

25 ATCTATCATC TTACAATCTC CTCGTATCAT CATTTTCATT TTACAAi^TGT TATCCATAAT 720 

ATCAATGTGC CACAAATTTC ACTTTACCGA CAATATCAAA ATTATAAAGT TCATATTGTT 780 

ATGTATATTG CAAATAAAAC ATTGTATAAT TGAAATAACA ATATTTTGCT ATTTTCAATT 840 

TAGTACGATT TATATTTATT ATACAGAGGG GGTAAGGCGT ATCAATAGAG TTATTTTTGT 900 

CTATATAGCG TTAATCATTA CATTAGTTAG ATTCTTTACC CCTATTCATC CATCATTTTC 960 

AAATTTGATT TACTGGATAT TTGTATTATA TTTTATTCCT ATTATACTAT GCGTTATCGG 1020 

TTTCAAGGCC GAAAAACTTA TTGCAACAAT GGTCATTATA CCTAATTTTT TAGGAATACT 1080 

TTATCGATTA TATGCCTACG TCACACATAT ACTCTTTATG TAAAAGGATA GTGGGCATGT 1140 

^ CTCGCAACAA ATATGCTTGC GTCGACTTGT CACCATTCGC AAATTTTATG ACATACGCCT 1200 

TTCACGGGCT ATATTCAGAC CCACGCATTC ATCCACGTAA TAAACACATC ATGTAATAGA 1260 

AAAACAGCAC ACCCAAATAT ATGGCGTTGC GCTGTTTAAC CAAGCATACT TCTATAGCTT 1320 

TAATAAGCCA GCAGAAGCAT ACCTAACCTT CTTAAATATG CTTTTCCAAA TTATCCTCAA 1380 

GTTTGAATAC GATAATACGT TCACCTGTAA CTGTACTTAA ATCACTATGG AAGCTCATCA 144 0 

CTTTGATACC TGTAATTTTA AAAATGATAT CATTCAAATC TTGCTCACCG GATTCAACTA 15 CO 

SO 

ATTCAGAACG TGTTCGTTTA ATATTTAATA ATCCTTCATT CGTACTACAT ACACGATATT 1560 

CAGCTGGCGT TAAGATACCT TGTAAACTAA TAATCACCAT ATCTCTTAAA ATGTCTGATT 1620 

55 



30 



35 



BNSOOCIO: <£P 0786519A2J_> 



1232 



EP0 786 519 A2 



10 



IS 



20 



25 



30 



35 



40 



45 



SO 



TTTCAGCTTC GATTTCACCT TTCGTTCTTT TCATATCACT AACTCCAATA ATATTAAAAT 1740 

TGATTACTTC ATCTTTGTAT CGTTATCACG ACATCAACTA TTACATTAAG TTTATCATTT 1800 

TTAGTATATT TTAAGAAGCT AGAACATTGT AGATATGATG ATATATTAGT TACTTAGCAT i860 

CGCAACATAT CATCGTTAAA TCCAACriTT AAAACX5CCCT TCCTCATTAA CGCTCATTAA 1920 

ACGCAGCCAA TGATTAGACA CCTTCCTAGC GAAATGCTCA TTATTCGCGA GTAGTCTTGC 1980 

TACAACATAG TCX3GGTGCCT GAATAACGAC AAGTAAACGA ATTGGCGAAT GATACATCGT 2040 

CCX5ATCAGCA GCCATAACAG ATTGCCATGA TAAGCCATAC ATCAGATCAC TCGCATTACC 2100 

TTGCATGACA CCAACACCTG ACGTGACGGT TTGTGTCGCT TTATTCCCAC TTCCGTAAAA 2160 

ATGCGGCGCA ACTGTCGACG CATAATATTG TAAATTAATC CATTGTGCCA CAAGTGCCGG 2220 

ACCAGAAATG ATGGTATTTA ATAATGTGCC ATCTTTATCT TTACGCCAAT CATAATTGTG 2280 

TAAAAATGTC CGCCCTTCTA AATCAATGCC TTTTGTTAAT TGGCGTCGTC CAATTATAAA 234 0 

TGATGCATTT TTAGCCAATC CCCATTCTGG ACGTACCTCA CTCCAATCAC TCGCAAACCG 2400 

CTGCGCTTCT TCCACTGGAT GATTCACACG ACCAATCGTT GGCAGTTTGT CCAAACGTTC 2460 

GCGATTCGcg TGtTCAGAAA TCATCGGCAT CGCGTCATTC AATGATTCAT ATGCATCTAA ,2520 

AGCAATAGAA GATAATGTGT CTGGCACATA TACCCATGCC AACGTATCAG TAGACGTATG 2580 

ATGTTCTGCT ACCGCAAAAA CAGTTGTCTC TGGAATATAC ACACCTGATT GTTTTAATCC 264 0 

TTGTCTGACA TTTGGACGAT TACATATCAT CGCTAATAAC TTAGCATTAA AACCGCTTGA 2 700 

TGCGCCACCA CAAGCCCCAC ATTCAAGTGA TGCATGATGT GGATTATTGT GAGAATGACT 2760 

AGCATGACCT GCTAACACAA CGAACGGCGC AAATGCTTCX3 GTTAAATCCA TCAATTTCAA 2820 

CGCTTGTAAC GCXyUVATCAA TTTGCTCTTG CTCAGTAAAT CCAACAGGTA AGTCTGATGT 2880 

TCGGTCAAAC TCACGATCAA TCGTCAACTT TGTTTCAGGC TTTTTCAACC ACTTTTGTTT 294 0 

TATTTTTTGT AAAGACGCGC GACTTTTTCT AGGCATAATC GAATTGACAA TGGTACTTAA 3000 

GCTTAAAAAT GGCCCACTTA ATTCAGGCAA TAACAGACTA GGCATGACAT TATTTTTCAT 3060 

CAATTTAAAT GTGTAAAACA TCGATGACAT TGTCTGTTGC TGTTGTCGAT AAACATTCAT 312 0 

ATCXSTAGCGG TCTGCAAATT CTTTAATGCG ATATGCCGGC GGTACCATGA CAGGTAATGA 3180 

ATCATGTTTG AATTGTTCGT CTACGGCATC TTTTTGAATA GGTAATCCAA AGAAGCCTGC 3 24 0 

AATACCAATC GTTTCAAAGG GCCCTGCTGC TTCGATATGT CTACGAAATG GTTCTGAACG 3 3 00 

AACATCTATA CAAAATGCAA TTTGCGCTTT CGTTGATGTG CCCATCTGAT TTAGCTCGCT 3 3 60 

ATTATTTTCA TCAACTGCTT GTGTGTCATT TAACAATACT GAATGTGGCT GATTAGCGTT 34 20 
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TGCTTTAATT 


TTTTGTTTTA ACTGAGATTC 


GTATGTCATT 


TCCCAGGCAA 


TTAGCCATAA ~ 


3540 






ACATTTITAT 


TCATAGTTGC 


TGCAAAATGA 


ATAAACGTTT 


GAATTTCATT 


3600 


5 


GACGTCATGT 


TGTAGTAATA 


CATCGCTAGG 


CATATCACTG 


TAGTAACACC 


ATGATGCAAC 


3660 




AGTTTGCTTA 


AACCAATTTT 


CCGATCTACT 


TTCACAATCT 


TTAGCGACTG 


ACTTAAACTC 


3720 


10 


ATCACCAACT 


AGCAATTGTT 


CGACAACTAA 


CCGAATTGCC 


AAATAATCCG 


TTAACAAATG 


3780 


TTGTTCAAAG 


TGATGCTGTT 


GTGAACGGTA 


ATACAACATA 


CCTGCCCAAC 


CCGGTAACGC 


3840 




CAAAAGATGT 


CCTTCAACAT 


AAGCTTGGTA 


GTCTTCCTGA 


TCTATTGAAA 


AATGAGTTAA 


3900 


IS 


TACTGACTCT 


ATCGTCATTT 


CAGGATCATT 


GGGTAAGCCT 


TTAATCACTT 


GGCGCTGTGC 


3960 




TTTAGTAAAA 


CTATGGTCAT 


GTTGCX3CTAA 


ATGCAACCAT 


GCATGGTAAA 


AACTTTGcTC 


4020 




ACGCTTCGGC 


ATTGTCCAAC 


TCGATAGAAA 


TTGATCGATA 


TAAAGTTTCG 


TCCATTTAAT 


4080 


20 


CATTTGACGA 


TTCACTTGTT 


CGCTAAGTGG 


CTCACCTTGT 


TCATCTATTA 


TTGCATCACT 


4140 




CATCGGACGT 


ACATCATAGT 


GATGATATGA 


TTCAGCCATA 


TCACGTTTTG 


ATTTTTCTAA 


4200 




TAGTAGATCA 


GCAACAACAT 


CAACATTTGA 


ATGATTCATA 


TATGATGCAG 


GTACGTCTTT 


4260 


TAATGTTTTA 


ATGTTATCAA 


TATAAAGATT 


GATGTAGTGT 


TGCGGGATAT 


TGTAGTGATG 


4320 




TTCAAGTAAC 


ATATCAGTAA 


CAAGTTGATT 


AAAGACACTT 


TCATCTAATT 


CACCACGTGC 


4380 


30 


CACAGCGCTT 


TCTATTAATG 


CTTTATTTGG 


GAAAATATCC 


ACATCTCGAA 


CATCACGTAA 


4440 




CCATTTTGCG 


ACATCTTCAA 


ACGTATCCGC 


TTCTAATCCT 


TCCCATGGAT 


TTCGTGCTGC 


4500 




AAAAATCGAA 


ATTGGTGATA 


ATGGTGTAAT 


AACACGTTTC 


GCATTTTCAA 


TGACTGAATT 


4560 


3S 


GATATTTAAC 


TGTGTTGTCA 


TACCTTTCAC 


CTCCTATAAA 


TACTTCTTCA 


AATAATTCGG 


4620 




ATGACTTTCT 


ATCGCTTTCG 


AGCGTGCTTC 


ACCTAGATTA 


ACTAACCACA 


CGTACAATAC 


4680 




CGCAAAAGCC 


TTAGAGTATC 


GATGCCGCGC 


CACCCAAATA 


CTTAATAAAC 


TGCCAAAGAT 


4740 


40 


TAAAATAACA 


ACACTAATGA 


TGACACTCAC 


TGTAGGCX3GC 


GTTGTCGCAT 


GTGTTGpTTAT 


4800 




ATTTTGTAAT 


ACAGCGTAAA 


AATAATTATG 


TGTGATGACG 


TAGATAAATG 


TCACGATTGC 


4860 


45 


AATCAAAATC 


ATACCAACAA 


GACGTGCCAT 


GCGTCCTTTA 


CTAAAGGCTA 


CCATTTGATT 


4920 




CCAAGATACA 


AGTAATGACC 


ATCCTAGAAT 


GAGTGCACTT 


AACACTTCAT 


ATGCACTTCT 


4980 




GTCACTACTC 


ATCCAAAATA 


GAAATGCCAC 


GATAATAGCT 


AATACACGTC 


CCATGACAAT 


5040 


SO 




GCGTCTTTAG 


CAGATGCTTG 


ill IvjC^AATA 


TTGAATCGCT 


TCACGATAGA 


5100 




ACCTGATTGT 


AAAAATAATG 


TTGCTTTAAA 


AATACCGTGC 


AATATTAAAT 


GAATAATCGC 


5160 


55 


TGCTGAATAT 


ACACCCAATG 


CACATTGAAC 


TAACATAAAG 


CCCATTTGAC 


TCATCGTAGA 


5220 
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AGAAATACTA GAAAGGATAA GTAATAATGA TAACGCAAAT CCATTATCAA ATATCGGCGC ~ 5340 

AAAACGAGTT AGAATAACAC CACCTGCATT CACAATTCCT GCATGCATAA TTGCCGATAC 5400 

TGGCGTTGGT GCCGTTACAG ATTCAATCAA CCATCGATGA AAAGGAAATT GTGCTGCCGG 5460 

TATCATGACA GCTAATACAA GTAGTACATT CGTCAACAAT GACCATGTCG GATGAACTAT 5520 

ATGTTGTGG7 ACCCGCCACT CGCCAGTCGC AATATAAATA GTTACAATTG CTCCAACGAA 5580 

TGCAAGCCAA CCACATAAAA ATGTCATGCT TGATAATTTC GCAGACTCAC GTGGCACTTT 5640 

CCAAAAACGA TTAACGTTCA TCAGCAATGT TAAACATAAT AATGTAATAC CCCAGCAGAG 57 00 

TGCCATCAGT CTTAAGTCTT CAGACATCCA TGCTAAAGAT GCAAACGACX5 TAATCGCAGT 5760 

GAACAATGGA AAGTAATGTC TATAATGATG ATCACCTAGT AAATATCGCA TTGAAAACTT 5820 

TTGAATAATA AAGCCAAGCG CCATTACAAA GCCAGCTAAT AACCAAGATA AACGATCTAT 5880 

20 TTTAAATGGA CCTAAGACAT GTTGACCATG AATACCGAAA AAGCCAATGA CTGCAAATAA 5940 

TACTGGCATG ACTAGTATGT ATAAATGTAA TTTAATATAT CTCATTGGCA TAACTGGTGC 6 00 0 

TAAAAACAAC AAGCCACTTA TCAATGCAAT GATAAGCGCA ATAACAAACA GTGAAAATAG 6060 

CAATTGAAAA CTTAACACTG CATAACCTCC TTATTTCTAA TCTCTCGCAT AATTGCTTAT 6120 

GTATAAAAAT AAAAACCTAC AATAGTAGAT TCTGTACATA ATGGCAGAAA ATTTACTATT 6180 

GCAGGTTTCA GTTTAACTAG ACACTGCATC ACGGTACGTT GATATACCTT GTTGCAGTGT 6240 

TCTCTTTAAG CGTGCTCCCA TGCACATATG TATATAAAAT GTTACTTCTG TCTGTTCAAT 63 00 

TCATCTTCAT AAATATGCTT TGCCTAGACG AGACCTAACG TGTTATTCGT TTTAAACTTA 63 60 

35 TAACATAAAfV TATAATTAAA TTTCTGCTTC ATGTCAAATT CATGAGCTTA ACCTCTATTA 6420 

AACCAATGAT TGTAAAGATT TTGTAAATGC ACCTGTACAG TTAGGCAGTA TTTCCCGTCC 6480 

TTTTAAAATA AAAAATTCGC AGTTATGATC ATAACAATTC AAGTTAGGAA AAAAATCAAT 654 0 

TACGCACAAG ATAACTATGT ACAATGAAGT TAACTCATAA GCAAAGGAGG TAATCTTAAT 6600 

GGGTATCATC GCTGGCATCA TTAAAGTTAT CAAAAGCTTA ATCGAACAAT TCACTGGTAA 6660 

ATAAGATTTC ATAACAAACA AAGGAGGTCT TTCACATGGG TATCATTGCA GGAATCATTA 6720 

AATTCATTAA AGGATTAATT GAGAAATTCA CTGGTAAGTA AGTTATAAAA ATCTCATAGA 6780 

TATGAACATC TTATTTGAAG GGGGCCATTC ACATGGAATT CGTAGCAAAA TTATTCAAAT 684 0 

TCTTTAAAGA TTTACTTGGT AAATTTTTAG GTAACAACTA ATCTCAAACA TTAACGATCA 6900 

ACAACTCATC ACTATGTTAA ATCAACATAC AGGAGGACAA AACGATGGCT ATTGTAGGTA 696 0 

CTATCATTAA AATCATCAAA GCAATTATCG ACATTTTCGC AAAATAATTT AAGCGAATTG 7020 
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TTATTGATGT 


GAGGTGAGTC 


TTGTTAGTTT 


GTTGCAAATA 




GTGTTTTTTG 


7140 




TATAGGACGT 


TCTTAGTGGG 


ACATACGGAA 


TATTCGTGAT 


v.- X X X X AVJ X ^ 




7200 


5 


TATTTTTGTG 


GCGTGTTTTA 


TGTTTGATAC 


TCGAGTTCTG 


AGAC*ATTPAT 


VjAX X X\9VjV..AX 


7260 




GCGAAATCTT 


AATGATTTTC 


ATGATCTAGC 


GCAAGATATA 


X X VJ\J\r V.,^WvW X 


r: Prtrt A A TTf^ r* 

wU-oVjAAl XoL. 


7320 


10 


GTTGCACGTT 


TAGACTGAAA 


CACTCGTGTG 


ACCGTAAGTG 


TTAAT AnT A P 

X X X XAU X W 


AX X\7AXAV7\..X 


7380 




GCATTTACTT 


CACTCATTTT 


TATGACTGTT 


AAACAATGAT 




AX XAACJaCjX iT 


744 0 




GGTACGATGG 


TTTTGCCATT 


TTTCATCAAC 


GTAAATATAA 


AAA^VlAPnrzv & 


LtACACZA X ACA 


7500 


15 


TGTCCTAGCC 


CTATGGATAA 


AATGCAAATT 


TCTGCTTTTAT 




AuAwx^ X AvvA 


7560 




TAGATTGAAA 


ACAAAAAGAT 


CCTAAGAACA 


CCTTAA 




X t_ATAAATTG 


7620 




CAAACAATTA 


AGCCACAATT 


CAAAAATGAT 


X X /%\_ X X 


TT* a A PT*T R T O 


GTGCTGGTCT 


7680 


20 


AATTTGCCAT 


TGATATGGAT 


CTTCAAATTG 


TTGCCAATCTT 

X A \J^^i^^f\X W X 


wv.^x\*>^AX X 1 


CTTGCG CATT 


7740 




GACTAAGCAT 


GCGTCGAGTT 


CTTTTGTTAA 


A X X X X V~- X X 


i. ^ i. jHAX X Vi- lo 


TACCaATAAT 


7800 


25 


GACAAATTGT 


GTATGACGAT 


CGCCATATTC 




T p 21 n r*TYi rr* a 


LaTvJTTGACG 


7860 




TTCTGCTAAT 


ATTTGTGTTT 


GTTGCGCTTC 


AGACATAC*TA 




ATGTAACTGG 


7920 




ATGAATATTG 


CAAGATGACC 


CTGCTTGAGA 


TAATAAAr'AT 




X UTaTTGTGC 


7980 


30 


TAGCCATACG 


ATACCTITTG 


ATCGAACGAC 


aVX X'^X X X VJ^JW 


A± V9^h. i. 1. X L. X>i 


Al-i_AA X WAX X 


804 0 




GAACCrriTA 


GCATGGAAAG 


GTAGACGACG 


TTTATATACA 




XACvJAXAX X V 


8100 




TTCTGTTTCA 


GGTGTATGCG 


ATGCATGCCC 


ACCAGACTCA 


rw3 X X W X X X \JU\ 


X wWAX X\3W 


8160 


35 


TGACTCGCTC 


GCTTTITCAA 


AATCAAAACG 


CTGCGTATTC 


A Af* A PTTPT^T 

'V^VJ^l^^ X X X X 


TT A A ATr^A r» 
i lAAAXWXAC 


O ^ ^ 

o22 0 




TTCAGAATTT 


GTTGTCTTAA 


TAATTTTAGC 


AGTCGGTTGC 


AATGPnPTTA 


AV-A X 4 X X X X 


o o a r\ 


40 


TAACTTCGCT 


AGTTCTTCTT 


CACTAATTAA ATCAATTTTA 


TTAATAATCA 

X XXV\X/V\XVvA 


A T A A IT' a A 

AX A^—AX UAUA 




AAATTCAACT 


TGGTCAATTA 


ATAAATCAGC 


AATCX3AACGC 


X \_nx ^ X o X X X 


X ^.lAAV. wW X 


^ A f\ f\ 




TTGATCACGA 


TCCATCAATA 


AATCTTCTGA 


GTTGATGTCA 


TGTACX3AAGC 


GGTTAGCATC 


Q A c r\ 


45 


CACAACTGTA 


ACXATTGTAT 


CTAAACGGCA 


AATCGCTGTA 


AGATCAATGC 


CAAGTTCATC 


Q C 




ATCAATATAT 


GAGAAAGTTT 


GTGCAACAGG 


TACTGGCTCT 


GAAATCCCTG 


TTGACTCAAT 


o c o 






1 CGATGCCAC 


CTTTTTTCAC 


TAAACGCTCA 


ACTTCTTTTA ATAAATCGTC 


8640 


SO 


TCTAAGTGTA 


CAACAGATAC 


AACCATTAGA 


AAGTTCGACT 


AATTTTTCAT 


CTGTACGCGA 


8700 




TAGTCCCCCA 


CCATCTGCGA 


CAAGATCTTT 


ATCGATATTT 


ACTTCACTCA 


TATCATTTAC 


8760 




AATTACCGCX3 


ATACGTCGAC 


CTTCTCGATT 


TTGTAAAATA TGATTTAACA ACGTTGTCTT 


8820 
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ACTTCAATTT ATTTGTAAAT AGGAATAATT CTGTTTTACA TTATATAGGA GCGTTTCCTC 8940 

TTTCGCAATC TTCGATAATA AAAAAATAGT ATACTTAATT AAATTATTGA GCGCTTTACT 9000 

TTATAATGGA GACAAAGATA TATCTCACGA AAGAGAATCG AGGTGTATAA ACATGTTATT 9060 

TGTCATTTTA GTTTTATATG TTACTGGTAT TGCATTTATT CTACTCAGTG TTTTTGGTTC 9120 

AAAGACTGAA GGATTATCTA CGAAACATAC TTTATATACC ATTGGCAGTG CTATTATAAC 9180 

GATTGCTATT TTCATTTCAA TTGGCTATGC CATTCAATAC TTAACTGCAG CGCTTTATGG 9240 

TTTGTAAGGT GAAGGTGATG AGTAACGGGT AGTTCGGGAG AGGTrAACTT GCGTTGATTT 9300 

15 TGATAAAGTG ATCATAGCTT TTAGTACTTG AGGATTTTTA TTGTTGCTGT TACGAATGTG 9360 

GTCATGTTTA ATGCGGGACA GTAATTTAAG TTGTTTTTTT ACAATTGAGA GTGTGATATT 9420 

TCGATTCGGT TCGAATTACT TTACATGGGA ATAATATAAA TTAAAAAGAA GCGGCCTAGT 94 80 

GTCAGTTGTG AATATACTGA ACATTGGTCG CTTTATTTAG TAGTATGATA TGTAGTTTAG 9540 

CTATTAATTT TTTTCAGGTC ATCCTTAATG CTGTCTATCT CAGACATGGC ACTTTTAACC • 96 00 

CAATCTCCTT GAGCTGCACC TTTAAAATTA GCTTTAAAAg CtTCGCAATG TTGCGCCATT 9660 

TGTTCAATTA ATACTTTTTC TTCACCTTTT AATCCGTTTT CAATATCTTT GTATTTATGC 9720 

TTATGTTCAG GTGCAATAAC TGTGCGAATA TTTTCTTTTT GCGCTTCCAT TTTAGATATG 97 80 

AGATTAAGTG TTTCTACTGT AGTACTTATA TCTGGCATTC TTAAGGTCAT ATCTGGTTCT 984 0 

ATTAGAGTCA TTTAATCTCC TCCAAATTAT CAGTCACTTA GCTTATCTAA CTGCTTTTCA ^ 9900 

TAAGACTTTT TTAAGTCTTC TTTATATTCT TCTAATTTCC CATTCTTGCT TTCTGA 9956 
3S (2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2411 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

45 

TTTCTTTT A A CAGaTTTATC cCATTAATTG TTCTACAAGC CCACGaTGAG CAATATCATT 60 

TTTAGCAACC ATTAATAAAC CAGAAGTATC CATATCTATA CGGTGAACAA TACCTGGACG 120 

50 AATTTCTCCA TTAATACCTG ACAAATTTTT AATTTGATAC ATTAAAGCAT TAACTAATGT 180 

ATTGGTATAA TGCCCTGGTG ATGGATGAAC TACCATGCCT TTCGGlTfAT ATACAACTGC 24 0 

AACATCGTCA TCTTCATAAT AAATATCTAA ATTTAAATTT TCAGGTAGAA TATCAGCTTC 300 
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AACAACTTTA TCGTTTGCAA 


CGACTAAACC 


TGCTTTAATC CAATPTTfSTii TrTrv^T^^nn 


420 




AGACCAATCA TTATTTAATT 


CAGGCAGCAA 


CTTATCTACA CGCATACCTG TTTfSTT< i 


480 


5 


ATCTGTAATG 


TTAAATTCAT 


AAGTCTCCAT 


TACTTAACCT CCTTCTCCTT TTTJlTTfV3JVA 


540 




GTATCCTTTA 


ATAAGGCAAT 


AATAATTAAT 


ATTACACCAA TTGTTAAACT* TnAJ^TmYzrr 


600 


10 


ATATTAAATA 


TTGGAAAATC 


ATAACCAAAA 


ATATTTGTAT CAATAAAfiTr* ]iAr*AL.rn*rr^ 


660 




CCTGTTAAAA 


TTCTATCAAT 


AAAGTTTCCA 


AGTGCACCTG CAAAAAGTAA apTaaTarr-a 


720 




ACTTGCATAA ACAAATTATA 


TTGAGCATCT 


TTAATAAAGA AATATACTHa nr'r^&TTna'r 


780 


15 


ATAATAATGG 


TAATAATAAA 


GAAAAATGTC 


ATTi^l'TCCAC TC!AATATTPP rr'a*TY^*^R^r«» 


840 




CCATTATTTC 


GATGTGaTGT 


TATGTTTAAA 


AAGTGCGGTA TPAPTTPaaa. Tv^aivpr^/^/^A 


900 




ATTTTCATTG 


TAGTAGCTAT 


AATATATTTA 


GTAACTTGGT CAAATATaAP nar-aan'rar-r 


960 


20 


GCTATTAAAA 


TGGAAGTGCC 


AATAAAATAT 


TTTTTGTGCA TT*!"!* CGTrrr* Trr" a a t a r *t» 
******** v*^ii.rt X X X X \v>j X X WW X ^vmAa X y f%f\ 1 


1020 




CGTCCATGAG 


ACAACTCTTT 


ATATTATAGC 


TT ACACCTGC T AAT AAA a a a a riT a a a t r 


1080 


25 


TTACATTAAA 


TCTAATGTTA 


CTAACTCAAT 


ACTTGATAAA PTAPTATVJTT TTr^ana'TTtKiv 

* * * xw\ \„ X X n. X v> X X X x ^xnLaiA X. X AA 


1140 


ATATGAACTT 


AATTATTCAT 


TTATCATATT 


TAAGATGA("!A T*TAAaaA'T*Ta rr-a a iior^is./-5^ 

X «#»^«f\ X XnnAMnX XA ijuAAAuClAuG 


1200 




CTGGAACATA 


AATCCCTAAA 


AAGACAGTAG 


TAAGATATTT TCTAATTAAA A AT*raTr*T*r a 

* * *^ X X X xVi^xnnx x^W\ AnX XAXwX Lfx 


1260 


30 


CTGCTGTTCT 


CTATTTATAC 


AATACTTCGT 


ATTGAATGGC TTCGCTATftP r*paTr*TY:!r*r*a 


1320 




CATTACTGTA AAATTCTATA AATAGAATTT 


TTGATGATGG GTCPCTTPrT AfifsriTPPPP'r 


1380 




CTCAGCCTCG 


GtCTTCGACT 


GGCACTGCTC 


CCTCAGGAGT CTCGCCATTA ATAPT'APGTa 

■Si- w * w«-»ww«-i>w X wxwwwwnXX^ '^Xnw X^\ww Xx^ 


1440 


35 


TTAACATGTA 


ATTTTACTTT 


TAAATACTTT 


AAAAAAATAA GACATGAATC GTPTAPAPTT 


1500 




AATTGGACAA 


ATTCTATGAG 


AATAGATATT 


GTTAATTTAA GAAAGTAGGf* TATTTTfSAi^T 


1560 


40 


TtCACTCGAA 


TGTCAGTTCG 


AGGAATAAAT 


AAAGTTAAAC GAGAGCTAGG TTTrGTATTa 


1620 


ATGGCAATTA ATATAAGGAA AATAGCAGCT 


CAACGAGCTG TACATTATAA- AATAPATATP 


1 £ o n 




AAAAAAGCTG 


ATTTCTATCA 


AATAATTAAT 


AGAAATCAGC TTTTTTACAT TGPPTAAGAA 


1 A A 


45 


CTTAATGTCC 


CAAGCCCTAA 


AACTTGTTGT 


TATTTATTTG ATTTAGCAGC GATAPCSTTH ' a 


1 Q r\ A 




TATCTTAAGT ACATAAATGC TAAAAGTATA 


AACCAAATCG GAATAAAATA AATTGr* A ppt 






CTTGTATCAA 


CATTAATAAA 


TAATAACCCG 


AACACAAAAA TGAAGAATAC AAATATTAAG 


1920 


SO 


TAGCCCATAT 


ATTTGCCACC 


TAATAGTTTG 


TACGTAGCAT TTTTATGTAG ATCTGGGTTT 


1980 




TTACGACTAT 


AATTGATATA 


TGCAATGATA 


ATCAGACCCC ATACAACTAA AAATAACACT 


2040 




GTAGAGATGG 


TAGTCACATA 


CX5TAAATACT 


TTTGTCGCAT CTGGGAAAAT ATAGTTTAGT 


2100 
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TTATTCGTCT TAGAAAAGTT CGGAGGTGCT TGTyGTTGAC TTGATAAACC GaAAAGCATA 2220 

CGGCTATTTG AGAATATACC ACTGTTACAT GATGAAGCAG CAGCGGTTAA TACTACAAAA 2280 

TTAATCAAGC CCGCAGCAAA CGGAATTCCG ATCAATGCGA ATrATTTTnC GAATGGACTG 2340 

TTATCAGGAT CAACTTGCTG CCAAGGGGTA ATAGACATGA TAACCGCTAA CGCCCCAACG 24 00 

TTnnATATTA A 2411 
(2) INFORMATION FOR SEQ ID NO: 354: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



25 



30 



20 <xi) SEQUENCE DESCRIPTION; SEQ ID NO: 3 54: 

GnGAATTATT TTTAATAATG AAAGGATTAC TTnCATGGGT TTTTACTAGG AnTACCCAGA 6 0 

AGGTCAAAAT ATTTTTGATG CGCTAAGTCA ATATGAAGTT AAGCGACGCG GCGATATGGA 120 

AGAGGATCCA TCATATAAAC AACTCATTTC TTATTGTTTA CTTGAAAATG AGCATGGCGA 180 

GATATTAGTG TATGAACGAT TATCTGGCGG TGGAGAAGCT CGATTGCATG GACAATCTTC 24 0 

AATAGGTGTA GGCGGTCATA TGAATGATGT TCCAGGAGCA GAATCTATTA ACGAAGTATT 300 

GAGAGTTAAT GCACAGAGAG AATTAGAAGA AGAAGTAGGT TTAAGTGAGC AAGATTCACA 36 0 

AAATATGGAA TATATCGGTT TTATTAATGA CGATAATAAT GAAGTGGGCA AGGTACATAT 420 

TGGTGTTGTA TTTAAAATCA CTGTAAGTAC GAATGATGTA GAAGCTAAAG AAACAGATAC 4 80 

TTTACGAATA AAATGGGTTG AAAAAGGCAA CATAGAGTCA TATGATGATT TCGAAACGTG 54 0 

GAGTGCATTA ATCCTTCAAG ATTTATAATC AAACGAGGTG ACATATATGT CAGATATTAT 600 

40 TCCAG 605 
(2) INFORMATION FOR SEQ ID NO: 355: 



35 



45 



SO 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 
TTTATTAGCA CATCCAAACT ATTCATATGT TGGACAATTT TTAAACGAAC TAGGATTTAA 60 
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ACAATTAGAC ACTGAACATT TAGCTGATTT AAATCCAGAG CGTATGATCA TTATGACAGA ' 190 

TCATGCTAAA AAAGATTCTG CTGAATTCAA GAAGTTACAA GAAGATGCAA CATGGAAAAA 24 0 

GTTGAATGCA GTTAAAAATA ATCGCGTGGA TATTGTTGAC CGTGATGTTT GGGCAAGATC 300 

TCGTGGCTTA ATTTCTTCTG AAGAAATGGC TAAAGAACTT GTTGAATTAT CAAAAAAAGA 36 0 

ACAAAAGTAA GGTGGAAGTA AATGGCTATA AAAGAAATAA GTAGCCAATC TGCCATAGAT 42 0 

CATAAAAGAA AAAGACGCAC AACACTCACG TATATAGTGA GTTTGTGCTT TCTTTTTATT 4 80 

TGTATATATT TAAATATGGC GATTGGTTCT TCGAAAATTA ATTTTAGCGA TATCATTCAC 54 0 

^5 TATGTTACTG GTCATACAGA TACGAAAGCA ACGTTTTTAT TGCATAATGT ACGTATGCCA 6 00 

AGGATGATTG CAGGGTTATT TATTGGCGGT GCATTAGCGG TATCTGGTTT GTTAATGCAA 660 
GCAATGAC 

20 (2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 737 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

ATACAAAAAA ACATATCGAA AATAAAGCTA AAAGAAACTA TCAAGTTCCA TATTCAATTA 60 

ATTTAAATGG TACATCTACA AACATTTTAT CGAATCTTTC ATTTTCAAAT AAACCTTGGA 120 

35 CAAATTACAA AAATTTAACT AGTCAAATAA AATCAGTACT GAAGCATGAT AGAGGTATTA 180 

GTGAACAAGA TTTAAAATAT GCTAAGAAAG CTTATTATAC TGTTTATTTT AAAAATGGTG 240 

GTAAAAGAAT CTTACAGTTG AATTCAAAAA ATTACACAGC AAACTTAGTT CATGCGAAAG 300 

ATGTTAAGAG AATTGAAATT ACTGTTAAAA CAGGAACTAA AGCGAAAGCA GACAGATATG 360 

TACCATACfAC AATTGCAGTA AATGGCACAT CAACACCAAT TTTATCAAAA CTTAAAATTT 420 

CGAATAAACA ATTAATTAGT TACAAATATT TAAATGACAA AGTGAAATCT GTATTAAAAA 480 

GTGAAAGAGG CATCAGTGAT CTTGACTTAA AATTTGCGAA ACAAGCAAAA TATACAGTAT 540 

ATTTCAAAAA TGGAAAGAAA CAAGTAGTGA ATTTAAAATC AGACATCTTT ACACCTAATT 600 

TATTTAGTGC CAAAGATATT AAAAAGATTG ATATTGATGT AAAACAATAC ACTAAATCAA 660 

AAAAAAATAA ATAAATCTAA TAATGTGAAA TTCCCAGTAA CAATAAATAA ATTTGAAAAC 720 

ATAGTTTCAA ATGAATTTGT GTTCTATAAT GCAAGCAAAA TTACmATTAA TGaTTTAAGT 780 
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(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 4 base pairs 
^ (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

70 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 57: 
AAAGTAAAAA TAAATCTCCC TTTTTAACTT TCGTTTCTGC CATAGCCATT GCTTCTTCTG 60 
TGATAGTTGC TACAATATCT TTTCTTTCAC GGTTAAAATG TTCAACTTGT TCTGCTAAAA 120 

IS 

ATGCAGCTTC TTCTTCGACG TCAGTCATCA ACAATTCGCa AGCTAATGAT GCGTCATCTA 180 
AACGACCTAC AGCATTAAGT CTAGGTCCAA TAATAAAACC AATTGTTTCT TCATCAATAT 24 0 

20 TGTCATTGTA TCCCGCTTCT TTTAGCAATG CTTTAACAGA GGTCGGACAT TGATCATTTA 300 

AGACTTTTAA TCCTTGTtTC ACTAATGATC G A TTTTCATC AGTTAAGGAT ACTAAATCCG 360 
CAATGGTACC TATCGCAACT AATGCTTTAA AATAATCAGG TACATTTtCA ATCAATGCTT 420 
GTGCTAATTT GTATGCAACA CCTOCACCAC ACAATTGTTG GAACGGATAA TTAAACGATG 4 80 

GATGCATTGG ATGTACGATT GCATATGCTT CTGOTAATGT ACTACCAATT TCAT 534 
(2) INFORMATION FOR SEQ ID NO: 358: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

40 GGTGAGTCAA ATTAAATGAA TCTAATAAGT CATAACTATC TATTTGTAAT GTGCAACGCT ' 60 

TAACGCATAT ACAAAATGAA TGTGCTGATA ATGATTTACT CAAATTAAAA GGTGATTTTT 120 

ATTCAATGAT GAATGAAAGT TGCCTTTTTA TTTTTGGTAA AAGTTAATGC GTCAGTGAAT 180 

TGTGTAAGTT TTTCAAAAAG TAAAAAGAAA TAATAAAGGT GAATTATTAG AATTCCArAA 24 0 

ATAATTCATT ACATTCATAA AGCATTTTAC AAATGGTAAG AAAATGAGTG TTACAAATCT 300 

AAATATTGCA AAAGAAGCTG ATTTAGTCAC AAAAAATGTC CTATGTAATA ATTCGAGAAA 360 

GATGCACTAT ATACGGTCTT CTTACTATTC AAATGTAAAA GTTGCTTATT TGCGTGGCTT 4 20 

TTTGTTTTAT AAAAGTATAA AATTTTACTA TAATATATCT TGTAGAGAAC AATGAAATGA 4 80 
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ATAGCAAACT GTATTACTTT GATACAAAAA TGGTTGTAAT AAATATTTAT CGATATGACG 60 0 

ACTTGAATAT GATAAAGTGA CATATTTATG TATATGACTA TTTCGCAAAA TGTAATCGAG 66 0 

GTAGAArrrC TTGACAATTC TGTCAGTTTA TAAGATGTTA TAAATATGTA GTGTATAAGG 720 

AGGCAAACAA GATGACTGAA GAATTCAATG AATCAATGAT TAACGATATT AAAGAAGGTG 780 

ACAAAGTCAC TGGCGAGGTA CAACAAGTTG AAGACAAGCA AGTTGTTGTT CATATCAACG 84 0 

GTGGTAAATT TAATGGGATT ATTCCTATTA GTCAACTATC TACGCATCAT ATTGATAGCC 900 

CAAGTGAAGT TGTAAAAGAG GGCGACGAAG TTGAAGCATA TGTCACTAAA GTTGAGTTTG 96 0 

IS ATGAAGAAAA TGAAACTGGA GCTTACATCT TATCTAGAAG ACAACTTGAA ACTGAGAAGT 102 0 

CTTATAGTTA TTTACAAGAA AAATTAGATA ATAATGAAAT CATCGAAGCG AAAGTAACAG 1080 

AAGTAGTTAA AGGTGGTTTG GTTGTTGATG TAGGACAAAG AGGTTTTGTT CCGGCTTCAC 114 0 

TAATTTCAAC AGACTTCATT GAGGATTTCT CTGTGTTTGA TGGACAAACA ATTCGTATTA 1200 

AAGTTGAAGA ATTGGATCCT GAAAATAATA GAGTCATTTT AAGCCGTAAA GCAGTTGAAC 1260 

AAGAAGAAAA CGATGCTAAA AAAGATCAAT TATTACAATC TTTAAATGAA GGCGATGTTA 132 0 

TTGATGGTAA AGTAGCGCGT TTAACTCAAT TTGGTGCATT TATAGACATT GGCGGTGTTG 1380 

ATGGTTTAGT GCATGTATCT GAACTTTCTC ACGAACATGT TCAAACACCA GAAGAAGTAG 144 0 

TTTCAATTGG TCAAGATGTT AAAGTTAAAA TTAAATCTAT TGATAGAGAT ACAGAACGTA 1500 

TTTCATTATC AATCAAAGAT ACGTTACCAA CACCTTTCGA AAATATTAAA GGTCAATTCC 1560 

ACGAAAATGA TGTCATTGAA GGTGTCGTAG TAAGATTGGC AAACTTTGGT GCATTTGTTG 162 0 

35 AAATTGCACC AGGTGTACAA GGACTTGTAC ATATTTCTGA AATTGCACAC AAACACATTG 1680 

GTACGCCAGG TGAAGTGTTA GAACCTGGTC AACAAGTAAA TC5TTAAAATA TTAGGTATTG 1740 

ATGAAGAGAA TGAAAGAGTA TCACTATCTA TTAAAGCAAC ATTACCAAAC GAAGATGTTG 1800 

TTGAAAGTGA TCCTTCTACG ACTAAGGCGT ACTTAGAAAA CGAAQAAGAA GATAATCCAA 186 0 

CAATTGGCGA TATGATTGGT GATAAACTTA AAAATCTTAA ACTATAATTT AATATTTAAT 192 0 

AGTCAACTCC ACA-rGXTTAT GATTGcATGT GGAGTATTTT TATGTAACAA AATATACTCG 1980 

GAATGATAAC GTGGgACAAA TTTAACTAAG TGTTTAAAAA GATArAGTTT TAAGTGCtGa 204 0 

tTTTTATCAT TACAGTAATA AACTCATTTT GAATACACAG TCTCATGTGA TATTATTAAA 2100 

AAGATATaAG AAAGAGAGGA AGTTAGCTTA TGACTAAACC TATAGTAGCT ATTGTAGGTA 2160 

GGCCTAATGT AGGTAAATCT ACAATTTTTA ATAGAATAGT TGGAGAACGT GTTTCGATTG 2220 

TGGAAGACAC GCCAGGTGTA ACACGAGATC GTATTTATTC TTCAGGTGAA TGGTTAACAC 22 8 0 
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AAATTAGAGC GCAGGCAGAA ATCGCCATAG ATGAAGCGGA TGTTATTATT TTTATGGTTA ' 2400 

ACGTGCGTGA AGGATTGACA CAAAGCGATG AAATGGTCGC TCAAATTTTA TACAAATCTA 2460 

AAAAACCGGT CGTATTAGCG GTTAACAAAG TAGATAATAT GGAAATGCGT ACAGACGTGT 2520 

ATGATTTCTA TTCATTAGGA TTTGGTGAAC CGTATCCGAT ATCAGGGTCA CATGGTTTAG 2580 

GTCTTGGTGA CTTGTTAGAT GCAGTTGTTT CTCATTTTGG TGAAGAGGAA GAAGATCCTT 264 0 

ATGATGAAGA TACAATTCGA CTATCCATTA TTGGACGACC AAACGTAGGT AAATCAAGTT 2700 

TAGTAAATGC TATTTTAGGT GAAGATCGCG TTATCGTTTC TAATGTTGCA GGGACAACGA 2760 

GAGACGCTAT TGATACAGAG TATAGTTATG ATGGACAAGA TTATGTTTTA ATCGATACTG 2820 

CTGGTATGCG TAAAAAAGGA AAAGTATATG AATCAACTGA GAAATATTCA GTATTAAGAG 2880 

CTTTAAAAGC GATTGAACGT TCAAATGTTG TTTTAGTGGT CATAGATGCA GAACAAGGCA 2940 

20 TCATTGAACPl AGATAAACGT GTTGCAGGAT ATGCACATGA ACAAGGTAAA GCAGTCGTGA 3 000 

TTGTCGTAAA TAAATGGGAT ACTGTGGT^AA AAGATAGTAA AACGATGAAG AAATTTGAAG 3060 

ATGAAGTACG TAAAGAATTC CaATTTTTAG ATTATGCACA AATTGCTTTT GTGTCTGCTA 3120 

AAGAACGCAC AAGATTACGT ACATTATTCC CTTACATCAA TGAAGCAAGT GAAAACCATA 3180 

AAAAACGTGT TCAAAGTTCA ACTTTAAATG AAGTTGTTAC TGATGCAATT TCCATGAACC 3240 

CTACACCAAC AGACAAAGGT AGACGTTTGA ATGTCTTTTA TGCAACACAA GTTGCTATAG 3300 

AACCACCGAC ATTTGTTGTA TTTGTTAATG ATGTAGAATT AATGcATTTT TCTTATAAAC 3360 

GCTATTTAGA GAATCAAATC CGTGCCGCTT TTGGTTTTGA AGkTACACCA ATTCATATTA 3420 

TAGCTCX3AAA GAGAAATTAA CGATTGGGGG ATAACAATGA CTAAAATTAC CGTTTTTGGT 34 80 

ATCQGAAGTT TTGGGACAGC CCTTGCCAAT GTTCTTGCAG AAAATGGACA TGATGTTTTG 3540 

ATGTGGGGTA AAAATCAAGA TGCTGTTGAT GAATTAAATA CATGTCATAC AAATAAAAAG 3600 

TATTTAAAAT ACGCOAAATT A ^^^l 
(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

< C) STRANDEDNESS : doubl e 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 
CTTTCGGAAA TTAGGATCnG nGCTATCTTG GCCCCAATTA CCAAGGGAAC TAnTGGCACC 6 0 
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AATGCtTCTT TCATCTCCAT GCCCTGTTGC TCATTATTAA TAACACGGTC TATTAACACA 
ATGGCATTTG TTACTACX5AT TCCAATTAAC ATTAGCATAC CAATTAAACT TGGTACTGAT 
ATTGTTTCTC CTGTGATTAA TAGTGCAATA ATTACACCGA TAACTGTAAA TGGTAAAGAG 
AATAAAATTG TAAATGGTGC TAGGCCACCT TTAAATGTAA TAACTAGGAT TAAATATACG 
ATAATGATTG CAGCTAACAT TGCAAAGGCT AATTGTGTCA TTGCATTGTT AATATCATCT 
GATGCACCAC CGATATTAAC CTTTACATTA TTCGGTTTAT CCAAATTATT TATTTTAGAC 
ATCACTTGTC GTGTTGTGCC ACCCACATCT TTATTTGTTA CTTTAGCAGA TACCGTCGTT 54 0 

^5 GCATAATCTC CTTGTTCITG CGTCAATTTA CTTGGTGTCG TTGTTTTAAC TAACGTAGCG 600 

ATATCTCCCA ATTTAATCGT ACCACCAGTC GGCTTTTTCA AAG 643 
(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

TTTCAGGCyG TGCAAGGGCT TTTTCTTTTG CTTTTAAGAT TATGATTTAT CGTGCAAAGT 6 0 

30 

TAAGTGGTCG TATATAGTTT TAGTTTTAAA AAGGTAATTA AATAAAATAG TTTGCCGAGG 120 

GAGATGTCAA PiATGATTPAA ATACCTAGAG GGACGCAGGA TATTTTACCT GAAGATTCAA 180 

3^ AGAAATGGCG TTACATTGAA AATCAATTAG ATGAATTAAT GACATTTTAT AATTATAAAG 24 0 

AAATAAGAAC ACCAATTTTT GAAAGTACAG ATCTTTTTGC AAGAGGTGTT GGTGATTCAA 300 

CCGATGTCGT ACAAAAAGAA ATGTATACAT TTAAAGATAA AGGCGATAGA AGTATTACAT 360 

TAAGACCTGA GGGAACAGCT GCAGTTGTGC GTTCATATAT TGAACATAAA ATGCAAGGTA 420 

ATCCAAACCA ACCAATTAAA CTTTATTACA ATGGACCGAT GTTTAGATAT GAACGTAAGC 480 

AAAAAGGACG CTATCGTCAA TTTAATCAAT TTGGTGTAGA AGCTATTGGT GCTGAAAATC 54 0 

CTAGCGTAGA TGCAGAAGTA TTAGCTATGG TTATGCATAT TTATCAATCA TTTGGATTAA 600 

AACATTTAAA GCTTGTTATT AATAGTGTAG GGGATATGGC GTCTCGAAAA GAATATAACG 660 

AAGCGTTAGT GAaACACTTT GAACCAGTAA TTCATGAATT TTGTTCAGAT TGTCAATCAC 720 

GTTTGCATAC AAATCCGATG CGAATTTTGG ATTGTAAAGT AGACCGTGAT AAAGAAGCGA 780 

TTAAGACTGC ACCTAGAATC ACTGATTTCT TAAATGAGGA ATCTAAGGCA TATTATGAAC 84 0 
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GTGGATTGGA TTATTATACA CATACAGCAT TTGAATTAAT GATGGATAAC CCTAACTATG ~ 960 

ATGGTGcCAT TACAACGCTT TGTGGTGGTG GCCGTTATAA TGGTTTATTA GAATTGCTAG 1020 

ATGGTCCAAG TGAAACAGGT ATTGGTTTTG CGCTAAGTAT AGAACGATTA TTGCTTGCAC 10 80 

TTGAAGAAGA AGGTATCGAA TTAGATATTG AAGAAAACTT AGATTTATTC ATTGTTACAA 1140 

TGGGTGATCA AGCAGATCGA TATGCTGTGA AGCTATTAAA TCATTTGAGA CATAATGGTA 12 00 

TTAAAGCAGA TAAAGACTAT TTACAGCGTA AAATTAAAGG ACAAATGAAA CAAGCAGACC 12 60 

GTTTAGGTGC CAAGTTTACA ATCGTTATTG GTGATCAAGA ATTAGAAAAT AATAAAATCG 13 2 0 

ATGTTAAAAA TATGACAACT GGTGAATCTG AAACAATTGA ATTAGACGCA TTAGTCGAAT 1380 

ATTTTAAGAA GTAGAGAGGG CGTTAAAATA TGAGTAAGAG AACAACTTAT TGTGGATTAG 14 4 0 

TTACTGAGGC ATTTTTAGGA CAAGAAATTA CATTAAAAGG ATGGGTTAAC AATCGTCGTG 15 00 

20 ACCTTGGTGG ATtGATTTtC GTTGATTtAA GAGATAGAGA AGGAATTGTA CmAGTCGTGT 156 0 

TTAATCCTGC ATTTTCAGAA GAGGCaTTGA AAATTGCTGA AACAGTACGT TCTGAATATG 162 0 

TTGTAGAAGT TCAAGGTACA GTTACGAAGC GTGACCcTGA AACAGTTAAT CCTAAAATTA 16 80 

AAACTGGCCA AGTTGAAGTA CAAGTTACAA ATATTAAAGT GATTAATAAA TCTGAGACAC 174 0 

CACCATTTTC TATAAATGAA GAAAATGTTA ACGTTGATGA AAATATTCGA TTAAAATACC 180 0 

GTTATTTAGA TTTACGTCGT CAAGAGTTAG CGCAAACATT TAAAATGAGA CATCAAATTA 166 0 

CACGTTCTAT TCGTCAATAT TTGGATGATG AAGGGTTCTT TGACATCGAA ACACCAGTAC 192 0 

TAACGAAGTC AACACCTGAG GGTGCACGTG ACTATTTAGT ACCATCTCGT GTTCATGATG 1980 

GTGAATTTTA TGCATTACCA CAATCACCAC AATTATTTAA GCAATTATTG ATGATTAGTG 204 0 

GATTTGACAA ATACTACCAA ATCGTAAAAT GCTTCCGTGA CGAAGATTTA CGTGCAGATC 2100 

GTCAACCTGA ATTTACACAA GTCGATATTG AAATGAGTTT TGTAGACCAA GAAGATGTGA 2160 

40 TGCAAATGGG TGAAGAAATG CTTAAAAAAG TTGTTAAAGA AGTTAAAGGC GTTGAAATTA 2220 

ATGGCGCTTT CCCACGCATG ACATATAAAG AAGCGATGCG TCGCTATGGT TCTGATAAAC 2280 

CAGATACACG TTTTGAAATG GAATTAATTG ACGTTTCTCA ATTAGGACGT GATATGGACT 234 0 

TTAAAGTATT TAAAGATACT GTTGAAAATG ATGGTGAAAT TAAAGCAATT GTCGCTAAAG 24 00 

GTGCAGCTGa ACAATATACT CGTAAAGaTA tGGGaTGCTT TAACAGAATT TGTaAACaTC 24 60 

ymTGGtGCTA AgGtTAGCGT GGGGTTAAAG TTGTGGGAAG GTGGTTTTGA CAAGGTCCCA 2520 

ATGG 2524 
(2) INFORMATION FOR SEQ ID NO: 361: 
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(A) LENGTH: 1507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 



10 


TCGTTGAGTA 


AAAGTCCAGA 


AAATTGGATG 


AGTAAACTTG 


ATGATGGAAA 


ACATTTAACT 


60 




GAGATTAATA 


TACCGGGTTC 


ACATGATAGT 


GGCTCATTCA 


CTTTAAAGGA 


TCCAGTAAAA 


120 


15 


TCAGTTTGGG 


CAAAGACTCA 


AGATAAAGAT 


TACCTTACCC 


AAATGAAGTC 


GGGAGTCAGG 


180 


TTTTTTGATA 


TTAGAGGTAG 


AGCAAGTGCT 


GATAATATGA 


TTTCAGTTCA 


TCACGGCATG 


240 




GTTTATTTGC ATCATGAATT AGGAAAATTT 


CTCGATGATG 


CTAAATATTA 


CTTGAGTGCT 


300 


20 


TATCCAAACG 


AAACAATTGT 


GATGTCTATG 


AAAAAGGACT 


ACGATAGCGA 


TTCTAAAGTT 


360 




ACGAAGACAT 


TTGAAGAAaT 


TTTTAGAGAA 


TATTATTATA 


ATAACCCGCA 


ATATCAGAAT 


420 




CTTTTTtACA 


CAGGAAGTAA 


TGCGAATCCT 


ACTTTAAAAG 


AAACGAAAGG 


TAAAATTGTC 


480 


25 


CTATTCAATA 


GAATGGGGGG 


TACGTACATA 


AAAAGTGGTT 


ATGGTGCTGA 


CACGTCAGGT 


540 




ATTCAATGGG 


CAGACAATGC 


GACATTTGAA 


ACGAAAATTA 


ATAATGGTAG 


CTTAAATTTA 


600 




AAAGTACAAG 


ATGAGTATAA 


AGATTACTAT 


GATAAAAAAG 


TTGAAGCTGT 


TAAAAATTTA 


660 


30 


TTGGCTAAAG 


CTAAAACGGA 


TAGTAACAAA 


GACAATGTAT 


ATGTGAATTT 


CTTGAGTGTA 


720 




GCGTCTGGAG 


GCAGCGCATT 


TAATAGTACT 


TATAACTATG 


CATCACATAT 


AAATCCTGAA 


780 


35 


ATTGCAAAAA 


CGATTAAAGC 


AAATGGGAAA 


GCTAGAACGG 


GTTGGCTGAT 


TGTTGACTAT 


840 


GCAGGATATA 


CGTGGCCTGG 


ATATGATGaT 


ATCGTAAGTG 


AAATTATAGA 


TAGTAATAAA 


900 




TAAGGATTCA 


ATAATGATAT 


TAAGACGAGT 


ATGAAAATAG 


TTAGATTCTA 


ATTATTTTCA 


960 


40 


CTACTCGTTT TTATTTTGAA AATAAGTAAT 


AATTCAACAA 


TATTATAAAT 


TGAACAGATT 


1020 




GTTTGTGAAA 


rrriTGATAA 


TATTAAAGTG 


AAAAAGTGTT 


ATAAATTGAT 


AAATATATGT 


1080 




AATTAACAAA 


AACAAATCAT 


TTTAAAAAGA 


AGAGAGTTGT 


AAGATGATGa 


AACGATTAAA 


1140 


45 


CAAATTAGTG 


TTAGGCATTA 


TTTTTCTGTT 


TTTAGTCATT 


AGTATCACTG 


CTGGTTGTGG 


1200 




CATAGGTAAA 


GAAGCGGAAG 


TTAAGAAAAG 


CTTTGAAAAA 


ACATTGAGTA 


TGTACCCTAT 


1260 




TAAAAATCTA 


GAGG ATTTAT 


ACGATAAGGA 


AGGCTATCGT 


GATGATCAGT 


TTGATAAAAA 


1320 


SO 


TGATAAAGGT 


ACATGGATTA 


TAAATTCTGA 


AATGGTTATT 


CAACCTAATA 


ATGAAGATAT 


1380 




GGTAGCTAAA 


GGCATGGTTC 


TATATATGAA 


TAGAAATACC 


AAAACAACAA 


ATGGTTACTA 


1440 




mTATGTCGAT 


GTGACTAAGG 


ACGAGGATGA 


AGGAAAACCG 


CACGACAATG 


AAAAAAGATA 


1500 
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(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1216 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 

TACATGTTTC GGATGCTACT TTATTTAGTT TGAAGGGTGC ATTATGGACG TTAGCGCAAG 60 

,5 AAGTTTATCA AGAATGGTAT TTAGGATCGA AGTTGTATGA AGATGTTGAA AAGAAAATAG 120 

CACGAACTAC TTTTAAGACA GGTTATATTT ATCAAGAAAT TATTXTGAGA CCAGTAGATG 180 

AAGTTAAGGT ACTTCTGAAT GATTTAAAAG GTGCTGGTTT CGAATTAGGT ATTGCAACAG 240 

^0 GTCGTCCTTA TACTGAGACT GTTGTGCCAT TTGAAAATTT AGGATTGTTA CCATATTTTG 300 

AAGCTGATTT TATTGCAACA GCAAGTGATG TTTTAGAAGC AGAGAATATG TATCCGCAAG 360 

CACGACCATT AGGAAAGCCG AATCCTTTTA GTTATATCGC AGCTTTATAT GGTAATAATC 4 20 

GCGATAAATA TGAATCTTAT ATCAATAAGC AAGATAACAT TGTAAATAAA GATGACGTAT 4 80 

TTATAGTAGG CGATTCGTTA GCTGACTTAT TAAGTGCTCA AAAAATAGGT GCAACGTTTA 54 0 

TTGGAACATT AACAGGTTTA AAAGGTAAGG ATGCTGCAGG TGAGTTAGAA GCGCATCATG 600 

CCGACTATGT TATTAATCAT TTAGGTGAAC TTAGAGGTGT ACTAGATAAT TTGTAATTTG 660 

ATTGTTGTTT GACAGCATAA CTTGTAGTGA ATGATTGAAC CAAAGGTTTC ATATTGAGTT 720 

ACAATGAAAT TAATAATGAA AAAATGCCAA GAAGCAATGG AAGTAATCCA ATGTCTTCTT 780 

GGCATTTTGA ATTTACATAA ATTGTTTATG ACTGTACCGT CAATTCAGTT GTGAAAATTT 840 

GATTGTATTC ACCAACTTGT TTAAGTTCAT CAATTATATT GTTTGAAACA GGTTGATCAA 900 

40 CGGATAAAAT CATTaJsCGCA TCTCCGCCCG CTTCAGTTCT ACCTAAAGTC ATAGATGCAA 960 

TGITGATATT GTATTTACCT AACAATGCGC CAGTTTTTCC TACCATACCT GGAGTATCAT 1020 

TATGATATGA CACAATTTGA TATTGATTTG GCTTTAAGTC TACAGAAAAA TTATTAATTC 1080 

TAACAATTCT TGGACCGAAA CCTGTAAAGA CAGAAGCGCC AACTTTAACG GAATCGCnAT 114 0 

CGnTTGATAG TTCTACCTCT AAGTAGTTAC TAAAACCTGT CTCTGCTTTA TTATTTTCAA 12 00 
TATTTAATGT CACTTG 

50 

(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 



W 



20 



2S 



30 



ATCAAAATAT 


TTTAAATAAT 


ATTGATGGTC 


ACATTGTAAA 


TrmAATAGAA 


AATAAATTTG 


60 


ATCAAATATT 


ACAAGAACCA 


TTAAATCCAT 


TAAATTATGA 


TACTGTCAGT 


GGATTAGCTG 


120 


GGATAGGGAG 


ATATTTGCTA 


AATAGAGTAG 


ATGAGAATGA 


ATTTAATGTT 


AAAGCATTAA 


180 


AAAGCATATT 


AGTATACTTT 


AAAGATATTC 


AATATTCTAA 


AAATAGCTGG 


GTAGTCCCAC 


240 


AAGArAGTCA 


ATTTTTAGAG 


TCTGATAAAA 


ATTATTTTAC 


TGAAGGTAAT 


ATCAATCTTG 


300 


GCCTTGCACA 


TGGaGTGCTA 


GGACCGATGT 


CTTTATTTGC 


ATTATGCGTG 


ATTAAAGGAA 


360 


TTACGATTGA 


AAATCATCAG 


CACATATTAA 


AAGACATGTA 


CAAATTTATC 


ATGGACGAAA 


420 


AATTTTGTAA 


CCACGAAAGA 


TGGTTGCAGC 


GTTACGATTT 


AATTTCTGAA 


CGTAATCATT 


480 


TCAATTTTAT 


TCGGAATGGT 


TGGTGTTATG 


GCAATACGGG 


TGTAATGACG 


ACGTTGTTTT 


540 


TAATCGGCCA 


AGCATTACAA 


GATGATGAAA 


TAATTAAAAT 


GTCTAAAAAA 


GTGATGCTAC 


600 


AAGTAGTAAA 


TGATAAAGAT 


GAAAATTTAA 


TAAGTCCAAC 


TATTTGTCAT 


GGATTGTCAT 


660 


CACAAATATT 


AATGTTAACA 


ATTATGAATT 


TGAATTTTGA ATTAAATGAA GTGTCTGATT 


720 


ATATCACTGT 


ATTAATAAAT 


AAACTGATTT 


CTCATTATAA 


GGAAGATTAT 


CTGGTGAATT 


780 


TTATAGACAT 


TAATGAAAAT 


AAGCAAGATG 


TATTTAAAAG 


TAGGAAAGTT 


GGCCTTTTAG 


840 


AAGGTGAATT 


AGGGGTCATC 


TAACATT 








867 



(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10813 base pairs 

(B) TYPE; nucleic acid 
40 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 

TACCTTTTCT TtTAAATCAT TTTATATTTT CCCaCTAATA TCCGcTGtTA ATCaiATCCtG 60 

ACATCCtTGT ATCaCTATGA CAATTAATTG TTAAATACAT GAATTTCTAC ATTTTATGAA 120 

SO AAAATCCATT TTTATTACAA TTCAACACTT TATATGACAA CTTCATTACA GTTACTTTTA 180 

TTGTTGATTG CTTACATTGT TTTCTAAAAA AAATTTGTTA TCATAATTAA CGTTGAATAA 24 0 

AGAAAAAAAT TAACTTGGGA GATAAAAATG GAATATAAAA AGATACTAAT TCGTTTATTA 3 00 
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CAATCGCATG CCGCAGTTAA TTATTATAGT AAAAACCAAT GTACATGGTG GGCATTTAAA 420 

CGTCGCGCAC AAGTCGGTAA ACCTGTTTCT AATAGATGGG GCAATGCTAA AAATTGGTAT 4 80 

5 

TACAATGCAC GTAAATCAAA ATATGCGACT GGTCGTACAC CAAGAAAATT TGCTGTCATG 540 

CAATCAACTG CAGGATATTA TGGACATGTC GCAGTTGTTG AACAAGTATA TAAAAACGGT 600 

AGTATTAAAG TTTCAGAATA CAACTTTTAT CGCCCATTAA AATACAATAC ACGTGTACTA 660 

10 

AGCAAAAAGG CAGCACGTAA CTTTAACTAT ATTTACTAAT CAAAAAACTT CTATCACGAA 720 

CGCTTCAATT TCCTGTATGC GTGTGATAGA AGTTTTTATT TTATGAAATT ATATTATTAC 7 80 

TTCTACAAAT TTCAAATTGC CGTAATTGAA CGTATATTTC TTCTTCAACT ATTATTTCAT 840 

CTTTAGCATA ATCTATATAT AAAATTTTAT GCTATTATTT AAATAATTCG CTATAACTTA 900 

ACATACGTTT TCGATATAAA CCTTGTTCTA AATCTCAATA ATTTTTTGCT GTTTTCATCG 960 

20 TCATTAGTTA AAAAAATAAT TTAACTGAGT TTTTGAGACC TGTTTAAATT GATGACTCAA 1020 

AACCTTTATC CCCTTTTCAC TCGGTTTAAT TGCTTTAATA TTTAACACAG TCTCATTGTA 10 BO 

ATTTTGCTAC TAAGTTTGAA ATATTTCGAT TCGAATGTGA AcATCATTTA TTATTACTTT 114 0 

CGACAGCTAG AAAAATATTG TTAAACCAAA ACTATAATTA CCACTTTTAT ATAGAACTAT 1200 

ATATAAATTT TAACTTTGAG ACAGAACTGC TAGTCAGGTT TATGAATATA TTTCTTTAGT 1260 

TTACTTGaTA TACTTATTGG TAAATCATTA TTTATTTGAA GAGCATTTAT AACAAAAAGT 1320 

30 

ATAATCCCAG TTATAGCGAT TCCATGACCA TTTTTATCAG TCAAAGATTC ATCAACAAAA 1380 

ACTTTTTGAT AAATTTATAT ATTTGTATAA TTTATTATGG TAGATATCTA CACCCGTATC 144 0 

TATAACAGCT ACAGTTATCA TTGCGTCTTA TCCAAATAGT TTTTAAGAAA TAAATATAAT 1500 

35 

TCATTAATAT CATGGCTATT TGTAACGTCA GtGCTTaAGA CCTTGTCAAT TATTGTTTTA 1560 

TCTGTTTCTA AGAGCCCCTG ATTTTCTGCA TACATATTTT GTATATCGCT GTTAATCTCA 1620 

TTTAATTTTT TAAATTTTTT TCTAACCTTT TTCAAAATAA AATGCTCTTT AATTCCATCT 1680 

CTTAAATACA TATAAATCTC TATTATAATC TCCATAAATG TCATATCTCC TCTCTCAAAA 174 0 

AACATTATAC TCAATATATT AAAAATAAGA AAGGTmTT CAGAATTTTT AGTATTTTCA 1800 

45 GATATTTATT CTTAAGTATC TTATTATTAC TTCTTTTTAA CGCTAGTGGa ATAATTAATA 1860 

ATAATTGTAT TTAAATGCAA TGCTGTTGTT CTTTCATACT TACAAGCAAG TCATAAGAAA 1920 

TGAGAAATTA AATTCTTATA TGATGAATAA ATGATGACTG CAGAAGAAAT TGCTAAAAAA 1980 

GGAGTGGAGT GAGGCATTCT ACTGTTCATA TTGTTAACAA ATATCAAAAT GATAGGAACT 204 0 

TGAAGAATCT AAGACTGTTA AAATCXSAGAC AAGAAGAAAC TCGTTCCAAA TTTGAATCCT 2100 
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AAAACTTCTA TCACGAACGC TTCAATTTCC TGTATCCGTG TGATAGAAGT TTTTATTTTT 
^ GTTCATATTA ATTTATCTAA GCGCTACGAT GGaCTGACTT AAACTTTTCT TTAAAATCGC 

TATTCGCCAT TTACTATTGT TGTCTAATTT CTTGTAAAAT ATGTTCCGCT GCTTGTGTAT 

TTGCACGGGG TTCTTTTTTC AAAGCTTCAG CTACTTTAGC AATTTCATCA CCTTTTGCCC 2400 

CTACAACGAT AGCTAATGAT TTATATTGTA AACTCATATG ACCTTGTTGA ATACCTTCTG 24 60 

ACACAAGCGC GCGACATGCT GCAAAGTTTT GCGCTAAACC AACGGCAGCA ACTACATGAC 2520 

CTAATTCTTG TGCTGACTCT ACATTTAGTA GCTCTAATGA AGCTTTAGCA ATTGGTAATA 2580 

75 CmTGTACC ACCGCCAACG ATTGCCAATG TCATAGGCAC TTCAATTGTA CCAATCAATC 2640 

GTTGACGATC TTGATCGTAA CGCCATGTAG CAATACCACG ATACTGTCCG TCACGACTCG 2 700 

CGTATGCATG CGCACTTGCT TCTGCACCAC GCGTATCATT TCCTGTTGCT AAAACAACAG 2760 

CATGTATGCC ATTCATAACA CCTTTATTAT GTGTTGCTGC ACGATCAATA TCTACTTGGG 2820 

CCAATACAGA AGCACGTTCC ATTCGTTTGG CAACCTCTTC TCCAGTTCTC TCGCCCCTTG 2880 

CTAAATCTTT AACATCAATT TCGCCTTGAA CTTTAACAAC GGACGCTGTT GCATGATTC3G 294 0 

ATAAAATACT CATTAAAATG TCGCTTTGCG GAAATTCATT TTTTAAAAAT GCAGTTATGG 3000 

CCTCTAAAAT CGTATTAAGC ATATTAGCGC CCATAGCATC TTTCGTATCA ACAAATACTT 3060 

TTAAAGATAG TAACTGTTGC TCAGGAAATG TATCAATCGC TATACGTTGG TAACCACCAC 3120 

CACGCGCTTT AATAGAAGGA TATGCCTCAT CCX3CAATTTT ATGAATTTGC TTTTCTAAAG 3180 

CTTTAATGTC TGCTGATAAT TTTTCAGTAT CGTCAACGCC ATCAAAGACG ATTTGACCTA 324 0 

TCATAATACG TTCAGAAGAT ACCGTTTTAA ATCCGCCAGT CTGATTCACT AGCTTTGCAC 3300 

CATAACTAGC TGCAGCGACA ACTGAAGGCT CTTCCACCAT CATAGGTACA ACATATGCCT 3360 

TATCGTCCAC AATGATATTC GGTAATAATC CAACX5GGTAA TGCACCTTGC GCGATGACAT 3420 

40 TTTQPJ^TTPJK ACTATTGGCT ACTTCTTCAT CGATTAATGG ATGATTCAGT AAAATGTCGA 34 80 

ATTGTTCTTC TGATAACCAT TGCTTATCAA CCAATTGTTG TAACTTTTCT TTACGAGATA 354 0 

AATGTCGAAA ATTCTTATCT AAATTTTGCA TGGACGTACT CCITTTACTT CACATAATTT 3600 

TTAACATTTC AATCACTACT ATTTTTACCA CAAAATAACG TCATTCGTCT TAAAATTCAA 3660 

TTGAATAATT GTCGTTTTGA CTTTAAAATA AAACAAGGTA AATTAAAACX3 CTTACAAGAA 3720 

ACGACAAATC ATTTTTAAAT TTAGTATATT TCTTTGTATA AAATTAGCAT ATTCTGATAT 3780 

GATACAAGTG TTGCTTTTAT AAATTTGAAA GGATGTAAAA CCTTATGACA ATAGGTATCG 384 0 

ATAAAATAAA CTTTTACGTT CCAAAGTACT ATGTAGACAT GGCTAAATTA GCAGAAGCAC 3900 
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CTGTAAACCA AGACATCGTT TCAATGGGCX; CTAACGCTGC TAAGGACATT ATAACAGACG 4020 

AAGACAAAAA GAAAATTGGT ATGGTAATTG TGGCAACTGA ATCAGCAGTT GATGCTGCTA 4080 

AAGCAGCCGC TGTTCAAATT CACAACTTAT TAGGTATTCA ACCTTTTGCA CGCTGCTTTG 4140 

AAATGAAAGA AGCTTGTTAT GCTGCAACAC CAGCAATTCA ATTAGCTAAA GATTATTTAG 4200 

CAACTAGACC GAATGAAAAA GTATTAGTTA TTGCTACAGA TACAGCACGT TATGGATTGA 4 260 

ATTCAGGCGG CGAGCCAACA CAAGGTGcTG GCGCAGTTGC GATGGTTATT GCACATAATC 43 20 

CAAGCATTTT GGCATTAAAT GAAGATGCTG TTGCTTACAC TGAAGACGTT TATGATTTCT 4 3 80 

GGCGTCCAAC TGGACATAAA TATCCATTAG TTGATGGTGC ATTATCTAAA GATGCTTATA 4440 

TCCGCTCATT CCAACAAAGC TGGAATGAAT ACGCAAAACG TCAAGGTAAG TCGCTAGCTG 4 500 

ACTTCGCATC TCTATGCTTC CATGTTCCAT TTACAAAAAT GGGTAAAAAG GCATTAGAGT 456 0 

^0 CAATCATTGA TAACGCTGAT GAAACAACTC AAGAGCGTTT ACGTTCAGGA TATGAAGATG 4 620 

CTGTAGATTA TAACCGTTAT GTCGGTAATA TTTATACTGG ATCATTATAT TTAAGCCTAA 468 0 

TATCATTACT TGAAAATCGA GATTTACAAG CTGGTGAAAC AATCX^GTTTA TTCAGTTATG 4 74 0 

GCTCAGGTTC AGTTGGTGAA TTTTATAGTG CGACATTAGT TGAAGGCTAC AAAGATCATT 4 800 

TAGATCAAGC TGGACATAAA GCATTATTAA ATAACCGTAC TGAAGTATCT GTTGATGCAT 4 860 

ATGAAACATT CTTCAAACGT TTTGATGACG TTGAATTTGA CGAAGAACAA GATGCTGTTC 4 920 

ATGAAGATCG TCATATTTTC TACTTATCAA ATATTGAAAA TAACGTTCGT GAATATCACA 4 980 

GACCAGAGTA GTCGGTGTAT TTAAAACACA TATAATAAAA CCTAAAAGCA GCAGTAAGAC 5040 

CACTTCTAAT TGAAATCGTC TTACTGCTGT TCTCTATTTA TAACACTTCG TATTGAATGA 5100 

ATTCATTATG CCTATTTGAC ACATTATTGA AGTTTTCCTA ATGCCTGGAT CCTTTATACG 5160 

TTACGGCrrC GTGCTATGTT TTGGTACATA AAGCTTTGAC ATATCGATAT TCTCCAACTC 5220 

TAACAGCTTA ATTTTATTAT TAATCX3TTCC ACCX»ACCCT GTTAAGCTAC CCGTTTTACC 5280 

GACAACACGA TGACATGGCA CGATAATAGA TAATGGATTA CTTCCGACTG CACCTCCAAC 534 0 

CX3CTTGGGCT GACATTTTTG GCTTCSTTAAG CAGCTTGCCT ACTTTTTTGG CAATAGCACC 5400 

ATACGTTGTT AGAGTCCCAT AAGGAACCTG TCTTAATTCA TTCCAAACAC ACTGTTGAAA 54 60 

ATGACTACCT GTTGGCTTTA AAGGTATTGT GATTTCAGGA TTGTCACCTT TAAAATACGC 5520 

GTCTAACCAC TGTGTCGCCT CTCTAAATAT CGCTAAAGAC GTATTTTCTT CCCTAGTACC 5580 

ATCACCTTGT TGATTTTCAA ACAAAACAGC GGTCAGACTT ACCCCATCAC TCAAAAGTTC 564 0 

CAATCGTCCT ACAGGCGAAT CATAGTAACT CTTATACTCC ATAAAAATTC CCCCTTTTTC 5700 
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ATAAGTCGTC AATTACGTAT 


ATAAACACGT 


AATACCAGCT 


ATCACTTTGC 


TGCAATATAC 


5820 


5 


AGTTACATAT 


V. X X M*-. X 


GTGCTAACCT 


CTTACTTTGT 


AAACCAAATC 


TTAAATTAAA 


5680 




ATATTGAAAA 


X Va X vi^^A X 


CCTTAATATT 


TTATTAAACC 


TATAATTACT 


TATTAAAAAT 


5940 






AX X\JAXAAAG 


TTTTAAAAAT 


ATTCTGTTTT 


ATCACCTACT 


ATTAGTGGAA 


6000 


10 


A AfiT A r* A ATT 


/^/^ ft ft ffWII^qpm m 


ATAGTTTGCA 


TAACGCTTCA 


AAAGTAATTT 


CTTTTTTGTT 


6060 




± X V._n>l/\i\ 


ft ft ft ^p^«np n ft 
AAATTTAGAG 


GTGATGTTAT 


ATGAATAACG 


GTTTTTTCAA 


TAGCGACTTT 


6120 






TTCGAAGAAT 


GATGAAAGAT 


ATGCAAGGTT 


CAAATCAAGT 


CGGAAACAAA 


6180 


IS 


AACi J. ACT ATA 


TTAATGGTAA 


AGAAGTTTCA 


CCTGAAGAAC 


TAGCGCAACT 


CACACAACAA 


6240 






ACTCTGCTGA 


ACAAAGTGCG 


CAAgcTTTTC AACAAGCAGC 


ACAAAGACAA 


6300 






AAGGTGGCAA 


CGGCAATTAT 


TTAGAACAAA 


TTGGTCGTAA 


CCTTACGCAA 


6360 




/"» ft ft O ft »»T 


ACGGTTTATT 


AGATCCAGTC 


ATTGGTCGTG 


ATAAAGAAAT 


TCAAGAAACT 


6420 






TAAGTAGACG 


AACTAAAAAC 


AATCCTATAT 


TAGTTGGAGA 


AGCTGGTGTT 


6480 


25 


ft ft ft ft t'^^*r^ 

1 AAAAC X G 


CGATTGTTGA 


AGGTTTAGCA 


CAGGCAATCG 


TTGAAGGAAA 


TGTACCAGCA 


654a 


^ ft ft *T^/^ ft ft A /~» 

trV-AATCAAAG 


ACAAAGAAAT 


TATTTCTGTA 


GACATTTCAT 


CATTAGAAGC 


TGGAACGCAA 


6600 




X AX ^VjXVjVjTG 


CTTTTGAAGA 


AAATATTCAA AAATTAATCG 


AAGGTGTTAA 


ATCTTCACAA 


6660 


30 


AAX VjWV-vj X AL. 


TATTCTTTGA 


TGAAATCCAT 


CAAATTATCG 


GTTCAGGTGC 


CACAGGAAGT 


6720 


1 X Vk^AVjij X A 


GCaAAGGGTT 


ATCTGATATT 


TTGAAACCTG 


CATTAAGTCG 


TGGTGAGATT 


6780 




TT'X A IT* A TTV 


OT'/^/*^ft ft Ik ^ 

UTGCAACAAC 


ACAAGATGAA 


TATCGAAACA 


ATATTCTTAA 


AGATGCTGCA 


6840 


35 


X X t\t\S^\J\^\3\^J\ 


(jAX X X AATGA 


AGTGCTTGTT 


AATGAACCAA 


GCGCTAAAGA 


TACTGTTGAA 


6900 




ATT^TTA A A H/^ 
AX X 1 XAAAAw 


(j X A TTCGCG A 


AAAATTCGAA 


GAACACCATC 


AAGTAAAATT 


ACCAGATGAC 


6960 




fT*F A*n* A A A A/^ 


ft •iv'^nv "i^iv* ft 
w\ X v»TvjTTGA 


CTTATCAATT 


CAATATATTC 


CACAACGATT 


ATTACCAGAT 


7020 


40 


A A A a A^i'V^ 


A XV* xv» X"X AGA 


TATTACAGCA 


GCACATTTAT 


CTGCGCAAAG 


TCCAGCTGTC 


7080 




AT A & Arf'yp'TY^ 
VjMXAAAuX X\v 


AAAuTGAAAA 


ACGAATTTCT 


GAATTAGAAA 


ATGATAAACG 


TAAAGCAGTA 


7140 




Aij X X\aAAvr 


ft ft^PftVmk m «L «h m 

AATATAAAAA 


AGCTGACGAC 


ATTCAAAATG 


AAATCAAATC 


ATTACAAGAT 


7200 


^9 


AAATTAGAAA 


ATAGTAATGG 


TGAACATACT 


GCTGTTGCTA 


CAGTTCATGA 


TATTTCAGAT 


7260 




ACTATTCAAC 


GATTAACTGG 


TATTCCAGTT 


TCTCAAATGG 


ATGATAACGA 


TATTGAACGT 


7320 


SO 


TTAAAAAATA 


TTTCTAATCG 


TTTAAGAAGT 


AAAATCATAG 


GTCAAGATCA 


AGCTGTAGAA 


7380 


ATGGTTTCAC 


GTGCAATTCG 


CCGTAATCGT 


GCTGGGTTTG 


ATGACGGCAA 


ccxrrccAATT 


7440 




GGCAGTTTCC 


TATTTGTTGG CCCTACTGGT GTTGGTAAAA CAGAGCTTGC 


TAAACAATTA 


7500 
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GACACAACAG 


CTGTTTCAAA 


AATGATTGGT 


ACAACTGCTG 


GTTATGTTGG 


TTATGATGAC 


7620 




AATTCAAATA 


CGTTAACTGA 


AAAAGTACGC 


CGTAATCCAT 


ACTCAGTCAT 


TCTATTTGAT 


7680 


5 


GAAATCGAAA 


AAGCAAATCC 


ACAAATTTTA 


ACATTGTTAT 


TACAAGTAAT 


GGATGATGGT 


7740 




AATTTGACTG 


ATGGTCAAGG 


TAATGTCATC 


AACTTTAAAA ATACAATTAT 


TATTTGTACA 


7900 


1 u 


TCAAATGCTG 


GCTTTGGCAA 


TGGCAATGAC 


GCTGAAGAAA 


AAGATATTAT 


GCACGAAATG 


7860 




AAAAAATTCT 


TCCGCCCTGA 


ATTCCTTAAC 


CGCTTCAACG 


GCATCGTTGA 


ATTCTTACAT 


7920 




TTAGATAAAG 


ATGCATTGCA 


AGATATCGTC 


AACTTATTAT 


TAGACGATGT 


ACAAGTTACA 


7980 


15 


TTAGACAAAA 


AAGGTATTAC 


GATGGACGTT 


TCTCAAGATG 


CGAAAGATTG 


GTTAATTGAA 


8040 




GAAGGCTATG 


ATGAAGAATT 


AGGTGCACGT 


CCATTAAGAC 


GTATTGTTGA 


ACAGCAAGTA 


8100 




CGTGACAAAA 


TTACAGATTA 


CTATTTAGAT 


CATACAGACG 


TTAAACATGT 


GGATATAGAT 


8160 


20 


GTTGAGGATA 


ACGAATTAGT 


CGTAAAAGGT 


AAATAACGAC 


ACTTTAACAT 


ATCGCGCATC 


8220 




AAAAATGAGC 


ATCAGGTCGC 


CCTTGCCTGT 


GCTCATTTTT 


TTAATTATTT 


CCCTGGAAAA 


8280 




TGATTCGCTG 


TGTGCTGTTC 


TGTTCCACAA 


CAATCACGAT 


TAATGTCACA 


TGTACCACAT 


8340 


25 


TTTCCTTGTT 


TTGAACGCTT 


GAAAAATTTT 


ACTAGTGTAT 


ATAAGGCATA 


TCCGAAAATT 


6400 




GCTAAAAAAA 


TTAAAATGTT 


AATAATGACT 


GACACTTTAA 


CCACTCCTTA 


AACAAATAAA 


8460 


30 


TGTCCX3ACTT 


GATAAAAAAT 


GAATGTTAAG 


ACATATGCAG 


TGACTAGAGG 


ATAGGCAACT 


8520 


GCAAGTGCCG 


TCCATTTCCA 


TGAATAAGTC 


TCTTTACGGA 


TTGCTGCTAC 


TGTAGAAACA 


8580 




CAAGGAATAT 


ACAATAGTAT 


AAATATCATA 


AATGCATACG 


CAGATAGCGG 


TGTGAATTGA 


8640 




TTTTGAATCA 


CATTAACAAG 


GCCTGCATCA 


CCTGATGAAT 


AGATAATCGC 


CATCGAACTT 


8700 


ACGATAACTT 


CTTTTGCTAA 


AAATCCTGGC 


AcTAACGTAG 


CACCTGCTTG 


CCATGTTCCA 


8760 




AATCCGAGCG 


GTTGCACTAA 


CATACCAAAG 


AAACTACCAA 


CCATATGTAA 


AAAACTTTGA 


8820 


40 


TTGATATTCA 


CATTGATACC 


ATGTGGTCCT 


ACATAACTTA 


ATAGCCAAAT 


GACTACTGAG 


8880 




CCGCCAAAAA 


TAAATGTACC 


TGCTTTACGA 


ACAAAGCCCT 


TAGCCTTTTC 


CCAAGTACTA 


8940 




CGCCACAACG 


TTTTAATGGA 


AGGCACACGG 


TATGTTGGCA 


ATTCCACAAT 


AAAGATTGCA 


9000 


45 


TTATCATTTT 


TTAAAATCGT 


CTTAGTAAGT 


ACTGTACTGA 


CTAAAAATGC 


CATAATAATA 


9060 




CCTAAAACAT 


ACAGGCTTAA 


TACTACTAAA 


GATTGATTCT 


CTTTGAAAAA 


GATACCTACG 


9120 




AACAACGCAT 


ACACTGGCAG 


TCTAGCAGAG 


CATGACATGA 


ATGGTGCAAT 


TAATATCGTT 


9180 


SO 


GTTAAACGCT 


CTTTTTCATT 


TTCAATACTG 


CGCGCAGCCA 


TAATACTCGG 


TACATTACAA 


9240 




CCAAATCCGA 


TAATCATTGG 


TATAAAAGAC 


TTCCCGCTTA 


AACCGAACGA 


TTCCATAATA 


9300 



55 



1253 



EP0 786S19 A2 





AAAAAGAGCA 


CAACAATTTG 


TGGTACAAAG 


ACTAATACTG 


ATCCTACACC 


AGCAATAATG 


9420 




CCATCTGTAA TTAAATCTTG 


TAAAAATGGT 


ATAACACCAA 


GATAATTCAT 


AATCGTCTTC 


9480 


5 


ACACTATCTG 


TAAATGTACC 


ACCTATAAAT 


GCATCGAGTT 


GATCCGACAA AGGTGTGCCA 


7b40 




ATCCATGTAA 


ATGTAGTTTG 


AAAGATCAAC 


CACATAATTG 


CTAGAAAGAT 


AGGCATCCCT 


A f A A 


10 


ATATATTTAT 


GTGTTAATAT 


CTTGTCTATT 


CTAGAGCTGA 


AATATTGCTT 


ATCTTCATCT 




GGATACGTTA 


CCACGTCTTG 


CAATAACGTC 


TCAATATAAT 


GATTGCGTAT 


ACGCTCCATC 


Q "7 O A 




TCTCGACGAA 


CAGATACAGC 


CCCTACTTGT 


TCAGCAACTT 


GATCACGTAA 


ACTCGACAAT 


Q "7 O A 


IS 


TTATTTACAA 


CCTCTGAATT 


AAGTTCGTTT 


GCAATTTGCA 


TGTTATTTAA 


TAAGAATTGA 


7040 




ATCGCAATAA 


ACCTAGCTTG 


ATACTTATCA 


TGAGATGTCT 


CTGTCATTAT 


TATTTGACAC 


9 900 




ATATTTTTAA 


TTGTCTCTTC 


AATCTTCTCA 


CCATAATTGA 


TTTTAAAATG 


CGGTTGATAC 


Q Q /" 

9 960 


20 


CCTTCCCCTA 


GATGCTTTAT 


TTCGCCAAGT 


AAATATTTTG 


TTCCTTTGCC 


TGTACGTGCC 


1 A A O A 




ACAACTGGAA 


AAATAGGTGT 


TTTTAACTTT 


TTCATCAATT 


TATGATAATC 


GATTTTTATC 


1 A A o 




CCGCGCTTTG 


TAGCTACATC 


AATCATATTT 


AATCCGATGT 


ATATTGGTTG 


ATTAAGTTCT 


1 A 1 >l A 


25 


AACAATTGTA 


CTGTTAATTG 


CATATTTCTT 


TTTAGTTGAC 


TCGCATCAAC 


AATGTTAATG 


1 A A A 




ATTCCTGAAA 


ATGAATCGTT 


TAATAAATAG 


TCTGTCACTA 


CAGTTTCATC 


TTTAGAAATC 


T A O ^ A 




GGCGATAAAT 


CATATGTACC 


TGGTAAATCA 


ATTAATTGTC 


CTACATTTTC 


TTTAAGTTTC 


1 A "a O A 


30 


ccTAcri-riT 


TCTCTACCGT 


TACGCCACTC 


CAGTTGCCTA 


TATATTCATA 


CGAACCAGTT 


1 ft Q A 




AAAGCGTTAA 


ACAAAGATGT 


TTTACCAACA 


TTAGGATTTC 


CTAAAATACA 


ATAATTTTCC 




3S 


ATTCGTCCGG 


CTCCTATTCT 


TCTAATGCAA 


TAGAACAAGC 


ATCGCAATGT 


CTAATACTTA 




ACTGTTGTCC 


GTTTACTTCA ATAATACATG 


GCCCTTTAAA 


TAAACATTTT 


TGTTTAATCG 


X U 3 o u 




TTATGATAGC 


GTCATCTGTT 


AACCCAAAGG 


CACTTAGACG 


ATACAACATA 


TTCTCATTAG 




40 


CAATATCCAT 


TCGCTTTATT TTATAAGCCT 


TATTCATTTC 


ACCATTTTTA 


ATGTTTAACA 


1 ft C fi A 
X U O O U 




TACTATTTCG 


CTCTCCTATT 


AGAAATAATA 


ATCATTATCA 


CTTAAAAATC 


ATAACXrCTTA 


10740 




AAATTGTAGC 


TCGCAATACT 


TTATTTAAAT 


AATTTTCATT 


TTTCATGTAA 


AATTTGTGAC 


10800 


4S 


ATTGCAAAAA 


TGT 










10813 




(2) INFORMATION FOR SEQ ID NO: 365: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) 


SEQUENCE DESCRIPTION: ; 


5EQ ID NO: 


365: 








TGATGGATTA 


GCAGACATTT 


TACGAGCGAA 


TGGTTTCAAA 


GTGTTTGGTC 


CAAATAAGCA 


€0 


5 


AGCAGCTCAA ATCGAAGGCT 


CAAAATTATT 


TGCTAAAAAG 


ATAATGGAAA 


AATATAATAT 


120 




TCCAACTGCT 


GATTATAAAG 


AAGTTGAGCG 


AAAAAAGGAT 


GCTTTAACAT 


ATATTGAAAA 


180 


10 


CTGTGAATTG 


CCCGTTGTTG 


TCAAGAAAGA 


TGGGTTAGCT 


GCTGGGAAAG 


GCGTTATTAT 


240 


TGCAGATACT 


ATTGAAGCAG 


CCAGAAGTGC 


TATTGAGATT 


ATGTATGGTG 


ATGAAGAAGA 


300 




AGGTACTGTT 


GTATTTGAAA 


CGTTTri'AGA 


AGGTGAAGAG 


TTCTCGCTAA 


TGACATTTGT 


360 


15 


TAATGGTGAT 


TTAGCAGTAC 


CTTTCGACTG 


TATTGCACAA 


GATCATAAAC 


GCGCATTTGA 


420 




TCATGATGAA 


GGACCAAATA 


CTGGTGGTAT 


GGGGGCTTAT TGTCCmgTAC 


CACATATTAG 


480 




TGACGATGTT 


TTAAmACTTA 


CAAATGAAAC 


AATTGCACAw 


CCCATTGCAA 


AGGCAATGCT 


540 


20 


TAATGAAGGT 


TATCAATTCT 


TCGGTGTATT 


ATACATTGGT 


GCTATTTTAA 


CTAAAGATGG 


600 




TCCAAAAGTA 


ATAGAATTTA 


ATGCCCGTTT 


TGGTGATCCT 


GAAGCTCAAG 


TATTATTAAG 


660 




TCGCATGGAA 


AGTGATTTAA 


TGCAGCATAT 


TATTGATTTA 


GATGAAGGAA 


AACGTACTGA 


720 


25 


ATTCAAATGG 


AAAAATGAAT 


CTATTGTAGG 


GGTCATGTTG 


GCATCAAAAG 


GATATCCTGA 


780 




TGCATATGAA 


AAAGGGCATA 


AAGTAAGTGG 


CTTTGATTTA 


AATGAAAACT 


ArriUXJlTAG 


840 




TGGATTAAAG 


AAGCAAGGTG 


AT AC CTTTGT 


TACTTcAGGT 


GGTAGAGTTA 


TACTTGCCAT 


900 


30 


CGGAAAAGGT 


GACAATGTAC 


AAGATGCACA 


GCGAGACGCA 


TACAAAAAAG 


TATCACAAAT 


960 




ACAAAGTGAC 


CATTTATTCT 


ATCGTCATGA 


CATTGCGAAT 


AAAGCACTAC 


AACTTAAATA 


1020 


35 


AGTAAATTTA 


AAATACTAAG 
TTGTAGAGTC 


aTTAGCTATG 


AACGAATCTA 


TAACGATAGA 


I'l-rrrrcATA 


1080 




GcrrrriTAG 


TAGGACATTG 


ATTTCTGTAC 


CAAATTTGTG 


ATTATGCATA 


1140 




TGTAATACAA 


AAGAGGCGCC 


ACAACATGTT 


TGGATGAACA 


AAATAACATG 


TTTGTGGCAC 


1200 


40 


CTCTTTTGTT TAGTATGGAA TAAATGGTTT TCTTTTTCTA TACAATGAAT 


TTCTAATTTA 


1260 




GTATCTATAC 


AATTATGGAT 


AAAATTTAAC 


CTACACGACC 


AAGACGAACA 


TCATCTATGC 


1320 




CCGTGATGGG 


TAAGGTGATT 


GAACAATAAT 


ATGCCATAGT 


AATAATGGCA 


ATTAAAACTA 


1380 


45 


TAATAAAGAT 


TATATCTTTA 


TATGAGAAAG 


GTACGTTGTA 


ATAGTAAGTA 


CGAGGACCAT 


1440 




CTCTAAATCC 


TTTCGACTCC 


ATCGCAACTG 


ATAATTGATG 


TGCCTTTCTA 


ATATTTTGGC 


1500 




TTAATAGAGG 


TATAATTAAA 


TGCTTAAATC 


GCTTTAACCC 


TCTATAATTT 


GCCGCGTCTA 


1560 


SO 


TCATCTGATA 


GCGCAiTrrr 


AAAGATCTGC 


GAAgcTGTAt 


TAAAGAACTA 


ATCATTAAAG 


1620 




GTATCATACG 


AATGGCAGCC 


ATGAATGCAT 


AAGCAACTTT 


TGATTTAACC 


TTTAAATGTT 


1680 
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25 



30 



SO 



ATGAAATAGC AATXSGTTCrrr AATGATACAT GTAAACCACG AACTAAACTT TCTOTTXSTAA 
TATGGATAAA TCCGAATTTC AAAATTGTAT GGCTACCATT CCCGTATAAA ATCATGAACA 
CX3GAAGAGAG TAATGCAAAG CCAATACTTA TAGTTATAAA AATTCCTGTA ATTTTAAACT 
GAGTACCATT AAACATCAAT AAGAAAACTA ACATTAAGAT AGTCATATAA AGCATAAAAT 



1800 
I860 
1920 
1980 

CGAAATTATG CACAAATATA ATAAAGAAAA ATAGTATAAT TCCAAGAAAT AGtnTCGrTA 2040 

2100 



TAATGTTGAC ATCATGAACA AATGATTGCC GAACTTTCCA TTGCTCATAC ATTCGTATCA 
CCATCACAAT CTAGTAACGC ACCATCTGAA ATTTTAAGTC TTCTTGATGG ATAACGTTCA 



2160 

ATTATTTCAT CGTCATGTGT AaCCATGACA ATACTTTCTC CCAAATTAAT TCGCTTTTCG 2220 
AAAAGTTTGA TCAACTGGAA TGTAITATGG CTATCAAGTC CAAATCTCGO TTCATCTAAA 2280 
AAGATAATAT CAGCTTTAGA ACTTAGTGCG GTAgcTACGC TAAGGCGTCG TrTTTGACCA 
20 ATAGACAACT CATAAGGATG TTGATCTTTT ACATTITGTA AATCTAAAAG TTTTAAAAGT 

TGTATCGTTT CATCATCACT TTGATCTTTA GAAAGGTGAT TAAAATGAAT GTTAATrrCA 
TCATAAACCG AATTTGTTAT AAATTGTAAT TCTGGGTrrr GATAAACTAG GTACATGTGT 
TTTGCTGCAT GT-TTAATTTT TGTTAAACGC TGATTTTCAA AATAAACATC ACCTTGATAT 
TTAATCAATT GCATAATTGA TTCAAGCAAG GTTGTTTTAC CACTACCATT TGCCCCTCTA 
ATTGTAATCC ACTCACCTAG ACCAATTTCT AAATCTGAGA ATCAGAGCAA TGTTGATTTA 
CCGCGAATAA TACGTCCATT TTTAAATTGT AATAAGTCTG AGTTTCTTGT TGGAAAGTCA 
ACACGACrrC GTGCGAATTC CCATGCACGT GGATGCCACA CACCATATTC ACTGAGTAAA 
TGAACATACT TCTGTAATAT GATTTCAGGA CATTCATCGG CAA1X3ATATT TCCGTTATAA 2880 
TCCATCAAAA TGACGCGGTC GACATGATTC CAGATCTGTT TAACTTTATC TTCAACGATT 2940 
ACAACCGTTT GATCTTCCCA AAGTTCAATT AGTTTAGTCC ATAAATCTTC TGTTGCTTGA 3000 
ACATCTAACA TTGCTGTCGG TTCATCTAAA AACAATGTTT TTCATTCTTC AAGAATGGTT 3060 
TCAACAATTG CCAATTTCTG TTTCATCCCG CCACTTAAAT CTT1X3ATATA CGTTTCAGGG 3120 
GTAACATTTA AATTGACCAT ATTTAAAGCA TTGATAATTA ACGCATCCAT GTCTTCACCT 
GGTAATTGTC TATTTTCTAA AACGAATGCA AGTTCTTCGT ATACTPrTGG CATACAAAAC 
TGGCTATCAG GGTCTTGGAA AATAACGCCA CTTAATGGGT CAACGATTAG TTCATCATAT 
rrCATAGGTA ATTCAATTAA ATTAGGAACA ATACCACTTA ATACATTCAG AACrTOTACTT 
TTACCGCAAC CAGAAGGACC GAGTAAAAGT ACTTTTTCIT TCTCTTGAAT AGTGATATTT 
AAATGATCGA AAATTTTACG TTGACCACTT GGATATTTTA ATCGTAAATC ACTTACrrTT 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



3180 
3240 
3300 
3360 
3420 
3480 
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ATTTTGTTAC GCCTGTCTTA TCTAAAGCTT TTACTAAAAG GTAAGATAGG ACXSCCGGCGA 3600 

CTACTGCACC ACTAATTAAT CTAAATACGA TGAATAATGT TAAGTTCCAA CCTGCAACTT 3660 

CATTTAAATA ACCATAGAAA TAATCTATCG GGAAAGCCGC GATTGCTGTA CAAAAACCTG 3720 

CTAACATAGC TACCATAACT GAACGTGATT GATATTTAAA AATTGCAAAG ACAAGTTCAC 3780 

ACGCTAAACC TTGTATAAAA GCGTAAACGA TTGTCGGAAT ATCGAAACGA CCCATAATGA 384 0 

TAGTTTCGCC GGCACCTGCA GCAAATTCAG CCAGTAAAGC AATACCTGGT TTTGGAATAA 3 900 

TTAGATAGCA GACAATCGCT GCCATGAACC AAACCCCGTT TGTTAATTGT TCGAGGTGAA 3 960 

GGCCTGTAGC TTGCACACCA TTGTAAACAA ACCACCATAA ATTGTAAATA ACTGCGAATA 4 020 

CTACTGAAAT AAGTACGGTT ACTAGTATTT CAGATAGCTT TAAACCTTTT GACATTTTTA 4 08 0 

CATCCTCCTA ATAAAAAAAC GCACAACCAT CCATAGGAAA GTTATGCGTT CACAATATAT 414 0 

2Q ATTAQTAAAA CATATGTATA GTAACA C TTT CCTACGCTAG TTCAAGCTAG ATCAGGTTCA 4 20 0 

AAGGGTTTGA GGGCAAGCCT CATCTCAGTA TAAAACACCC CTAGTGTGTG CGATTTATTT 4260 

AATTAATTAT ACTGTAAGAC GTTTGTAAAC TTATGTCAAT AGGTTGTCTT CATGAAATTT 4 320 

25 CGTTTAATTC GATTTAAAAT TTATAATATT AGCATTGGAT TTAAATTGAA GATGTAGTAG 4 380 

GAATGTTAGT AATTAAAGAT ATAAAAATAT GTGACATGTA ATAATATTGA GCTGATAAAT 444 0 

GAAGAGGGAT ACTTATCAAT CATACCTCTT TAACAACAGT GAAGAACCCG TGCATAATGG 4 500 

CTTACGAATT ATAGTTTATA AGGAAGAAGA GGGATACATG CGCCGAGCAC ATGCATAAAA 4 56 0 

GCCCCTAACA ACTAAAAGTT GTAAGGAAGG AGAGGGATAC ATGCGCCGAG CACATGCATA 4 620 

AAATCCCCTA ACAACTAAAA GTTGTAAGGG GATTTAAATT AATTTAGTGT ATCTTGGATA 4680 

TCTTGTTTTG kTTGaTTAAT ATCTTCTGTT TTTTCTTCTT TTTTATCTTT TAATTTTTCT 4740 

TCAACTTCTT TAGCTTTTTC TGCTGCTTTT TTATTTTGAT TTTCATTAGA CATGATTAAT 4 800 

TCCTCCCAAA TTGGATAATT ATTTATATAT AAATCTTACC CGGTTGTACT TTCGTTAAAC 4 860 

TTTTCTAAGT CTATAGCACT ATTTATTCAT TTATCTAAAG ACAACAACAT TAGATTAATA 4 920 

TATAATGATT TTGAGGTGAA CATAATGTCT TTTCTTAGGA AACACGCCGA AATTATTTTT 4 980 

45 AGCTATTTAA TCGGTm%iK:GT TcACTCTTCA CTGGkcTCAT TATTTTAATT AACTTGCCAT 504 0 

TAATTAAACA ATTAAATGGT GGTAAAAAAG TTGATACACA TGTTCATAAT GTGTGGGAAT 5100 

TTCTGAATGC ATTTTTCAGT GAAATTATTA AAGTAATGAG TCGATTTATA GGTAATTTCC 5160 

CtATAGTTAG TGCAATTGTG ATAATTATAT TCGGTATTTT AGTTATGTTC ATTGGTCATA 5220 

CATTACTTAG AACTATTAAG TATGACTATG ATATTTCTAT CTTTTTCTTA GTTATCGGTA 5280 
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30 



35 



40 
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rrrrcATTAT tccatttaca attcatatag gatatatcgt ctataaagat gaattgaatc 


5400 




AGGAAAATGT AAAAAATCAT TTCATGTGGA TAATTCTGAG rTATGGTATA AGTTACTTAA 


5460 


5 


TTACACAAAT TGCATTGTAT GGCAGAATTG ATGCTAATCA AATAGAGTCA ATTGATATCT 


5520 




TAAGTGTCAA TGCTTTCTTT ATAATTATGT GGTTACTTGG TCAAATGGCT ArTTCGAATT. 


5580 


10 


TCTTGrrCTT GCGCCGAGCT TTACCTTTAA CAAAGCAAGA ATTAGGTGAA GAGGAGCCAG 


5640 




AA1TATCAAG AACAAGTAAA GGGAATGTCA CGAATCAAAC TAAAATTCAC TTGAAACAAC 


5700 




TCCAAGATAA GACTACAGAA TATGCACGTA AGACAAGAAG AAGTGTCGAT TTAGATAAAA 


5760 


IS 


TTAGAGCTAA AAGAGATAAA TTCAAAAAGA AAGTTAATGA TATTATCGAT ATTCAAGAAG 


5820 




ACGATATTCC TGATTGGATG AGAAAACCGA AATGGGTTAA ACCAATGTAT GTCGAACTAT 


5880 




TTTGTGGTGT CGTCATCTTT TTATTCACAT TTTTAGAATT TAATAATCGT AATGCATTAT 


5940 


20 


TTGTATCTGG TGATTGGAAA TTATCACAGA CACAATATGT TATTGAATCG GTTACATTAT 


6000 




TAATTCTGTT ATTCATTATT ATCGCATATA TCGCTACAAC GTrAACTITC CACTTGAAAG 


6060 




GTAAoi i-i-i-A TTATTTACAA TTATTTATGG GGAGCATTTT ATTCTTTAAA TTGTTAACGG 


6120- 


2S 


AATTTATAAA TATAATGATT CATGGACTAT TACTTTCAGT GTTCATTACG CCAACATTAC 


6180 




TATTAATGTT ATTGGCAATC ATCArTTCTT ATTCGTTACA ATTACGAGAG CGACCATAAT 


6240 


30 


TAAAAGCATT ATAAAAGTAC TATCTATTAA ACATTTTGAT GTGTACGCTA TAAGTTAGAT 


6300 


ATATCTCTAA CTTACTTAGA TACAGGTCAA TGAAGTTTAT GGATAGTACT TTTTTTGTAA 


6360 




CTAGATTTGA TTGATTCAGG TGATGTGAAT TAAGTATTGA rAATTGTATA CAAAGTITTA 


6420 


35 


GTGCAAATAA AATAGTTGAA AAGTTATCCA rTTGTAAAAT CAAGAAAACT AGTAAATAGT 


6480 




TGAAGCGACT TATGGaATTT GCGAAACGAT ATATAGTATT TCCTTTGTAG AAATTTmACA 


6540 




TATATCATTC AAATTACTAA TTTGTTAAAA TCAACAGTAA GATTAGAAGT AGATGATATT 


6600 


40 


GAAATTTGGC AAACAaTtTA ATCTATATAA AACTACAACG AAACACAGAA AGGAAGTTGT 


6660 




CAGATGAAAA TAGCAACTCT GAACAAAGGC AAAGAAACAA AATATTTTAA TGGATATCCT 


6720 




TTAATTGAAG AAGAGGATAT CTATTCACAA GATCATTTAA AAGAAGGAGA TATrrTTCAA 


6780 


4S 


ATTGTGACTG ATAAATCACA ATAT 
(2) INFORMATION FOR SEQ ID NO: 366: 


6804 


SO 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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IS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 
aaaAGAGACG CATTaAAACA AGCTATACAA ATTATCGATA AATTAACATG GGGTGTTTAG 60 
TGGTGGTTAA AGAAATTTTG AGACTATTAT TCTTACTAGC GATGTATGAG CTAGGTAAGT 120 
ATGTAACTGA GCAAGTATAT ATTATGATGA CGGCTAATGA TGATGTAGAG GCGCCAAGTG 18 0 

ACTTTGAAAA AATCAGAGCT GAAGTTTCAT GGTAATAGCT ATTATCATTT TTGAATTAAT 24 0 

TATATTAATG TGTGTAGCAA TAGCACTGGA GGTGTTGTAA ATATGTGGAT TGTCATTTCA 3 00 

ATTGTTTTAT CTATATnTTT ATTGATCTTG TTAAGTAGCA TTTCTCATAA GATGAAAACC 36 0 

ATAGAAGCAT TGGAGTATAT GAATGCTTAT CTTTTCAAGC AGTTAGTAAA AAATAATGGT 420 
GTTGAAGGTT TAGAAGATTA TGAAAATGAA GTTGAACX3AA TTAGAAAAAG ATTCAAAAGC 4 80 

TAAAGAGAGG CGTTGGCTTC TCTGCTCTAT CyAAAATAAT GAAAGGAGCC saACATGTTA 54 0 

2Q GaCmAAGtCA CTCAAATAGA AACAATTAAA TATGATCGTG ATGTCTCATA TTCTTATGCT 600 

GCTAGTCGTT TATCTACACA TTGGACTAAT CACAATATGG C^TCGTCTGA CTTTATGCAG 660 
AAGCTAGCAC AAACAGTTAG AACTAAAGAA GATTTAACTG AGTACAATAA AATGTCTAAG 720 
2S TCTGAACAAG CCGATATAAA AGATGTTGGC GGATTTGTCG GTGGATATTT AAAAGAAGGC 780 

AAACGGCGTG CTGGTCAAGT CATGAATCGT TCAATGCTAA CACTTGATAT CGATTATGCA 84 0 

GCCCAAGATA TGACTGACAT ATTATCTATG TTTTATGATT TTGCATATTG TTTATATTCA 900 
ACACATAAGC ATAGAGAGAT AAGTCCAAGA CTGCGTTTAG TGATTCCTTT AAAACGAAAT 960 

GTAAATGCAG ATGAGTATGA AGCTATTGGG CGTAAAGTCG CAGATATCGT TGGCATGGAT 1020 

TACTTCGATG ATACAACTTA TCAACCACAT AGGTTAATGT ATTGGCCTTC AACTAGTAAC 1080 

GATGCGGAAT TTTTCTTTAC CTATGAAGAT TTACCTTTGT TAGACCCAGA TAAAATATTA 1140 

AATGAATATG TTGATTGGAC TGACACATTA GAATGGCCAA CGTCTTCAAG GGAAGAGAGT 1200 

AAGACTAAAA GATTAGCAGA TAAGCAAGGC GACCCAGAAG AAAAGCCGGG AATTGTTGGT 12 60 

GCATTTTGTA GAGCCTATAC GATAGAAGAA GCTATAGAAA CTTTTATTCC TGATTTATAC 1320 

GAAAAACATT CTACTAACCG TTATACCTAT CATGAAGGTT CAACTGCAGG TGGATTGGTG 13 80 

TTATACGAAA ATAACAAGTT TGCCTATTCf CATCATAATA CGGATCCCGT AAGCGGTATG 1440 

CTTGTGAACA GTTTTGATTT AGTACGCATA CACTTATATG GTGCTCAAGA TGAAGAAACT 1500 

AAAACAGATA CTCCGGTTAA TCGACTACCT AGTTATAAAG CAATGCAGCa AAGAGCGCAA 1560 

50 AATGATGAGG TTGTTAAAAA GCAATTAATT AATGACAAAA TGTCTGATGC AATGCAGGaT 1620 

TTCGATGAAn GAGAAAATAG CGATGATGCA TGGTCTGAGA CGTTnGAAAT TACTTCGAAA 16 8 0 
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(2) INFORMATION FOR SEQ ID NO: 367: 




5 


\xi tofcUUbNCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 
l^; STRANDEDNESS : double 
(D) TOPOLOGY: linear 




10 


(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 






AAGATTAGGC ATCAAGACAG GGTCGCGATT GTTTGAAATC CCACATAGnA ATGATATTTa 


60 


IS 


CaTTATCAAT CCaAGTATGC GTAAATATCT TAATGTTTCA GTTGCTATTT CTAAGATTGC 


120 




ATTGCGTTAT AITCCACCTG AAGATTTACA CCAATATAGT ATTCACGAAT TTTTTATGGA 


180 




TGTTACTGAT AGCTATCATA GATTTAGTTC TACAGTACAT GCATTTTGCG AAAGACTTAA 


240 


20 


ACGTGAAATT TATGAAGAAA CAGGCATTTA TTGTACTGTG GGCATTGGTT CTAATATCTT 


300 




ATTAAGTAAA ATTGCTATGG ATGTTGAAGC GAacATAGTc AAAATGGTAT AGCTGAATGG 


360 




CGATATCaAG ATGTACCAAC GAAATTATGG CCAATTCmGC CCtTGCGAGA rrTTTGGGGT 


420- 


25 


ATTAATCGTC GAACAGAAGC CAAATTGAAT AAAAGAGGAA TTTTTACTAT AGGAGATTTA 


480 




GCGAAATATC CATATAAATT TTTAAAAAAA GAGTTCGGTA ITTTAGGTCT TGATATGCAT 


54 0 


30 


CTACATGCGA ATGGGATAGA TCAGAGTAAA GTACGTGAAA AGCACAAGAT CAGCAATCCA 


600 


TCGATATGCA AAAGTCAAAT ATTAATGAGA GATTATCATT TTGATGAAGC AAAAGTAGTA 


660 




ATGCAAGAGT TAATTGAAGA TGTTGCTAGC AGAGTTCGAG CAAGAAAAAA AGTGGCAAGA 


720 




ACGATACATT TTGCCTTTGG CTATAGTGAT GAAGGCGGTG TACATAAGCA ATATACTTTO 


780 


AAAGATCCAA CAAACTTAGA AAAAGATATT TATAAAGTAG TAATCCATTT CGCAGATAAA 


840 




TTATGTAATA AACAAGCACT ATATCGTACG CTAAGTATAT CTTTGAGTCA ATTTATTAAT 


900 


40 


GAGGATGAGC GACgTTAAGT CTGTTrGAAG ATGAATACCA ACGCAAACGT GACGAATGTC 


960 




TAGCTAAAAC GATAGACCAA TTACATTTGA AATACGGCAA AGGTATTGTG TCCAAAGCAG 


1020 




TATCGTTTAC AGAAGCAGGT ACAAAACACG GCAGATTAGG TTTAATCGCT GGACATAAAA 


1080 


45 


TGTAATGACT ATACGGTTTA AGTAATATAT AACTGTCATT CGTATAAAAT AAGTCTCTAA 


1140 




AGATAAATAT TTCATATATC ACAATAGATT TTCACAATAA TATCTAAGAA TACATGGAAT 


1200 




TTATCAAAAG AGACTTAATA ATTATTGGAT ATAACAATCA AAATCACTCA ATGCTTGCAT 


1260 


50 


ACCGCGTTCT CGGTCAGTAG GGTTTTTGAA ACTAATTTTT AAAGCACCGT ATATATCTTC 


1320 




GCGTACTTCT AAGATTCTTA AGTTGCTTAT AGATATGTTA TGTAAACTCA GGATATAAGT 


1380 
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TAGTCCACCT AGTTGTTTAG CGGGTAGTGC GTCGCGATAC GATTTAGCTT GGGCAAAAAA 
TGATAACAAT TTTTCAGAAT CATTGCTTTC AATTAGTCTt TCTAAATCTT GAAACTGACT 
TTTTAGCrGT CGAATCATTT CTAAAATATA CGTTTTATTA CTCAAGGTGA TATCTTTCCA 
CATTTGTGCA TTACTACTAG CTATACGAGT GATATCACGA AAACCACCAG CTGCAAGTTT 
ATTAACTAAA TGATGTTCTT GACCGTTCTT TTGACTAACA TGAACTAAAC TAGATGCAAC 
GATATGAGGT AAATGACTTA CGACGCTTGT TACGTAGTCG TGTTCTTCAG CAGTAGTTAC 
AATAAATTTA GCAAGAGTAG GTGATAACAG TTCTTTTAAC GTGTTTG 
(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 
AACAAAAGGC ATAAGTTCGT GAATTAATGC GTATACAAGG ATAAAGCTTA TAACAGTAGT 
AATTGTTGCT ATCAAACGAA CAACATATAT TCTATTTTCA GATAGCAATC GATTCACTAA 
TCGATAATTT ACGTATACAA GAATTAACAA CAGCACAATA TAAACAATAA TCATATTTGG 
CCCCATTTAA TTTTTAAATT TGTTTTCACA TCATTTTCTA CTTTATTATC ATACTAATTT 
TAAAGGCAAA GGTGGACATC GGCAACCTCT CGTAAACTAT TTATCAAAAA TAAACGTATC 
TCATTGTTAT GATATTTATA AATCAATTCG TTTTTATTAA AGTCTTTTTC AACAAGTTTG 
TCACTATCTA TTAAATAATC ACGCATGCAT CCTTTTAAGA AATCATCTTT ATAACTTGGT 
GTGTACCATT TTCCATCCTC TTCAATGACA ATGTTGCCAA TATCAAATTC AAGGACCTTG 
CCGTCCTCTG AAGT 

(2) INFORMATION FOR SEQ ID NO: 36 9: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 9: 
GTCCATATTT CCCGTCCAAC CAACTAAAAA TGGGTAAATC CCTGGATTTa AATCTACTTC 
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10 



IS 



20 



2S 



30 



35 



40 



45 



SO 



GATAATTTCn AATCTAGCGA CCATCATTAC GAGATGATIT GATATAAGCA CAGTTTGGAT 
GTTGACCAAT ACTATCGCCT TCTTCTTCGA TCATATCTAT TTTAATACCA TCATCAGCTG 
CAATTTCTAA TGAAGATTTA ATTCGGTTAT CAAATGTTGA ATATCCCATT GCTCCACCCA 
CAATAGCGAC ATCTGTACCA TGTCCTTGGT GTGTTTGAGC AAATGATTCA TAATAATGTA 
TTTCAATATT TTTAGGAATA TCTCCCAATA TTGCGCGTGC TGAATTCCCA ATCTTTACTG 
CACCAGCCGT ATGAGAACTT GAAGGGCCCA TCATAACTGG TCCGATAATA TCGAAAGCAC 
TTTGATAATC ATAGCTCTTT GCCATAATTA AACACTCTCC TTAATATGAT TCTTTTrGCT 
CGGCAriTTA AAGTrGATAT TCATTAAATT AAACTTATTA ATTAGTGTTC CAATAATATA 
GGCTAAGACA ATGCTGACAA TAATCACTGT TGCGATTGTT ACAATTGATG TCACTGCATT 
ATTAAAGCCA AACAATACGA TGGCGCCTGC AATTGGTGTT GCCATACCTT TGACACCTAT 
TACTAGTCCG CTAAATGTCA CGATACATGC GTTGACGACC CCAATCAGTG CAriTGTACC 
ATATAGTTGT ACTGGATATT GCGCTATTAA ATCAATTTGC GTCAATGGCT CAATACAAAC 
TGCAAATGCT TTTGACGGTC CACCAATGTT TAATTTTCGG AATAAAATAA GGTTAACAAA 
TGAGCTACCT GTACATGTTA GTGCTCCAAT AGCCATAGGA ACACCTGTCA GTCCTAATAA 
ACTTGTTAAT ACCATTGAAC TTAGCGGTGT CATACCTCTA ACAGGAATCA CTAGTCCTAA 
AATGACCGCT AATGCATATG GATTGTTATC ACCTACCGCA GTGACAGCAC TACCTArTTC 
TTTTAATGTT GCTAGCACAC CAGGTGTAAT GATTGATCCA AGTCCGAAAG CAATTCCTGG 
TGCAAATAAG ATCACCACAA TTAAGTCCAA GCCTTCTGGA ACTTTCTTTT CAATCCATTT 
AATTAAAAAA GcTACGCCAT AAGCTGCGAT GAATGCTCGT AATAATTTAA AGTCATGTAA 
TACTAAACCA ACAATGACCG CAAATACTGG TGCAACGCCT AAGTTTAAGC ACGTTAGAAT 
ACCTACTGCG ATACCGCTTA AACTTCCTGC TAAATCCCCA ATATCTTOTA GAAATTTAAT 
ATCAAATACG CCACCAATAG CATAACTTAA GAATGCTTGT GGTAGAAATG TCGCACAAGC 
TGCACCGGAT AATGCTTGTA GTCCTTGTTT ACCGTACGGT GCATACTTTA AAAATAGCGT 
CATGATCACT AAAACCAAGA CTAATGTGCC TACACCTAAC AGAATATCGA TTTCCCAAAA 
ACCTCTCTCT ATGTrTATrr TATTTTCAGA CCATAAACAT CGTACACCCT TAAGAAAACG 
TTTTCAACTT TTATCTGTTA TCAAATCaAA TATTTAAGTC AAATATTTCA TATTrGTCAA 
AGATTTTTAA AATGGATTGT TTCAAAAACT ACTTATGTTG TCGTTAATAT TTACTAATTA 
ACTTTTTACT CTATATTTCA AACAGTTGTG TGACAGTTTT TTGATAACTT TTTTACATCT 
GAAAGTAAGT AATATTTCTA AAAACTTTTA ATATTTATAC ACTTTATCTT TCGAGCTArT 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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35 



40 



45 



ACACATATAT 


TTGCAATAAG 


ATAATTAAAG 


TAGGATATTA 


TTTTTAGTTT 


TCTGATAGGA 


1980 


ATGATGATAG 


TCTATAGGTT 


GAATCTTTAC 


rrrrrriTAA 


AGCTAAATTT 


ACATCAACTT 


2040 


AACAATGGTT 


GGTTATACTG 


AAGATGAAAA 


TATTTAGTAT 


AACTTAGTGG 


AGGCGATAAA 


2100 


GGTGCAATTG 


AGTCATTCCG 


TTAAAGTTGC 


AATTTCTATC 


TATTTAGCAC 


TTATCTTTAT 


2160 


AACGTTCACT 


TCTTATTTAG 


TCATTATTTT 


ATATACGAGT 


ATGACTGGAC 


ATGATGTATC 


2220 


ACATTTCGTG 


TTAGATAGTC 


AGCATTCTCA 


TCATGGATCT 


CTTACGCAAA 


AACATTTGAG 


2280 


TCTTCCTGAA 


ATCTCATTTA 


AATAGTTTAT 


CCTCTGTGTT 


TCAACATTCA 


TTTCCCATAT 


2340 


CGATTCATTT 


ATCTATCATC 


TAGACCACTA 


CATCTTAGAT 


GATTTTrrrA 


TTTTCTCATT 


2400 


TCACTCTTTC 


TTAAAAGTCG 


ATATAATGAA 


TTAAATCATT 


ATCATACACC 


GACATATTTT 


2460 


ATGTTGTTGG 


TGTTAAGTTT 


aAAGGGGTGA 


GATACTTGGC 


GAATaATCaT 


TCAGCTTT 


2518 


(2) INFORMATION FOR SEQ ID NO: 370: 








(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 790 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS; double 
<D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 






ATACTACTGG 


ACCGTCnACC 


AATTGACAAA 


ACCTGAAAAC 


TGGATTTTTA 


TTGCAGGATT 


60 


ACATCGAACC 


TGGAAGAACA 


ATCCCCAGGA 


GTTGATGATA 


TTCGAGATAT 


GGCATACAAT 


120 


CAAGGTAGTT 


TAGATAAGAC 


AATTTATGAA 


ATTTCTAAAC 


GCACAGTACT 


ATTTTTAATA 


180 


CAGAAAGATA 


TTACGGTATA 


TAATAAGACG 


ATTGACTGTT 


TAAATTATTA 


TAACTATAGT 


240 


GACGAAAGAA 


TAAAGGATGA 


TTAAATGAAT 


TCACAAGAAT 


TATTAGCAAT 


TGCTGTGGAT 


300 


GCAATTGACA 


ATACCCCAGG 


CGAAGATACG 


ATTTCTTTAG 


AAATGAAAGG 


TATCAGCGAT 


360 


ATGACAGATT 


ATTTTGTTGT 


AACGCACGGA 


AATAATGAAC 


GACAAGTTCA 


AGCGATTGCT 


420 


AGAGCGGTGA 


AAGAAGTAGC 


CAATGAACAA 


AATATAGAAG 


TAAAACGTAT 


GGAAGGATAC 


460 


AATGAAGCGC 


GTTGGATATT 


AATTGACTTA 


GCTGATGTTG 


TGGTACATGT 


TTTCCATAAA 


54 0 


GACGAAAGAA 


ATTATTATAA 


TATTGAAAAG 


TTATATCAAG 


ATGCACCATT 


AGAATCATAT 


600 


AGTCAGGTTG 


CGTATTAATT 


ATGTCGCAAT 


ATGCAQAAAT 


GAGCCTAGTG 


TACGATCAAT 


660 


TGACTCAAGA 


TCAACCATAT 


GAAAAATGGT 


TTGAAATTGT 


AAAAAATCAC 


TGCAAAGATG 


720 


AATCAAATAT 


TTTAGATATT 


GGATGCGGTA 


CTGGTAGTTa 


ACAGTTCAAT 


TAGAAGCTTT 


780 
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(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 
ATAGATGAAG GTGCAAATAT TGAAaTAGGT TATTTACCTG GACGCTTGAA ATGGTTAGTT 
GCTGATTTAT TAACTAAACA AGGATTAAAA GTAGTTAACG ACGATATGAC AGGAAGAACG 
TTAAAAGATC GTAAATTATT AACAGGTGAC AGTCCTTTAG CTTCAAATCA GITAGGAAAA 
TTAGCAGTTA ATGAAATGTT AAATGCAATA CAAAATAAAT AATTAAATAT TAATTAGAGG 
20 AGCCTCATAT GTAAATGTAT GAGGGCTCTT TTTTTTGGCA AAATTTAAGT GATACTTCTA 

AAATAGAACC TATTATGAGT ATGATTTAAG AAAACGCTTG CAAAACTAAT AACCGCAACT 
AGCGATATGG AGGAAACATG ATGTCTTATA GCATTGGAAT TGATTATGGA ACTGCTTCAG 
25 GCCGTGTGTT TTTAATTAAT ACAACTAACG GTCAAGTAGT ATCAAAATTT GTGAAACCAT 

ATACACATGG TGTCATTGAG AGTGAATTAA ATGGTTTGAA AATACCACAT ACATATGCAC 
TTCAAAATAG TAATGATTAT tTAGAAATTA TGGAAGAAGG AATATCATAT ATAGTACGTG 

30 

AATCAAAAAT AGATCCAGAC AATATAGTAG GTATTGGTAT AGACTTTACT TCATCTACTA 
TTATTTTTAC TGACGAAAAC CTTAACCCGG TACATAACTT AMACAATTT AAAAACAATC 
CACATGCGTA TGTGAAACTT TGGAAACATC ATGGTGCATA TAAAGAGGCA GAGAAATTAT 
ATCAAACTGC TATTGAAAAT AATAATAAGT GGTTAGGCCA TTATCGATAT AATGTTAGTA 
GTGAATGGAT GATTCCCAAA ATAATGGAGG TCATGAATCG AGCACCAGAA ATTATGGAAA 
AAACGGCTTA TATTATGGAA GCGGGCGATT GGATTGTAAA TAAATTAACT AATAAAAATG 
TACGCTCGAA TrGTCGATTA GGTTTCAAAG CATTTTCGGA AGAAGAAACA GGGTTTCATT 
ATGATTTATT TGATAAAATA GACCCCAAAT TATCAAAAGT AATTCAAGAT AAAGTATCTC 

4s CACCGGTTGT TAATATTGGT GAAGCAGTAG GGAAACTGGA TCATAAAATG GCACAGAAAT 

TAGGATTATC AAAAGAAACT ATGGTAAGTC CTTTTATTAT TGATGCCCAT GCTAGTTTAT 
TAGGTATTGG GTCTGAAAAA GATAAAGAAA TGACTATGGT GATGGGAACA AGCACATCCC 

SO ATCTTATGTT AAATGAAAAG CAACATCAAG TGCCAGGTAT ATCAGGTTCT GTAAAAGGAG 

CAATTATTCC AGAATTATTT GCTTATGAAG CGGGGCAATC AGCAGTAGGT GAnTCTTTG 
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CTGTATTTGA ATTAATGAAT GAAAAGATAA AACATCAAAT GCCAOGTGAA AGTGGGCTCA 1500 

TTGCTCTTGA TTGGCATAAT GGAAATCGAA GTGTATTAAG TGATAGCAAT TTAACAGGTT X560 

GTATCTTTGG ATTAACTTTA CAAACTAAGC ATGAGGaTAT TTATAGAGOn TATTTaGaAG 1620 

CTACaGCATT TGGTACtAAG ATGATTATGC mACAGTATCA AGATTGGCAT ATGGaAGTAG 1680 

aAAAGGtATT TGCaTGTGGc gCTAtACCtA AAAAGAATGC TGTTATGATG GATATCTATG 174 0 

CGAATGTACT GAATAAAAAA CTAATTGTTA TGGATAGTGA GTATGCACCA GCAATAGGCG 180 0 

CAGCAATATT AGGTGCAGTC AGT 1823 
(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 

25 ACGATCATCT GCATCAGCGA ATTCCGATGc AaTTcATATG tCCtAATAAC AAAAGAATAG 60 

GTTTTaAAAG ATATGCATAC CGTAATGaTA GATATAGTTT TAAACGTGAC TTCAAGCTAT 12 0 

ATGaATGTGA TGACTGTTCA TCATGTTCTT TGAGACATCA ATGCATGAAG CCAAATTCGA 180 

AATCCAATAA GAAAATTATG AAGAATTATA ATTGGGAATA CTTTAAAGCC CAAATTAATC 24 0 

AAAAGCTTTC TGAACCAGAA ACGAAAAAAT CTATAGTCAA AGAAAAATTG ATGTAGAGCC 300 

TGTTTTTGGA TTTATGAAGG CTATTTTGGG TtTCACTCGA ATGTCAGTTC GGGAATAAAT 36 0 

35 

fiPAGTIMAC GAGAGCTAGG TTTTGTATTA ATGGCACTTA ATATAAGGAA AATAgcaGCT 420 

CAACGAGCTG TACATTATAA AATACATATC AAAAAAGCTG ATTTCTATCA AATAAATAAT 480 

AGAAATCAGC TTTTTTACAT TGCCTAAGAA CTTTAAGGAA CTTAATGTCC CAAGCTCTTT 54 0 

40 

TTTGTTATAT CTAATTCGTA ATTTATGATT GTTTATTCGG TCCTTTGATG TTCACTAAAT 600 

GTGACTTTAA ATCTTGTTCT AATTGTTGTA ATTCTTTTTC AGCTAATTGT CGTrCTTCGC 66 0 

GACCGTGTTG TTGAATAATT AATGTTTCTT CAATTGTCTC AATAATGTTA CGCTGTGTAC 72 0 

GTTTCAATGT ATCAAGATCA ACAATGCCAC GCTCATTTTC TGTTGCAGTT TCAATCGCAT 780 

rrrGTTTCAA CA'ITTCAGCA TTTGCTGTTA ATAAATCATT AGTTGTATCA GTGACAGCTC 84 0 

so GTTGTGCAGC AACTGCATTA CGCTGTCTCA TTAATGTAAG CGCAATGGCC ATTTGATTTT 900 

TCCATAGTGG AATACTTGTC AAAATTGAAC TTTGTATCTT CTCGGCAAGT GCTTGATTAA 960 
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AGTCATATAT GCGTTTATCT AGTCTATCTA TAAATTCCTG CATATCTGCA ACTTGTTGAA 
TATCCATTTG ATTAGTGGAt TgtGCGCTTG CTGTTGCAAT TGTGGTAGCT rrrCATTrrc 
TAATTGCAAC rTTTTTTGcT GTGCAGCAAT GATATGCAAT GATAAGTCAT CAAAGTATTG 
TTTGTTTTTA TCATATAGCG TATCTAATAA TTCAATATCT CTTGTTAAAT GTCTTTCATG 
TTTCTGCAGT TGAATCGTTA TGCGATCGAC TTGAGCACTA ACTGATTGCA TTCTTGAAAA 
GATTTCATTG ATAGACGACT TTGCTCTGCT AAAAATTCTT ITTAACATAG ATGGTrTATC 
AGTATTTAAC TCATTTGGAT TAACTGACTT TAGTTTTGAC ATTAGATCTG ACAAAGTATC 



1560 
1600 



1080 
1140 
1200 
1260 
1320 
1380 
1440 

TCCAATAGGA CCAACATCTT TACTTTGTAC TTCATCCAAC ATTTGATGTG AAAATTGAGA 1500 
CAITTGTTTC TGkAAATCAG AACCAAACGC TAATAAACCT TCATTGTCTA AAGGGTTAAT 
TTGTTTACTG ATTGTGTCTA CCTGTTTTTG TTGTTCAATT 
20 <2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

30 

ATGTTGATAA TGGTAGCTTC TACAAGAATA AAGACCAACA AGTTGGTGCA ACAATTCTTG 
aTAGTAAAAC TGGTGGTTTA GTTGCTATAT CTGGTGGACX3 TGATTTCAAA GACGTCGTTA ' 
ACAGAAACCA AGCAACAGAT CCTCACCCTA CTGGTTCATC TTTAAAACCT TTCTTAGCGT 

3S 

ATGGACCTGC CAITGAAAAT ATGAAATGGG CAACAAACCA TGCGATTCAA GATGAATCTT 
CATATCAAGT TGATGGTTCT ACATTTAGAA ACTATCATAC GAAGAGTCAC GGTACTGTAT 
CTATTTATGA TGCTTTACGA CAAAGTTTCA ATATCCCAGC TTTAAAAGCT TGGCAATCAG 
TTAAGCAAAA TGCTGGTAAT GATGCACCTA AGAAATTCGC TGCCAAACTT GGCTTAAACT 
ACGAAGGCGA TATTGGTCCA TCTGAAGTAC TTGGTGGTTC TGCTTCAGAA TTCTCACCAA 
45 CACAATTAGC ATCAGCATTT GCTGCAATCG CTAACGGTGG TACTTATAAC AACGCGCATT 

CAATTCAAAA AGTAGTTACT CGTGATGGTG AAACAATCGA ATACGATCAT ACTAGCCATA 
AAGCGATGAG TGATTACACT GCATACATGT TAGCTGAGAT GCTAAAAGGT ACATTTAAAC 
CATATGGTTC TGCATATGGC CATGGTGTAT CTGGAGTAAA TATGGGTGCT AAGACAGGTA 
CTGGTACTTA CGGTGCTGAA ACTTATTCAC AATATAATTT ACCTGATAAT GCAGCGAAAG 
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AAGTTAAACA ATATGGTGAA AACTCATTTG TrGGACATAG CCAACAAGAA TATCCACAGT 900 

TCTTATATGA AAATGTGATG TCAAAAATTT CATCTAGAGA TGGCGAAGAC TTTAAACGTC 960 

CTAGCTCAGT AAGTGGTAGT ATCCCATCAA TCAATGTTTC TGGTAGTCAA GATAACAACA 1020 

CTACAAATCG TAGTACACAC GGTGGTAGTG ACACATCAGC AAACAGCAGT GGTACTGCAC 1080 

AATCAAATAA CAATACTAGA TCTCAACAAT CTAGAAACAG CGGTGGATTA ACAGGTATAT 114 0 

TCAACTAATC CACTCAACAT AAAATCCTCA GTTATACCAT ATTTATGGTG TAGCCGAGGA 1200 

TTTTnTTAGG TTCTTCATCT TTTATGG 1227 
(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 



2S 


CCATATGGtG 


CAACATTCTT 


CGTATTTAGT 


GATTATTTAA 


AACCAGCGTT 


ACGTTTATCA 


60 




TCAATTATGG 


GATTAAATgC 


aACGTTCATC 


TTCACACATG AyTCaATTGC 


AGTAGGTGAA 


120 




GATGGTCCTA 


CTCATGAACC 


AATTGAGCAA 


TTAGCTGGAT 


TAAGAGCCAT 


TCCAAATATG 


180 


30 


AATGTTATCC 


GTCCTGCTGA 


TGGTAATGAA 


ACAAGAGTAG 


CATGGGAAGT 


TGCCTTAGAA 


240 




TCTGAATCTA 


CACCTACTTC 


ATTAGTATTG 


ACACGTCAAA 


ACTTACCGGT 


ATTAGATGTA 


300 




CCAGAAGATG 


TAGTTGAAGA 


AGGCGTTCGA 


AAAGGTGCCT 


ATACAGTTTA 


TGGCTCTGAA 


360 


35 


GAGACACCAG 


AATTCCTATT 


ATTAGCTTCA 


GGTTCAGAAG 


TTAGTCTTGC 


AGTTGAAGCT 


420 




GCTAAAGATC 


TTGAAAAACA 


AGGTAAATCA 


GTACGTGTTG 


TTTCAATGCC 


TAACTGGAAT 


480 


40 


GCATTTGAAC 


AACAATCTGA 


AGAATATAAA 


GAATCAGTTA TTCCATCAAG 


CGTAACAAAA 


540 


CGTGTTGCGA 


TTGAAATGGC 


TTCACCGCTT 


GGATGGCATA 


AATATGTAGG 


TACTGCAGGT 


600 




AAAGTTATTG 


CTATTGACGG 


CTTTGGCGCA 


AGTGCACCTG 


GCGATTTAGT 


AGTTGAAAAA 


660 


45 


TATGGATTTA 


CAAAAGAAAA 


TATCTTAAAC 


CAAGTTATGA 


GCTTATAAGA 


ATAATTTATA 


720 




AAGCGAGTAT 


GTTTAGAAGT 


CTAGGATGCA 


TAATCTTAGG 


CTTCTTTTAA 


AGTGTTGAAA 


780 




TTTAGAGTAT 


AGCACTTAAA 


CTACATCATA 


AGTGATAAGT 


TATGAAAGTA 


TACTATTTCA 


840 


SO 


GATTAATCTT 


TAAAAGCTCT 


GTTATAACAG 


CATGATTTTT 


GATATTATTT 


TTAGTATCGA 


900 




TATTAAAATA 


CTTGAATAAA 


CTAGTTCTTG 


AAATAATGTG 


ATGAATTTAG 


TAAAATTCAG 


960 
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TGCATTAATT TTAGGTTTAA TTGGAGGTTT CCTTTTAGCT AGAAAATATA TGATGGACTA 
CrrOAAGAAA AACCCACCAA TCAACGAAGA AATGCTTCGT ATGATGATGA TGCAAATCGG 

TCAAAAACCT TCTCAGAAGA AAATTAATCA AAaxy.TX5ACG ATGATCAATA AAAATAT^ 
TCAAAATATG AAGAGTGCGA AAAAGTAAAT TCGCAATTGA TAGAGGCTAT TTTCCAGATA 
TGGAAATGGC CTCTrTTTAT AATCAAATTA ATAAGAATAA ATATGTTTAT TAAAATTAAA 
GTTAACAAAA TGACGAATAG ACTGAGAAAT GCTATAATTC ATrOGTATG ATTTACAGAG 
AGTTTATTTA ACGAGAAGGT GTCyGCGTGC TCTATTTAAT ATTrTCAATC ATTCTAGCTT 
-TATTTATGGG AACTATAGTT ATAGTTATTC GTATGAAAGC TCAAAATTAT CCGGTAAAkG 
AGAAAAAAAT AGTTTTGCCa CCgTTTrTTA tGGCgACCGG TCCATTGATG TACGTCGTTC 
CaTATTTTAG GCTAACAGGA TCGGAAATGC TAGAAGCCTT TATAATTGGT TTGCTTTTTT 
CtACAGTTCT AATTTGGACT TCTCGATTTG AAGTCAAAGG TACAGAAATT TATATGAAAC 
GATCTAAAGC ATrrcCAGTT ATTTTGA-ITT CATTACTTAT CATTCGTACT GTGATGAAAA 
TATTCATTAG TAATGAAATA GATCCTGGAG AATTAGGCGG CATGTTCITT TTATTAGCAT 
TCTGTATGAT TGTTCCTTGG AGAGCAGCAA TGCTATATAA ATACAAAAAA CTAAAGAAAA 
CATTAATCAA TTAATrACTT TTaAAACCAC TTGTGATCGA CTTCTAAATC AGTCAATGAG 
TGGGTTTAAT nTTACTTGGA AAAGGnGGAA AGG 

30 

(2) INFORMATION FOR SEQ ID NO: 3 75: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3787 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : do\ible 

(D) TOPOLOGY: linear 

<^i> SEQUENCE DESCRIPTION: SEQ ID NO: 375: 
ACATTTGATC AAnTTATCGA CATTAAAGAT GAATTCAnTT GATCGTnTCA ATXSATTATCC 
TGTTGAAGTA GCACGTTTGC TTGATATAGT GGAmATaAAA GTACACGCAT TACATTCAGG 
,5 TATCCACGTT GATTAAAGAT AAAGGGAAAA TAATTGATAT TCArTTATCT GTAAAAGCCA 

CTGAAAATAT TGATGGCGAA GTGCTGTTCA AAGCAACACA ACCTTTAGGT AGAACAATGA 
AGGTTGGTGT TCAAAATAAT GCAATGrCAA TTACTTTAAC GAAACAAAAT CAATGGCTTG 
ATAGTTTGAA GTTTTTAGTT AAGTGCATTG AAGAAAGTAT GAGAATCAGT GATGAAGCAT 
AAAGAAGCAT TTAATGGCGT TGTCGTGTTA ACTGCTCCAT TAATTGTCAT TAAAATTCTG 



SO 



so 

120 
180 
240 
300 
360 
420 
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40 



4S 



SO 



CAACAAGTGT 


ATCCAATTGT 


AGCATTAGGA 


ATGATATTAT 


CGATGAATGC 


CATTCCTAGT 


540 


GCAATTACAC 


AAAATATAGG 


GAAGTATCAT 


AGTGACGAAG 


CATATGCAAA 


AGCAGTCGCT 


600 


TATATACAAT 


TAGTTGGTAT 


ATTATTATTT 


ATTGCTATTT 


TTGTGTTTGC 


GAACAATATT 


660 


GCACATATGA 


TGGGTGATGG 


CCATTTAACA 


CCAATGATTC 


AAGCTGCAAG 


TTTAAGCTTT 


720 


ATATTTATAG 


GTATGCTTGG 


CGTGTTAAGA 


GGTTATTATC 


AATCTGCAAA 


TAATATGACA 


780 


GTTCCGGCTA 


TTTCCCAGGT 


TATAGAACAA 


GrTATACGAG 


TAGGTATTAT 


CATTGTTACT 


840 


ATTGTTATTT 


TTGTAGACAG 


AGGTTGGACG 


ATATATGAAG 


CGGGAACAAT 


TG CT ATTTTA 


900 


GCATCAACGA 


TAGGTTTTTT 


AGGTTCTTCA 


ATTTATTTAG 


TAGCGCACCG 


ACCrriTAAG 


960 


TTTAAAATGG 


TAAATAACAC 


TGCAAAGATC 


GTTTGGAAAC 


AGTTCGCACT 


TTCGGTTTTG 


1020 


ATTTTCGCTA 


TCAGTCAATT 


AATCGTAATT 


TTATGGCAAa 


TGATTGATAG 


TGTTACTATT 


1080 


ATTAAGTCAC 


TTCAAGCGAT 


ACGCGTGCCA 


TTCGATGTTG 


CCATAACTGA 


AAAAGGAGTC 


1140 


TATGACCGTG 


GTGCATCATT 


TATTCAGATG 


GGATTGATTG 


TAACTACAAC 


ATTTAGTTTT 


1200 


GCGCTCATTC 


CTCTGTTAAG 


TGACGCAATC 


AAAATGAATA 


ATCAGGTACT 


TATGAATCGT 


1260 


TATGPUU^TG 


CGTCATTAAA 


GATTACGATT 


TTAATAAGTA 


• 

CAGCAGCGGG 


AATAGGATTA 


1320 


ATTAATTTAT 


TGCCTTTAAT 


GAACGGTGTG 


TTTTTTAAGA 


CGAATGATTT 


AACCTTAACG 


1380 


TTAAGTGTTT 


ATATGATTAC 


GGTCATTTGT 


GTATCGTTAA 


TTATGATGGA 


TATGGCATTA 


1440 


TTACAAGCGC 


AACATGCTGT 


GAGACCTATT 


TTTGTTGGTA 


TGACGGCAGG 


ATTGGTTATT 


1500 


AAATTTATAC 


TTAATATCAT 


TTTGATTCGT 


TTAAGTGGCA 


TTATTGGTGC 


GAGCATTAGT 


1S60 


ACTGTTGTAT 


CATTAATTAT 


ATTCGGTACG 


ATTATCCATA 


TTGCTGTCAC 


GAGAAAATAC 


1620 


CACTTATATG 


CGATGAGACG 


ATriTri'ATC 


AATGTTGTTT 


TAGGTATGGT 


ATTTATGTCG 


1680 


ATTGTTGTTC 


AATGCGTGTT 


AAACATAGTG 


ACAACACACG 


GTAGAATCAC 


TGGACTCATT 


1740 


GAATTATTAT 


GTGCAGCAGT 


ATTAGGTATC 


ATTGCATTGT 


TTTTCTATAT 


TTTTAGATTT 


1800 


AATGTTTTGA 


CATATAAAGA 


GTTAACTTAT 


TTACCATTTG 


GTTCAAAGTT 


GTATCAAATT 


1860 


AAGAAAGGAA 


GACGTTGATG 


GCACATACCA 


TTACGATTGT 


TGGCTTAGGA 


AACTATGGCA 


1920 


TTGATGATTT 


GCCGCTAGGG 


ATATATAAAT 


TTTTAAAGAC 


ACAAGATAAA 


GTTTATGCAA 


1980 


GAACGTTAGA 


TCATCCAGTT 


ATAGAATCAT 


TGCAAGATGA 


ATTAACATTT 


CAGAGTTTTG 


2040 


ACCATGTTTA 


ItSAAGCACAT 


AACCAATTTG 


AAGATGTCTA 


TATTGATATT 


GTGGCGCAAT 


2100 


TGGTTGAAGC 


TGCTAATGAA 


AAAGATATTG 


TCTATGCGGT 


TCCGGGTCAT 


CCTAGAGTTG 


2160 


CTGAGACAAC 


TACAGTGAAA 


TTACTGGCTT 


TAGCAAAGGA 


CAATACTGAT 


ATAGATGTGA 


2220 
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50 



ATC3ATC3GCTT CACACTGTTA GATGCGACAT CATTACAAGA 
CGCATACATT GATTACGCAA GTTTATAGTG CAATGGITGC 
TAATGGAACG ATATCCTGAT GATTACCCTG TTCAAATTGT 
GTGCGGATAA CGTTGTGACA TGCCCATTAT ATGAATTGGA 
ATAATTTGAC GAGTGTArrC GTACCAAAAA TCATAACATC 
TTGATTTTGC AACGGAAGTG ATTGATACTT TAGTTGATGA 
ATAAAGTGCA AACGCaTGmA AcgCTAAAGC GTTATTTACT 
TCGAAGCTAT TGACAATGAA GATGATTGGC ATATGATTGA 
TACAAGTGTT ATTGCATACT AGTATTGGTA AAAAAGAAGG 
TGATTACAAG TCTTAATGCT AAAATGATTC GTAGACACCC 
ATGCTGAAAC TATCGATGAC TTAAAAGAAA TTTGGTCTAA 
AACAGCCAAG AGTTAAATTT GAAAAAGTAT TTGCAGAGCA 
AGACGAAGGA TAAGTCATTT GATGAGGCCG CGI-TAAAGCA 
GTAATACATG AGATTAGATA AATATTTAAA AGTATCACGG 
AGCAAAAGAA GTAAGTGATC AAGGTAGAAT TACAATAAAT 
ATCGGATGTT AAAGTTGAAG ATGTGCTGAC GATrCGCTTT 
TAAAGTAACT GCATTAAATG AACATGCATC TAAAGATAAC 
CATTGAAGAG CGTCGACTTG AAGAAGCGTA AATTGGAGGT 
GTAGAACATA TAGAAAATCA GTACACGTCG CAAGAGAACA 
ATGAAAATGC GTGTTGTTCg TAGGCGTATT ACAGTATTO5 
ATTGTTGTTT TATCaATCTT GCTTGTTGTC CAAAAACATC 
GAGCGAAAAG CGAAAGAAGC ACAGTTTCAA AAGCAACAAA 
GAAAAGTTGA ATAATCTGAA TGACAAAGAT TACATTGAAA 
TACTTAAGCA ACAAAGGTGA AGTGATTTTT AGGTTGCCAG 
TCAAAATCTT CGAAAAAATA AATCCAAATT GATTCAAAAT 
GAAAAAA 

(2) INFORMATION FOR SBQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



AGTAACACTT 
TGCTAATTTG 
CACTGGTGCA 
TCATGATGAA 
GACATATTTG 
AGATAAAGGT 
TGAAGAAACA 
AGAACTAGGA 
GTATATCGAC 
ACACATATTT 
GGCGAAAGAT 
TTTTTTAAAT 
GTGGCTAGAA 
TTAATAAAGC 
GGTAATGTTG 
GGTCAAAAAT 
GCGAAGGGTA 
GACAAGCAAT 
AGAAAAAACA 
CGGGCGTATT 
GCAATGATAT 
ATGAAGAAAT 
AAATTGCGCG 
AAGACAAAGA 
TATCCGAGTA 



AATGTTAGAA 
AAAATCACTT 
CGAAGCGATG 
AATGCATTCA 
TATCATGACT 
TGTCCATGGG 
TTTGAATTGT 
GATATTTTAT 
ATTAAAGAAG 
GGTGATGCCA 
GCTGAAGGTA 
TTATATGAGA 
AAAGGGGAGA 
GACGTACGCT 
CTAAAGCTGG 
TAGTAACAGT 
TGTATGAAAT 
GAAAAATAAA 
ACGTCAAAAA 
aCTTGCGATA 
TGATGCACAG 
TGCGTTAAAA 
TGATGATTAT 
TTCGTCTAGC 
TAGACATTGT 



2340 

2400 

2460 

2S20 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3787 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

^ TAAACCATTT CAAACTGAGG AACGCnAAGA CGGsACgTTT CCAGATTTAG AAGTATTTAA 60 

AAATGAATGT GATTTAAGCT ATGACATAAC GTCACTTTAT ACTTTTAAGC AACCTGTATC 120 

ACCACACCTT GCATTTAAAA TGACAGATCA AATTTTTCTA AATAAGCAGC GTGTATTAGA 180 

10 

TAAGGTAAAA GTTTTAGATA AGGAATTTGA TTTTATCTTA ATTGAGGGTG CTGGGGGAAT 24 0 

TGCCGTACCA ATATATGAAG GTACAGATGA TTTCTACATG ACTAAAGATC TAATCAATGA 3 00 

TTGTGCAGAT TGTGTCATCA GTGTGTTGCC ATCAAAATTA GGTGCTATTA GCGATGCCAT 360 

15 

TGTTCACCAA GATTATGTTA ATCAGAATGT ATCGGCGAGT AATTTTTTAA TAATGAATCG 420 

CTATACAGAC AGCTATATTG AAAAAGACAA TCAAATGACG ATTGGAAAAT TAACAAATAA 4 80 

^0 AACAGTCTAT ACATTTGAAG AACATGCCAC GTATGAAAAT TTCTCAGAAG CATTTTTAAA 54 0 

ACAATTAATA GGAGTTAAAA ATGAATTACA CACAACAACT TAAACAAAAA GACTCAGAAT 600 

ATGTTTGGCA TCCATTTACA CAAATGGGTG TATATAGCAA AGAAGAAGCA ATCATCATTG 660 

25 AAAAAGGAAA GGGTAGTTAC CTTTACGATA CGAATGGCaA TAAATATTTA GATGGTTATG 720 

CATCGTTGTG GGTCAA7GTG CATGGTCATA ATAACAAATA CTTnAATAAG GTAATTAAAA 780 

AGCAACTCAA TAAAATTGCC CATTCTACGC TGCTAGGATC ATCAAATATT CCGTCAATAG 84 0 

AACTTGCGGA AAAATTAATC GAAATCACGC CAAGTAATCT AAGAAAAGTA TTTTATTCTG 900 

ATACAGGCAG TGCGTCTGTT GAAATCGCAA TAAAGATGGC ATATCAGTAT TGGAAAAATA 960 

TTGATAGAGA AAAATATGCC AAGAAAAACA AGTTTATAAC GCTAAATCAC GGTTATCATG 1020 

35 

GGGATACGAT TGGTGCGGTA AGTGTTGGTG GTATCAAGAC CTTTCATAAA ATATTTAAAG 1080 

ACTTAATATT TGAGAATATT CAAGTAGAAA GCCCATCTTT CTATCGCAGT AATTACGATA 114 0 

CTGAAAATGA AATGATGACA GCTATTTTAA CGAATATAGA GCAAATTCTA ATTGAAAGAA 1200 

40 

ATGATGAAAT CGCAGGGTTT ATATTGGAAC CGTTGATTCA AGGTGCGACA GGCTTGTTTG 1260 

TTCATCCTAA AGGCTTTTTG AAAGAAGTCG AGAAATTGTG CAAAAAATAC GATGTCTTAT 1320 

45 TAATTTGTGA TGAGGTAGCA GTTGGTTTTG GGAGAACTGG AAAGATGTTT GCATGCAATC 13 80 

ATGAAGATGT TCAACCGGAT ATTATGTGTT TAGGTAAGGC GATTACTGGT GGCTACTTAC 1440 

CACTTGCAGC TACATTGACA TCTAAAAAAA TATACAATGC ATTTTTAAGT GATTCGCATG 1500 

SO GTGTGAATAC CTTTTTCCAT GGTCaTACAT ACaCCGGAAA TCAAATCGTT TGTaCGGTTG 1560 

cATTaGaAAA TATaAGaCTT TATGaAAAAC GTaAGTCnAT TGTgCACATa TTGaAACGaC 1620 
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(2) INFORMATION FOR SEQ ID NO: 3 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucl ic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



60 



240 
300 
360 
42a 
431 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 
ATGATTTTtA aAAATCATTA AGTTAAGGTr GATACACATC TTCTCATATG ATCAAATGGT 
TTCGCCAAAA ATCAATAATC AGACAACAAA ATGTCCGAAC TCGATATTIT ACACGACTCT 120 
CTTTACCAAT TCTGCCCCGA ATTACACTTA AAACGACTCA ACAGCTTAAC GTTGGCTTGC 180 
CACGCtnTTAC TTGACTGTAA AACTCTCACT CTTACCGAAC TTGGCCGTAA CCTGCCAACC 

20 AAAGCGAGAA CAAAACATAA CATCAAACGA ATCGACCGAT TCTTAGGTAA TCGTCACCTC 

CACAAAGAGC GACTCGCTGT ATACCGTTGG CATGCTAGCT rTATCTGrrC GGGCAATACG 
aTGCCCATTG TACTTGTTGA CTGGTCTGat ATCcGTGAGC ^AAAAACGGCT TATGGTnTTG 

2S CGAGCTTCAG T 

(2) INFORMATION FOR SEQ ID NO: 378: 

<i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 2006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

35 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 
TTTnTATAAC GTATTATAAA TCGTTAAAAA TTTTGGTTGT GTTTGCGTCA CGTAGACAAC 

^ CTCCATAAAG TTACTTAATC ACTCTCATCA TACAATAATT TTTACTCAAA TTCGAAnAAT 

TATAAAAATT AAATATAGAT AGGCTTTGAA AATTAGTITT ATACAAGGTT AGTAGCTGTA 
ACTGTAAAAT GTTCTTAATA TTGTCAAAAT GTAATGCTTG AAAGCGCTTT TAAAaAATAT 

^5 TATTATATAC ATGGTTAGAC AAATAGACAA ATCACTATAC AAATATTOGG AGGAATATTT 

TATGAAATCA ACACCACACA TTAAACCAAT GAATGACGTC GAAATTGCAG AAACGGTTCT 
ATTGCCAGGA GATCCGTTAA GAGCTAAGTT CATTGCAGAA ACTTATTTGG ATGATGTCGA 

SO KCAQrrTCM.T P.QP^CGPJ^ ACATGTTTGG TTTTACCGGA ACATATAAAG GTAAAAAAGT 

TTCTGTCATG GGTTCAGGTA TGkGTATGCC ATCTATTGGC ATTTACTCTT ATGAATTAAT 
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5 



10 



20 



2S 



. 30 



CATTGATTTA 


TATGATGTGA 


TTaTTkCACA 


AGGTGCCTCT 


ACTGATTCAA 


ATTACGTTCA 


660 


ACAATATCAA 


TTACCAGGTC 


ATTTTGCGCC 


AATTGCTTCT 


TATCAATTAT 


TAGAAAAAGC 


720 


AGTTGAAACA 


gcacx;tgaca 


AAGGTGTACG 


TCATCATGTA 


GGTAATGTGT 


TATCAAGTGA 


780 


TATTTTCTAT 


AACGCGGATA 


CAACAGCGAG 


TGAACGTTGG 


ATGCGTATGG 


GTATTTTAGG 


840 


TGTAGAAATG 


GAATCaGCTG 


CaTTATACAT 


GaATGCAaTT 


TACGCTGGTG 


TCGAAGCATT 


900 


AGGTGTGTTC 


ACAGTGAGCG 


ATCATTTAAT 


TCATGAAACG 


TCAACAAcAC 


CTGAGGAAAG 


960 


GGAACGTGCA 


TTTaCAGATA 


TGATTGAAAT 


TGCACTGTCA 


TTGGTGTAGA 


TGATTATGAA 


1020 


TGTTGAATAT 


TCTAAAATAA 


AGAAAGCAGT 


ACCTATTTTA 


TTATTCTTAT 


TTGTATTCAG 


1080 


TTTGGTTATA 


GACAACTCAT 


TTAAATTGAT 


TTCTGTAGCC 


ATTGCTGATG 


ACTTAAACAT 


1140 


ATCTGTAACG 


ACAGTAAGTT 


GGCAAGCGAC 


ATTAGCCGGT 


TTAGTAATTG 


GTATTGGCGC 


1200 


TGTAGTATAC 


GCTTCATTAT 


CTGATGCCAT 


TAGTATACGC 


ACACTATTTA 


TTTATGGCGT 


1260 


GATATTAATC 


ATTATCGGAT 


CAATTATTGG 


TTACATTTTC 


CAACATCAAT 


TCCCATTACT 


1320 


TTTAGTTGGA 


CGTATTATTC 


AAACTGCCGG 


TTTAGCTGCT 


GCAGAGACAT 


TATATGTGAT 


1380 


ATATGTTGCA 


AAGTATCTTT 


CTAAAGAGGA 


CCAGAAGACT 


TACCTTGGCT 


TAAGTACGAG 


1440 


CAGTTATTCC 


TTGTCATTAG 


TTATCGGTAC 


ATTATCAGGT 


GGATTTATTT 


CTACGTATTT 


1500 


ACACTGGACA 


AATATGTTTT 


TAATTGCATT 


AATCGTAGTA 


TTTACGTTGC 


CATTCCTATT 


1560 


TAAATTATTA 


CCAAAAGAAA 


ATAATACGAA 


TAAAGCTCAT 


TTAGATTTTG 


TTGGCTTAAT 


1620 


TCTAGTGGCA 


ACTATTGCTA 


CAACAGTCAT 


GCTGTTTATT 


ACGAACTTTA 


ATTGGTTATA 


1680 


TATGATTGGT 


GCCTTAATTG 


CX3ATTATCGT 


TTTTGCGCTA 


TATATTAAAA 


ATGCGCAACG 


1740 


TCCATTAGTA 


AATAAATCAT 


TTTTCCAAAA 


TAAACGTTAT 


GCTTCATTTT 


TATTTATAGT 


1800 


ATTTGTAATG 


TATGCTATCC 


AATTGGGTTA 


TATTTTTACG 


TTCCCATTCA 


TAATGGAGCA 


1860 


AATTTATCAT 


CTGcAACTAG 


ACACAACATC 


ACTGTTATTA GTACCGGGgT TaTATAGTAG 


1920 


CAGTCATTGT 


TGGtGgCACT 


AAGTGGgTtA 


AAATCGGgCG 


rAATATCTGG 


AATTCXAAAA 


1980 


CCAAGCGGAT 


TATCACAGCC 


AATTAA 








2006- 


(2) INFORMATION FOR SEQ ID NO: 379: 









45 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

so 



55 



1273 



EP 0 786 519 A2 



TC3GGGAAAAA AAGACCCAGC AGTATTAGAA GAATCGTTAA ATAlTrCTAT TGAAGAAATC 
AATCGTATCA TAAAATTAGT CGAAGAATTA CTTCAATTGA CTAAAGGAGA TGTAAATGAC 
ATTTCTTCTG AAGCACAGAC CGTGCATATT AATCAOXSAAA TTCGCTCGCG AATACACTCA 
TTAAAACAAT TGCATCCTGA TTATCAATTT GATACGGATC TGACATCTAA AAATCTAGAA 



60 
X20 
180 
240 

ATTAAAATGA AACCTCATCA ATTCGAACAA TTATTTTTAA TCTTTATTGA TAATGCAATC 300 

AAATATGATG TGAAGAATAA GAAAATTAAA GTTAAGACAA GGTTAAAAAA TAAGCAAAAA 360 

ATAATTGAAA TTACAGATCA TGGAATTGGT ATTCCAGAGG AAGATCAAGA TTTCATTTrT 420 

,5 GATCGCTTTT ATCgAGTGGA TAAATCTCGT TCAAGAAGTC AAGGCGGTAA TGGACTCGGA 480 

TTATCTATTG CTCAAAAAAT CATTCAATTA AACGGAGGAT CGATTAAAAT TAAAAGTCAA 540 

ATTAACAAAG GAACAACGTT TAAAATCATA TTTTAATCAT GACTGAGACG TCAATCAAAG 600 

TCATAGGATC AATTTTITAA GTACACATTA GCTGTGACTA ATGTATAAGA ACAACTATAA 660 

AACAAATAAA CAGTGGTfCT TTATCATTTC TGTTGTACTC CCAAAATTTA CAATAAAATA 720 

CATCTATAAA CCTAGAAGAA TCAACGCTTT TGTTGATTCT TCTITTTAGC AGATAAATAG 780 

GTAAATCTAC TTTAACAAAT AACTAAATAG TGATATTATT ACATTGTAAG CGTTTCAACA 840 

TTTTTGTGGA GGGTGTAAAA TGACTAACGA AAGAAAAGAA GTtTCAGAGG CTCCTGTAAA 900 

^ CTTCGGTGCG AATTTAGGTC TAATGTTAGA TCTATATGAT GACTTTTTAC AAGATCCATC 960 

ATCTGTACCA GAAGATTTAC AAGTCTTATT CAGCACAATT AAGAATGATG ACTCAATTGT 1020 

ACCAGCTTTA AAAAGTACAA GTAGTCAAAA TAGCGACGGC ACAATTAAGC GTGTCATGCG 1080 

3S TTTAATTGAT AATATTCGCC AATACGGGCA TCTTAAAGCC GATATTTATC CTGTAAATCC 1140 

TCCAAAAAGG AAACATGTAC CTAAATTAGA GATTGAA6AC TTTCATTTAG ATCAACAGAC 1200 

TTTGGAAGGT ATATCAGCAG GAATTGTTTC AGATCACTTT GCCGACATTT ATCATAATGC 1260 

TTATGAAGCA ATTTTAAGAA TGGAAAAACG TTACAAAGGA CCAATTGCAT TTCAGTATAC 1320 

ACATATTAAT AACAATACCG AACGTGGTTG GTTAAAAAGA AGAATTGAAA CGCCATATAA 1380 

AGTAACGTTA AATAATAACG AAAAAAGGGC ACTATTCAAA CAATTAGCGT ATCTTGAAGG 1440 

GTTTGAAAAA TATCTTCATA AAAACTTCGT TGGTGCAAAG CGTTTTTCAA TTGAAGGGGT 1500 
AGACGCACTT GTACCGATGT TACAACGTAC TATTACGATT GCTGCGAAAG AAGGTATTAA 
AAATATACAA ATAGGCATGG CTCACCGTGG ACGTTTAAAC GTITTAACGC ATCTCTTAGA 
AAAACCGTAC GAAATGATGA TTTCAGAATT TATGCATACA GATCCAATCA AATTCTTACC 
TGAAGATGGT AGCTTGCAGT TAACTGCTGG ATGGACTGGT GATCTGAAAT ATCACCTTGG 



1560 
1620 
1680 
1740 



55 



BNSOOCID: <eP_07e6519A2.l_. 



1274 



10 



IS 



2S 



EP0 786 519 A2 

AAGTCACTTG GAAATTGTTG CACCTGTTGT TGAGGGGCGT ACGAGAGCAG CACAAGATGA I860 

TACACAACGA GCTGGGGCTC CGACGACTGA TCATCATAAA GCAATGCCAA TTATTATACA 1920 

TGGCGATGCT GCTTATCCTG GTCAAGGAAT TAACTTCGAA ACAATGAACT TAGGAAACTT 1980 

GAAAGGCTAT TCTACGGGTG GTTCATTGCA TATTATTACT AACAATAGAA TTGGATTTAC 2040 

TACAGAACCA ATTGATGGAC GTTCAACAAC TTATTCTACA GATGTGGCCA AAGGTTATGA 2100 

TGTGCCAATA TTCCATGTCA ATGCAGATGA CGTTGAAGCT ACTATTGAAG CAATTGATAT 2160 

TGCAATGGAA TTTAGAAAAG AGTTTCATAA AGACGTCGTT ATTGATTTAG TAGGTTATCG 2220 

TCGTTTCGGA CATAACGAAA TGGATGAACC ATCAATTACT AATCCaGTTC CTTATCAGAA 2280 

TATTCGCAAA CATGACTCTG TTGAATATGT GTTTGGTAAA AAGCTTGTTA ATGAAGGTGT 2340 

CATTTCAGAA GATGAAATGC ATTCATTTAT AGAACAAGTC CAAAAGGAAC TAAGACAAGC 24 00 

20 TGATGATAAA ATTAATAAAG CTGATAAAAT GaATAATCCA GATATGGAAA AGCCTGCAGA 2460 

TCTTGCATTA CCGTTACAAG CAGACGAACA ATCATTTACT TTTGATCACT TGAAAGAAAT 2520 

AAATGATGCA TTGTTAACAT ATCCGGATGG CTTTAACATT TTGAAAAAGT TAAACAAAGT 2580 

TCTTGAGAAG CGTCATX3AGC CGTTTAATAA AGAAGATGGT TTAGTTGATT GGGCACAAGC 2640 

AGAACAACTT GCATTTGCGA CAATTTTACA AGATGGTACA CCGATTCGCT TAACTGGTCA 2700 

AGATAGTGAA CGTGGTACAT TCAGTCATAG GCATGCCGTG TTACATGATG AGCAAACAGG 2760 

TGAAACATAT ACACCTTTAC ATCATGTTCC TGATCAAAAA GCGACATTTG ATATACACAA 2820 

TTCTCCGCTT TCAGAAGCAG CAGTAGTTGG TTTTGAATAC GGCTATAATG TGGAAAACAA 2880 

AAAAAGCTTC AATATTTGGG AAGCACAATA TGGTGATTTT GCAAATATGT CACAAATGAT 2940 

TTTTGACAAC TTCTTATTCA GTTCTCGCTC AAAATGGGGA GAACGTTCAG GATTAACATT 3000 

ATTCTTACCT CATGCATATG AGGGTCAAGG GCCTGAACAT TCATCAGCAA GATTAGAGCG 3060 

40 ATTTTTACAA TTAGCTGCTG AAAATAATTG CACAGTTGTC AACTTATCTA GTTCAAGTAA 3120 

TTATTTCCAC TTATTGCGTG CACAAGCGGC TAGTTTAGAT TCTGAACAAA TGCGACCATT 3180 

GGTTGTTATG TCACCAAAAA GCTTACTGAG AAATAAAACA GTTGCAAAAC CAATTGATGA 3240 

ATTTACTTCT GGTGGATTTG AGCCAATTTT GACAGAATCA TATCAAGCGG ATAAGGTTAC 3 300 

AAAAGTTATT TTGGCAACTG GTAAAATGTT CATTGATTTA AAAGAAGCAT TAGCTAAAAA 3360 

TCCAGACGAA TCAGTATTAC TCGTTGCGAT TGAAAGATTG TATCCATTCC CAGAGGAAGA 34 20 

GATTGAAGCA TTACTAGCAC AATTGCCAAA CCTTGAAGAA GTGTCATGGG TACAAGAAGA 3 4 80 

ACCTAAAAAT CAAGGTGCAT GGTTATATGT CTATCCATAT GTTAAAGTGC TAGTTGCAGA 3 54 0 
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AGAAATTCAT AAACTTGTTC AAAATAAAAT TATAGAAAAT GCATTAAAAA ATAACTAGGG 
GGAAATAAGT CATGCCAGAG GTTAAAGTTC CAGAATTAGC AGAATCTATT ACAGAAGGTA 
CCATTGCAGA ATGGTTGAAA AACGTAGGGG ATAGCGTAGA AAAAGGTGAA GCTATTCrnx3 
AATTAGAAAC TGATAAAGTT AATGTCGAAG TTGTATCTGA AGAAGCAGGT GTATTATCTC 
AACAACTTGC AAGTGAAGGC GACACTGTAG AAGTTGGACA rGCAATTGCT ATCATCGGCG 
AAGGTAGTGG CAATGCTTCT AAAGAAAATA GTAACGACAA TACTCCACAA CAAAATGAAG 
AAACAAATAA TAAAAAAGAA GAAACAACAA ATAATTCGGT AGATAAAGCT GAAGTAAATC 
,^ AAGCAAATGA TQACAATCAG CAACGTATTA ATGctACGCC TTCTGCGCGT CGATATGCTC 
GTGAAAATGG TCTGAATCTT GCTGAAGTAA GTCCGAAAAC AAATGATCTG GTTCGTAAAG 
AAGATATTGA TAAGAAACAA CAGGCACCGG CATCAACACA AACAACACAA CAAGCATCTC 
CAAAAGAAGA GAAAAAATAC AATCAATATC CTACAAAACC AGTGATTCGT GAAAAAATGT 
CACGTAGAAA GAAAACAGCT GCCAAAAAAT TATTAGAGGT ATCTAATAAT ACAGCTATCT 
TAACAACATT TAACGAAtTG AcATGACAAA TGITATGGAA TTCCGTAAAC GTAAGAAAGA 
ACAATTTATG AAAGATCATG ATGGTACTAA ATTAGGATTT ATGTCATTCT TTACTAAAGC 
TTCTGTAGCA GCTTTGAAAA AGTATCCAGA AGTTAATGCA GAAATCGACG GCGACGACAT 
^ GATTACGAAA CAATATTATG ATATTGGTGT AGCTGTTTCT ACAGATGATG GATTATTAGT 
ACCArrrCTA AGAGATTGTG ATAAAAAGAA TTTTGCAGAA ATCGAAGCAG AAATTCcTAA 
TTTAGCAGTT AAAGCaCGAG AGAAAAAACT TGGCTTAGAT GATATGGTTA ATGGTTCATT 
35 TACGATTACA AATGGCGGTA TTTTTGGATC AATGATGAGT ACGCCAATTA TCAATGGTAA 
TCAAGCTGCA ATCTTAGGCA TGCATTCAAT TATTACAAGA CCAATTCCGA TTGATCAAGA 
TACAATCGAA AATCGTCCAA TGATGTATAT TGCATTAAGC TATGATCATA GAATTATT 
(2) INFORMATKMI FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

XA) LENGTH: 2222 base pairs 
(B) TYPE: nucleic acid 
4S (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



so 



ss 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 
ATCAGTCACA CGGTAGGCAT ATAAAATGAG TCGTTTCTAC AACATTTTTA AACAGTTCAT 
TCAATATTAT TTTTATCTAA TAATnATATT GGGAGOATTA TACCTTTATA CACACCATCC 
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ATGCTATTTA GCTAAAGCTA AAAGACCAGA CACTATGCAT ATTTCAACTG GAAATATGTG 24 0 

GCGATACTTA GTTGCAATTA TTGCCTGTAT GATTTGGTAC CTTAATAAAG CGCATGTAAG 300 

TATCATCGGT ATAATTATTG GTTTAATGAT TTCATATGTT GTAGTTATCA TACGTCCTTT 360 

ACTAAAGGTG AGCAAATAAA TTAAGAAAGA GGTGAGATTA TGGATCACAA ATCCCCGCTC 420 

GTGAGTTGGA ATTTATTCGG TTTTGaTATC GTTTTCAATT TATCAAGTAT ATTGATGATA 480 

CTTGTTACGG cGTTTCTTGT TTTTCTACTT Gc TATCATTT GTACGCGTAA TTTGAAAAAA 540 

AGACCAACTG GCAAACAAAA TTTCGTTGAA TGGATTTTTG ATTTCGTGAG GGGAATCATT 600 

GAAGGTAACA TGGCTTGGAA AAAAGGTGGT CAATTCCACT TCTTAGCAGT AACGCTGaTT 660 

CTGTACATTT TTATAGCTAA TATGTTAGGT CTTCCGTTTT CTATAGTAAC GAAAGATCAC 720 

ACATTGTGGT GGAAATCACC GACAGCnGAT GCAACAGTGA CTTTAACGTT GTCTACAACG 780 

20 ATAATACTGT TAACTCACTT TTATGGAATT AAAATGCGTG GTACGAAACA ATATCTTAAA 84 0 

GGTTATGTAC AGCCGTTTTG GCCATTGGCA ATTATTAATG TTTITGAAGA GTTCACTTCA 900 

ACATTAACGC TTGGTCTGCG TTTGTACGGT AACATATTTG CAGGTGAGAT ACTATTAACA 960 

TTACTTGCTG GCTTATTCTT TAACGAACCA GCATGGGGTT GGATTATTAG TATCCCAGGA 1020 

TTAATTGTTT GGCAAGCATT TTCAATATTT GTAGGAACAA TCCAAGCATA TATCTTTATT 1080 

ATGCTTTCGA TGGTTTATAT GTCACATAAA GTGGCAGATG AACACTAAAA ATTTCAATAA 1140 

TTATATACAA TCACAGGAGG AAATTAAATT ATGAATTTAA TCGCAGCAGC AATCGCAATT 1200 

GGTTTATCAG CATTAGGAGC AGGTATCGGT AACGGTTTAA TCGTTTCAAG AACAGTTGAA 1260 

GGTGTAGCAC GTCAACCAGA AGCACGTGGT CAATTAATGG GTATCATGTT CATTGGTGTA 1320 

GGTTTAGTTG AGGCATTACC TATCATCGGT GTAGTAATTG CATTCATGAC ATTTGCTGGA 1380 

TAATTAACAG ATAAAAGAGG TCGGGACAAA GCGCATAGGA CATAATTCAT GATGCATATA 1440 

40 TAGTAATATC TTTGAACTTT ATTAAATAGT TGAGATATGA ACGCACCATG CCTATCGCAT 1500 

AAATTCAGTA GGTCCTAACC TCGTCGTTTT TTTCTATATA ACACTAGCGA TTATTTTAAT 1560 

GAAAGGAGTG TCATGAACCC GTGACTGAAA CAGCTAACTT ATTCGTTCTT GGTGCAGCTG 1620 

GAGGCGTTGA GTGGGGTACT GTGATTGTAC AGGTCCTAAC TTTCATCGTG TTAcTTGCGT 1680 

TACTTAAAAA GTTCGCATGG GGTCCATTGA AAGATGTAAT GGATAAACGT GAAAGAGATA 1740 

TTAACAGAGA TATCGATGAC GCAGAACAAG CTAAGTTAAA TGCACAGAAA CTTGAAGAAG 1800 

AAAATAAACA AAAACTTAAA GAAACACAAG AAGAAGTTCA AAAGATTTTA GAAGATGCTA 1860 

AGGTTCAAGC ACGTCAACAG CAAGAACAAA TTATTCaTGA AGCAAACGTA CGTGCAAACG 1920 
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TTAATAATCa AGTATCTGAA CTATCAGTGT TAATTGCTTC TAAAGTTCrT AGPAPAGPJ^ ' 2040 
TTTCTGAACA AGACCAAAAA GCATTGGTTG ACAAGTATCT AAAAGAGGCA GGCGATAAAT 2100 
AATGGTAAAA GTAGCTAACA AGTATgcTAA AGCATTATTT GACX5TGTCAT TAGATACAAA 
TAATTTAGAG ACTATTAATG AAGAATTAAC AGTTATAAAT GAAGCAGTAA AAGATAAAAT 
TG 

(2) INFORMATION FOR SEQ ID NO: 381: 
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2160 

2220 
2222 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



20 ^^^^ SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

TAACAATTCC TaTATTCATG TTTAATACGA AACACTACAT TTACATTGTA ATTCACTATC 60 

TTTTGAAGTA ATAAAGTGAT TTGTTCAATC GATAGCTCAT TGCTTGTGTC GATTGTAACA 120 

ATTATATGCA AGTTTTCAGG ATTAACACCT AATCTTTGAA TGATTTGTTT AATAGTATAA 180 

TAATCCATCC AATAAAAATT ACTTCCTTTA ATATAAATCT TTTTAGGTTG ATACATTTGA 24 0 

CTCCTTTTTG TAGGCTCAAA AGGTATATCA ATCTCGCGCA TACTTGAAGA ACTTTGATTA 300 

GTATCATCAA ATAATTCAAT TATATTTTTA TCAATTTCAA CTAATTGGGA ATGGTTAATT 36 0 

GAATGAGATG TTGGACTATA TCTTTTTCTA ATTAATTTAG GAGTGTTACC ATACGTTTCT 420 

TTAAAGAGGT GTATAAAACG AGAATAATGA TTAAAACCAT GACTACTTGC GATTTCTTCA 4 80 

ATAGGCTTCT TAGrAGTTAA AATATCAATT AAGCAATGCT CCAGTCTAAT ATGATTTAAA 54 0 

TATTGAATAA AATTACTATA AGGTGTCGCT TTAAACATGT CACTTAAAGC TTTGTTTGTA 600 

ATACTAACTT GATTAATGAC ATCTTTCCTA TTTATCTTTT TATGGTGGTT GTTTCTTAAA 660 

TAATCGTGCA CTTCTTCGGC TACTAAATGA CGACTACCAT CGTATGTATT TAATGACATT 720 

AATTCAACAC ACATGTAATT AATAATCTTA TCATTAGCAT TATAAGACTG TTCTTTAATT 780 

45 TTGCTGTAGA TTAAATACTT AATCAAGATT CTAAAAGTGC TAGCAACCTC ArCTGTTAAT 84 0 

ATCTTGCCAC GCATCATATA GTTTGCATTC ATAAACTTCA AAAATGTCTT TGCACTTACT 900 

TCAACAACAC AACATACACT ATCATCATTA CCATCAATTT GATACAAATC ATTCAACATA 96 0 

ATAATGGTGA CATCATTTCT TTTTACATCA TATTGTTGTA GATTGATTGA AACTACACCT 1020 

GTACCTTGTA GCCAATATAT AATTTTtAAG TTCGCTTCCT TCACACTACT CATTTTCATA 1080 



EP0 786 519 A2 



ATT 

(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQXreNCE CHARACTERISTICS: 

(A) LENGTH: 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 
GGACCACCTT TTTTACCTGT ACGACTAAAG CCAACTGTTT TAAAGGCACC AACAGCATGC 
CACACTTGAA TAATTTCTTG TGATGGTCTA AAGCGAACCG TATAAATTAA TGGATGGAAA 
TCATCAACAA AAATATAATC GGCCTTACCA AGTAAATATG GCAATCTAAA CTTGTCTCTC 
CATTTGCGTC TATCCGTAAT ATTCTCCTTA AATACCGTTT TAATATCATA ATCAAAATCT 
ACTTTTTGGC GTAGTAACTC ATCATATACA TACTTGAAAT TCCCTGATAA ATTCGGACGC 
GAATCTGATG TGAATAATAT TGTTTTGCTT CTTTTAATAT GTAGTAACTT TGTAATATTA 
AAAATAGCTT TAAATAAGAA ACTTCTACTT TCAAATGAAG CTTTATGaCC TTGTTTaTGA 
AGCCAGTGTG cACTTgTCGC AATGaCCCCT GaTTTCyCyT GagGtAAGGk GaTTTCmATA 
TCAAATACAA ATTCGTTAAC GTCACT 
(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 
AGCATCTGCA AtGAGTcTCT AATACAACGA TACGTTTTGC ATCTTTAGGT ACTTTTACTG 
TACCATTTTC ATCTTTTACC GAAATAGTAT CTTTAGTTGA TGATTCTTTT TTACTTGAAT 
TATCCGTATT ACCACAAGCT GCAACTAAAA GTAAGGCAAC TATTAATCCC AATATACTAA 
AAGTTTTTAG ACCTCTCATC GTTCCACTCC TTAATATGTA TAACTTCATT TATTATTTTA 
TTGATAACAA TTATCATTGT CAAGTAGCGT TCAATCTTTT TTATATTTCT AAAATGTATG 
ACTATATATT TCCTCTAATA ATTATGACTA CAATTAGCAC ATTTCCTTAG ACAAAATACT 
GATAATGTAT CATTGCTATA TCATCTTTGC ATTAATACAA TTGACACCAT TTAGCATGAC 
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(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 





TGCACTTGAT 


ACTTTAGCAC 


ATGAAATTGC 


TAATCGATTA GTTGCAAATG 


Ana HTV^ A 


60 


IS 


AGCAACTTTG 


GAAATGACTA ATAAAATGGC 


AACGATTCGT TTTACAGAAC 




120 




TGCATTAGCA 


GGGGGTAATG 


TCAAAGCTTA 


CACTGAGCAT ATGACTATAT 




180 




ATTGTATTTG 


TTAGATAAAG 


GCGATGTTTT 


AAAGTTTAGA GAAACAAGTT 




240 


20 


AGTGTATTTA 


GCTGTgGGAG 


GCGGATTTGA 


ATTAGATGCA TGGTTAGGAT 


CTAACTCAAC 


300 




CGACTTTAAT 


GTAAAAATTG 


GTGGTTTTAA 


AGGTAGAACA TTACAAGATG 


GCGATGAAAT 


360 




AAAGCTTAAG 


AGAGATTATA 


CAGCTCGTCA 


TCATAAGTTA TTTGAAAACC 


TTGCTCACAC 


420 


25 


GAAACAAACA 


GATTGGGGTA 


TTGATGGATA 


CGCCTTGTCA TTTAATTATA 


TGTCTGATGT 


480 




ATTTCATGTC 


GTTAAAAATA 


AAGGTACGGA 


AGATTTTAAA GAAGATGCCA 


TTCAAAGATT 


540 


30 


TGTGAAACAT 


GATTATAAAG 


TAACGAGCAA 


AGCAAATCGC ATGGGGATGA 


TGCTTGAAGG 


600 


TGAAAAAATC 


AAAGcrrrrr 


ATGAAGATAT 


GCCACCGTAT CAGACTGTCA 


AAAAAGGAAC 


660 




GATACAAATT 


AAGCGTGATG 


GCACACCTAT 


TATCCTATTA AATGATCATT 


ATACGCTAGG 


720 


35 


TAGCTACCCG 


CAAATCGGTA 


CAATCGCAAG 


TTATCATTTA ACGAAATTAG 


CACAAAAACC 


780 




GCAAGGATCA 


CGTTTGAAAT TTCJSlATTTAT 


AGATATTTTA ACGGCTGAAA AGAACCTTGT 


640 




TAAgTATAGT AACTGGTTAA AC 






862 


40 


(2) INFORMATION FOR SEQ ID NO: 385: 







( i ) : SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1027 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: do\ible 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 
AAATGAGTTG ATGATGGCTC CTATTCCATC TATCTCTAGG TATGACAGAA SATAAACGGT 
CTTCTACCTG TCTsACATTA TCTTTCCAAC GATTAATACC TAAGCGTTTA GAAACACGTT 



1280 

BNSOCX;iD: <EP 0786519A2_L> 



EP0 786 519 A2 



5 



10 



IS 



20 



2S 



CAGTTrrACG 


TCCTACACCT 


GCTAAACTTT 


CTAATTCCTT 


ATGTGTTTGT 


GGTATTTCTC 


240 


CATTAAATTG 


ATCAATCAAA 


GATTGACAAA 


GrnxriTAAT 


ATTCTTAGCT 


TTGTTACGAT 


300 


ACAGACCGAT 


AGAACGAATA 


TCATTCATAA 


GTTCTTCATC 


ACTGACTGCC 


AAATAATCTT 


360 


CAGGCGTl-l-r 


GTA'rrrrrrA 


AACAGCTCAG 


TTGTTACTCT 


AtTTACTAGA ACGTCTGTAC 


420 


ATTGCGCTGA 


CAATAATACA 


GCAATAGTTA 


ATTCGAACGG 


ATTATCATGT 


TTTAATTCAC 


480 


ATTCTGCATC 


CGGAAACATA 


TTTGCTATAA 


CATCAATCAT 


TTCTAATGCT 


TTTTTCTTAC 


540 


TTACCATCAA 


GGTTCTCCCC 


GTTTAACCAA 


TCAAATTTAG 


GTACCGTTTT 


AACTGTGTGC 


600 


GTCATTTTCG 


GTTTATTGAA 


TTTTTCTCTT 


ATTTTTCTAG 


AATCGTCAAT 


TGTTTTGACA 


660 


TTGTTTTTCT 


TCCAATTAAG 


TAAAATACGA 


TCTATATATT 


TAAAGCTAAG 


TTTATTCAAA 


720 


CTATTCGCCT 


CGTCTAATGC 


CGCTTGTATA 


ATTGCAGTAT 


CGTGTTTATC 


AACATCAATC 


780 


CATTGATTTA 


ACGTTTCTAT 


TTCATATGGA 


GATAACGGCC 


TTGCAAATGT 


ATCCTCTAAA 


840 


ACTCTAAATA 


ATTGTTTAAA 


TTTTTCTTXA 

A A 4 W AAA 


A A A A 


CTTTCGTTTC 


CATACTTTGT 




TGCTTCAATA 


TATGACTTAA 


TTTTTCGAAA 


AAAGGATCTA 


GATTCATATA 


TTCGGkAAAT 


960 


CTACCTTCyT 


CATCyTTTTG 


aACTkGtAAT 


tCTAGCAATT 


CACGTgTATC AAATTTTGGA 


1020 


TACCATT 












1027 



(2) INFORMATION FOR SEQ ID NO: 386: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

3S 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 
AAGGnTTGGA GGGAATTAAT nGATGGCAAA TCCCAGaTAA AGTACACaAC GCATGGAATG 60 
TGATGCATTT AATATTTCCA GTAGTTAGTA CGACGTTTGC AAGCTTTAAA TCTATGTATG 120 
GGGGCATACC AAAAGATTTC ATAGACTACT TATTTATTGA TGAAGCAGGA CAAGCAATAC 180 
45 CTCAAGCAGC TGTGGGAGCA TTATATCGTT CAAAAAAAGT TGTAGCTGTA GGTGATCCGA 240 

TTCAAATAGA ACCGGTTGTG ACTTTAGAAA GTCATTTAAT TGATAACATT CGTAAAAATT 300 

ATCATGTTCC GGAATATCTA GTTTCTAAAG AAGCTTCTGT GCAGTCTGTT GCAGACAACG 360 

SO 

CCAATCAATA TGGTTTTTGG AAATCTGATG CTACTGATAG TAATCAAAAA ACCTGGATAG 4 20 

GCATACCTTT ATGGGTGCAC AGACGATGTT TAAAACCTAT GTTCACGATa GCTAaCCAAa 4 80 

55 
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35 



40 



45 



SO 



55 



GGTATGACGT TAAACSGAAaC GCAGTTCAAA AACAATTTCT GAAAGAGCAT GGTXyVAAAaG 
TAGTGGGATT ATTAGCTGAT GATTGGATTG AAGCAAITAA GGAAGGTAAA AATGAACCGA 
GCTCATTTGT AATATCGCCT TTTTCAGCAG TACAGCAACA GATTAAACGT ATCTTAAAGC 
AACAACTACC GACTAGAATT GATATTGAAC GTACAAAAAT TAATCAATGG GTCGATAAAT 



600 
660 
720 
780 

CCATTGGTAC TGTTCATACT TTTCAAGGTA AAGAGGCTCA GAAGGTGTAT TTTGTAATAG 840 

GTACTGATAA TACCCAAGAT GGTCCTGTGA ACTGGTCATG CGAAAAACCA AACTTGTTAA 

ACGTTGCAGT GACAAGAGCT AAGAAAGAGT TTaTCTAATT GGCGACATGC AAAGAATACA 

,5 GATGAaACCA TTTTATGAGA CGATTTTTAn AGnAAGAAAT GTAAAA 

(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 
^° (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 
CGTTTCATCA AGTnATCGAT CCTAATTTAC AAAATATTCC GGTTAGACTT GAAGAAGGGC 
GTAAAATAAG AAAAGCCTTT AAACCAACTT CAAAAGATAG CGTFATATTA TCAGCAGATT 
ATTCTCAAAT TGAATTGCGT GTATTAGCAC ACATTACACA AGATCAGAGT ATGAAAGAAG 
CATTTATCAA CGGCGATGAT ATTCATACAG CAACTGCTAT GAAAGTATTT GGTGTAGAAG 
CTGATCAAGT CGATAGTTTA ATGCGTCGTC AAGCAAAAGC ACTTAACTTT GGAATTGTTT 
ATGGGATAAG TOATTATGGT TTAAGTCAAA GTTTAGGTAT TACTCGTAAA AAAGCAAAAG 
CATTCATTGA TGATTATTTA GCTAGTTTCC CAGGTGTAAA ACAATATATG TCTGATATTG 
TAAAAGATGC CAAAGCTTTA GGTTACGTGG AAACATTGCT ACATCGTCGA CGCTATATTC 
CTGATATTAC GAGTCGTAAC TTTAATTTAC GCGGCTTTGC T<5AACGTACT GCTATQAATA 
CGCCAATACA GGGCAGTGCT GCAGATATCA TTAAACTGGC AATGGTTAAA TTTGCTCAAA 
AAATGAAAGA GACAACATAT CAAGCTAAAC TATTATTACA AGTACACGAT GAATTAATTT 
TT 

(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



900 
960 
1006 



25 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

^ TCATCCAAAT TTTGGAAATT CCACATTTTA CATATCGTAA TTTTTTAGGA AACTAGTGAA 60 

TATAACAAAT CCCTCCTCTC ATTTTTAAAA TAGATATATC ACTTCCCCAC TTTTACTTAA 120 

CTAAACTGCA ACGGTTCCTA ATACCAAAAT CCTGCCCTCT ATTTTTATCA ATTCAAGCAT 180 

10 ACTTATTGAA AAATGTTAAC GTTTTCTTGA TAATCATTGT AAGCGCATTT ATTTTATAAA 24 0 

CTAACGTTTG AAATATACTA CAGGAGTGAC ACGTAATGAC TCAAATTACT GAAAAAGAAT 300 

TAAAAAAGAA GTATTTAGAT TTACTATCCC AAAATTTTGA TACTCCAGAA AAACTTGCAA 360 

CTGAAATTAT CAATTTAGAA TCAATTTTAG AATTACCTAA AGGTACGGAA CATTTCGTCA 420 

GCGATTTACA TGGTGAATAT GAAGCTTTCC AACACGTATT ACGCAACGGT TCTGGGAACG 4 80 

TGCGAGCGAA AATCAATGAT ATTTTCAAAG AGAGACTTTC AACTAAGGAG CTTAATGACT 54 0 

20 

TAACTGCTCT TGTCTACTAT CCAGAAGrCm AATTtAAAAT TgATTAAAAG TGATTTCCAA 600 

AATTgCGGtC mACtTAATGt CyGGtATATC ACaACmATCG aACATTTAAT TGAGTTAATT 6 60 

AAATATTGT 66 9 

(2) INFORMATION FOR SEQ ID NO: 3 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 9 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

CACATGGCTG TTAGAGATAT GAATGGCCAT GCGTTACCTT TAACAAAAGA TGGCAATTTT 60 

TATCAAACGA ATGTAGATGC AAATGGTGTT AATCATGGTG GTAGTGAAAT GGTGCAAAAT 120 

40 

AAAACAGGTC ATATGAGTCA ACAArGCCAT ATGAATCAGA ACACACATGA ACCAACAGCC 180 

ACACATGCAA CAAGGTCATA TGCAATCATC AAACCATCAA ATGATGAGTC CAAAAGCAAA 240 

TATGCATTCA TCAAATCATC AAATGAACCA AAGTAACAAA AAAGTTTTAC CAGCTGCTGG 3 00 

45 

TGAAAGTATG ACATCAAGTA TTCTTACTGC AAGTATTGCC GCACTACTAT TAGTATCTGG 360 

GTTATTCTTA GCATTTAGAC GACGTTCAAC AAATAAATAA ACATAATACG ATTAATAATA 4 20 

SO GAAAAATCGT GTGATTATCT GaGGGAGCCT AGGACATAAA TCAATGTCCT AGGCtCnCTA 4 80 

AtGTTATATT GGCAGTAGTT GACTGAATGA AATTGCGCTT GTAACAAGCT TTTCCATTTC 54 0 

55 
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CGATTGTCTT ATACGTGTCA GTGTTAATTC AGATATTTCC TGTGGAATAT ACCACTTATT 
AATCATAATT GGATAAGGTG tTTGTGCGTA CAGTGTTTCA ATAATCAGCC AACAATGTGT 
ATCACCaTCA AACACGTGAC TATGATTTTk GAAGTGGGGC GCTTTGGTAA TAGACATTTT 
TAAATCTGAT TGATATGCAT TGCTATAAAT CGTTTGCTCA ACGAATGTCT TCATGTCGTC 
TTCGTTTTGT GTATTCACTT TAAATGTGTC AATGACATTT AACGGTATAA AGGTAAAGCA 
AAATGCATCA GCTTGCTTAG AATGATTGTC CrrriTTTGA TAATAGCGTT CCATTGCAAT 
GACGGCAGAA GGATGGTTTG CAAACAAATG ATTTGTATAT TCACTTTCTA AATCAACACG 
ATAATTAATT GATGACATAG ATACGCGAGC TAGCAATATT TGATCAAGTG GATGCTTAAA 
TTGATCCATA CTTGAAGCGT GTTGGGCATT TGTTTGTGGA ATAACAAAGT GTCCCTTCCC 
TCTTGTACTC TCTACGATGC CATCTTCGGC TAACAATTTt ATAGCTTGGC GCAAAGTCAT 
20 ACGACTGGAC ATCAAAGCGC GCACAAAGTT CCTTTTCAGT AGGTAATGC 

(2) INFORMATION FOR SEQ ID NO: 390: 



2S 



30 



35 



40 



SO 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doTlble 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

AATGATGATT CATTCAAAGA AATTAACGCT CGGTATATGC TTGGTATTAC TCATTATATT 60 

GATTGTAGGT TATGTCATTA TGACAAAAAC AAATGGTCGA AACGCCCAAA TTAAAGACAC 120 

ATTTAATCAA ACATTAAAAT TATATCCAAC CAAAAATCTC GATGATTTTT ACGATAAAGA 180 

AGGCTTTCGA GATCAAGAAT TTAAAAAGGG TGATAAAGGT ACTTGGATAG TTAATTCTGA 24 0 

AATGGTAATC GAGCCAAAAG GTAAGGATAT GGAAACGAGA GGAATGGTGC TCTATATCAA 300 

TCGCAATACT AGAACCACAA AAGGGTATTA TTTTATAAGT GAAATGACAG ATCACAGTAA 360 

CGGCAGACCA AAGGATGATG AAAAAAGGTA TCCGGtAAAA ATGGAACATA ATAAAATCAT 420 

45 ACCAACGAAG CCACTACCGA ATGACAAGTT AAAAAAAGAG ATTGAAAACT TTAAGTTCTT 4 80 

TGTACAATAT GGCAACTTTA AAGATATTAA TGATTATAAA GATGGTGATA TTTCATATAA 54 0 

TCCTAATGTA CCAAGTTATT CGGCAAAATA TCAATTGAAT AATGATGATT ATAATGTCCA 600 

ACAGTTaAGa AAAAGATATG ATATTCCAAC CAAACAAGCG CCGAAACTAT TATTGAAAGG 660 
CGATGGAGAT TTAAAAGGTT CATCCGTAGT TcTAGAAGTC TTGAATTTAC CTTTGTCGAA 



720 



55 
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AGGTATGAGT CAAACTGAAT ATCAAATAAA 
AACAAGTTCG ATATCTAAAG TGAGCTGTGA 
5 GGTGAATTTG ATAATGTAGC TCAAGGAGAT 

GATATTAGTA AGAAATTAAA AAATATTCAA 

TTGGAGCTAA GACAGCATGC GTTCATTCAT 

10 

AAGCTTCTAA TGTGATACAG GAACCTCATA 

TAAAGAACAT CTACATTTTA TCATATTTTC 

TTTCTTAATT TTCCAAAATA GTGATAAATT 

75 

TTTGGAAATT CATGTAATAT TAGACTTGTA 

GTTTATATAA ATACTATATA AACATTGGAG 

TTATTTTCTC AGACGTAAtA GCTATGATAT 

AATTGGTGAC ATCAAAGATG GTTACCTAAC 

GCGTTTGTAT CACTTTGCTA TGGAACGTAA 

25 AATGATTACT ACCCCTAGTC GGCGGCAATT 

GGTGAGCAAT TGAGGGCAGA CCCCTTTAAT 

AACGATACTA AAAGCCACGT CCTATATTGG 

tAAAACGATT ACCTACAAGA TTTACATATA 

(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 24 07 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

40 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 



GCAAGTTTAG TTAAACAGAT ATTAAAACCA GATCAATATA CAGATGCAAA TTCTCTATTA 60 

GAGATTAGTT TGCAAACAGG TATGTTTATT GCAGGTGGAT TATCAGGAAT ATTATATAAA 120 

ATAAATGGAT TCACTCTAAT TATAGCGATG ACTATAATGA TGTTTCTAAT CAGCATTTTT 180 

ATGTTATTTA GATTGCATGT AGATAAACCA ACTCATTCAG AGGAAGAATC AACAAATAGC 24 0 

SO TTATTGCAAG AATATTTATT AGGATGGAAA TTTTTAAAAG ACAACATGAT GATATTTATT 3 00 

TTTGGAGTTA TTTCAATCAT ACCAATGGTG TTTACAATGA TCTTTAACAT ATCATTACCA 360 



55 



ACCTGGTAAT ATAACAAGTA 
AATATAGGTA GCCATTTCAA 
TGGGTTAAAA AGGCGAAGAA 
AGAACGGAAG TTTAATAGCT 
GCCATTATTA ATATAAGCAC 
TTCCGTATCA TGTTAGAAGC 
TGACTTATTA AACTTTTATA 
TGTGAAATAC ATCACAAATC 
AGAAGTTAAT AAATAGAGAG 
TGATGATTAT GAGAAAAGAG 
TTACGTGAAC ACTGGTGTAA 
TATTGATTCT ATGCCTTACA 
ATCGTTAGTC ACTAACTAGT 
GAAGTGTGAT TGATGTAACT 
TAAGTAAACC CTAACTCCCC 
ATGTGGCTTT AGTCAJcACTT 
AAATTCTATC ATGnCTGC 



ACTCTGAAGA 84 0 

AAAATTTAAA 900 

TGAACTGGAT 960 

TATATGATTC 1020 

CGCAACAAAA 1080 

TTTTAATGTC 114 0 

TAATTAAATA 1200 

CCTTTATTTA 1260 

AGACGAGAGA 1320 

ATTGAAGCGC 1380 

ATCAAGGATT 1440 

TTGATGCTGA 1500 

TCTTATTGCC 1560 

TGCCCTCGTT 1620 

ACAAATCTGG 1680 

ATATTATTTT 174 0 
1788 
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TATGGCATTG 


' GAGGATTATG 


' TGCAGGTTTA ATTTCAGCTA 


X 1 1 1 rCGAA GAAAATCTCA 


480 




ACTAAAGTAT 


TGATATTTTT 


' GTTATATTTC 


ATATTAGTCA 


-I. i 1 v^ACtC actatttatt 


540 


5 


TGGATAAACT 


CAGCATTTTA CTTATTCATA 


GGATCATTTA 


*■ 1 T\\j\J\^ i A 1 LiAA 111 t-A 


600 




TCAATCAGAA 


TTTATATGAA 


TACAGCTATA 


ATGAACACTG 


J- X x^h^^AlAA ATATGTwGGT 


660 


10 


CGCTCATTTA 


CGATATGGAC 


GTCAATTTCA 


TTGTTACTGG 


"wivjillAAl loCTCCATnT 


720 




TTAGGAAGAT 


GGATTAATGA 


AATTAATGAT 


AAATTCGGTT 


1^-iAlAl lAT ACTCATTTTA 


780 




TCCTTACTCA 


TATTTGTaCA 


CTGCTGCTTG 


TTAACAAAAC 


a^jAL-AAAATA aaatatgcac 


840 


IS 


ATAAAGAAGA 


GTGACCGTCA 


CTCTTCTTTA 


ACAAGCGACC 


AlllAlCGaT GGGCTTAGTT 


900 




CTCTCTGCAC 


CCACACTTCA 


CTACTTCACT 


TTTTCAAATC 


Ai I 1 1 i lATG GTCTTAAATA 


960 




AATCAGTGAG 


ATTTGTTGCT 


TCGGTAAAGT 


CTAGAATTAA 


1 UA ill LTl TTAGAACCTG 


1020 


20 


GATATGGCGA 


AACTAATGTA 


TTATCTTGCA 


ATTTCTGCTG 


wOw-Aui l i l A GTCGCCTTGA 


1080 




CCAATAATCT 


ATTATCATAC 


AAACCACCTA 


TAACCACGCC 


A 1 t-A 1 AATAA ATAATATATT 


1140 




CTCCCATCAT 


CTTTCTTGTC 


TTAACCGCGT 


TTGAATTCAC 


A 1 ajA 1 1 r AAA AATAAATCAT 


1200 


25 


GTACATCTTT 


CTTAGTCGCC 


ATTGTAATCG 


CTCCTTCAGT 


A 1 lAiTjTTTA ATCACATTCG 


1260 




TATTAATGAT 


TCATTTCGTG 


TTGCTCTTAA 


TTTTATCTAT 


AAi^TATATGC GTAGTTAAAA 


1320 




TCAAACCTAT 


GGAAAAGAAA 


ATAATGATAA 


CAATGTTAAG 


MAA 1 A 1 AOTT ATAAAATTAT 


1380 


30 


AGTTTGGAAG 


GTATGCGAAT 


AGCAGAAAAA 


TAGGTATCGC 


AAAAAATAAA TCCCACCAAC 


1440 




CTAAACTTTT 


TAAAGAATGC 


TTTAAACCTT 


CCATAATATC 


1 lAlAA Al 1 ICaTCTTX 


1500 




GTTATAAGAT 


AACTAAAAAA 


TCGCTTTACT 


GTAAAAGTAG 


W\*MAA^AAAA 1 1 wl GAAT CA 


1560 


35 


TATTCATAAG 


TAGTGTATCA 


TTAATAATGA 


ACAATTTAAT 


A A X AA 1 CI- 1 IXiATCXTTG 


1620 




TATTGATCAA 


CTTACCACAA 


CATTTATTTT 


AGACTACTCT 


xrujiAw X 1 111 x-AAA^^iu 


1680 


40 


TTGCATCTAT 


TGAAATTCCT 


TTTGTATAAG 


TTAGGCTTTT 


vjrxvawXAnliiX l-Al v.Al\jtAX 


1740 




AAAAAATCGA 


GATACTAATT 


ATAAAGAGGG 


TATAAATATA 


X X>\X\AnAAvaA AAA'X l^l'l'lTSG 


1800 




AGTGAATTAC 


CAOGTCCATT 


TTTTATTrrG 


GCGCCAATGG 


«Av»AC^» X 1 Aw AQATATCGTC 


1860 


45 


TTTCGACACG 


TTGTAAGTGA 


AGCAGCTAGA 


CCGGATGTGT 


XXI xi-Au l\»A Ax X X ACAAAT 


1920 




ACTGAAAGCT 


TTTGCCACCC 


TGAAGGCATA 


CATAGTGTGC 


GCGGACGCTT AACTTTTAGT 


1980 




GAAGATGAAC 


AGCCGATGGT 


CGCTCATATA 


TGGGGAGATA 


AGCCAGAACA GTTCCGTGAA 


2040 


50 


ACGAGTATTC 


AATTAGCTAA 


AATGGGCTTT 


AAAGGCATAG 


ACTTAAATAT GGGATGTCCT 


2100 




GTAGCAAATG 


TTGCTAAAAA 


GGGTAAGGGT 


TCCGGCTTAA 


TCTTAAGACC TGACGTTGCT 


2160 
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GGCTACTATG AAATCGATGA ATGGAAAGAT TGGTTGAAGC ACGTCTTCGA ACAAGACATT 2280 

GCCAATTTAT CTATTCATCT TCGTACACGT AAAGAAATGA GTAAAGTAGA TGCACATTGG 234 0 

GAATTAATCG AAGCTATTAA AAATTTACGT GACGAAATTG CACCAAATAC ATTGTTAACA 2400 

ATTAACG 2407 



(2) INFORMATION FOR SEQ ID NO: 392: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 92: 
ATGATGnATG GCnCCGCCAA GAAGTTGAAC CAGTCTATTG GCTAGAATTG CTCAAAAAGC SO 
GAGAcaTGTG GTATTCaTAT GTTaGTAgCt ACGCAAAGAC CATCTGTCAA TGTAATTACA 120 
GGTTTAATTA AAGCCAACAT ACCAACAAGA ATTGCATTTA TGGTATCATC aAGTGTAGAT 180 
25 TCGAGAACGA TATTAGACAG TGGTGGAGCA GAACGCTTGT TAGGATATGG CGATATGTTA 24 0 

TATCTTGGTA GCGGTATGAA TAAACCGATT AGAGTTCAAG GTACATTTGT TTCTGATGAC 300 
GAAATTGATG ATGTTGTTGA TTTTATCAAA CAACAAAGAG AACCGGACTA TCTATTTGAA 360 
GAAAAAGAAT TGTTGAAAAA AACACAAACA CAATCACAAG ATGaATTATT TGATGaTGTT 420 
TGTGCATTTA TGGTTAATGA AGGACATATT TCAACATCAT TAATCCAAAG ACATTTCCAA 4 80 

ATTGGCTATA ATAGAGCAGC AAGAATTATC GATCAATTAG AGCAACTCGG TTATGTTTCG 540 

35 

AGTGCTAATG GTTCAAAACC AAGGGATGTT TATGTTACGG AAGCAGATTT AAATAAAGAA 600 
TAATTATGAG TAAGGAGTTT TATATAATGA CACACTATCA TTTTGTCGGA ATTAAAGGTT 660 
CTGGCATGAG TTCATTAGCA CAAATCATGC ATGATTTAGG ACATGAAGTT CAAGGATCGG 720 

40 

ATATTGAGAA CTACGTATTT ACAGAAGTTG CTCTTAGAAA TAAGGGGATA AAAATATTAC 780 
CATTTGATGC TAATAACATA AAAGAAGATA TGGTAGTtAT ACAAGGTAAT GCATTCGCGA 84 0 

45 GTAGCCaTGA AGAAAtAGTA CGTGCACATC AATTGaAATT AGATGTTGTA AGTTATAATG 900 
ATTTTTTAGG ACAGATTATT GATCAATATA CTTCAGTAGC TGTAACTGGT GCACATGGTA 960 
AAACTTCTAC AACAGGTTTA TTATCACATG TTATGAATGG TGATAAAAAG ACTTCATTTT 1020 

SO TAATTGGTGA TGGCACAGGT ATGGGATTGC CTGAAAGTGA TTATTTCGCT TTTGAGGCAT 1080 
GTGAATATAG ACGTCACTTT TTAAGTTATA AACCTGATTA CGCAATTATG ACAAATATTG 114 0 

55 
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5 



10 



IS 



25 



30 



3S 



40 



TGGCACATAA 


TGTTAAAAAA GGTATTATTG 


CTTGGGGTGA TGATGAACAT CTACGTAAAA 


1260 


TTGAAGCAGA 


TGTTCCAATT 


TATTATTATG 


GATTTAAAGA TTCGGATGAC ATTTATGCTC 


1320 


AAAATATTCA AATTACGGAT AAAGGTACTG 


crmrG A *TVTi*T* 

^ A X X X X w X 


WviAXvjjAjCiAT GGTGAGTTTT 


1380 


ATGATCACTT 


CCTGTCTCCA 


CAATATGGTG 


AC CAT Ar!AGT 


XXX AAATGCA TTAGCTGTAA 


1440 


TTGCGATTAG 


TTATTTAGAG 


AAGCTAGATG 


TTACAAAT A T 


XAAAUAAGCA TTAGAAACGT 


1500 


TTGGTGGTGT 


TAAACGTCGT 


TTCAATGAAA 


CTAPAATTY^f 


AAATCAAGTT ATTGTAGATG 


1560 


ATTATGCACA 


CCATCCAAGA 


GAAATTAGTG 


CTACA ATTG A 


AAL-AGCACGA AAGAAATATC 


1620 


CACATAAAGA 


AGTTGTTGCA 


GTATTTCAAf* 


P A P A A r*T*rnp 


CTCTAGAACA CAGGCATTTT 


1680 


TAAATGAATT 


TGCAGAAAGT 


TTAAGTAAAf^ 


X X\j X 


ATTCTTATGT GAAAITl''l"rG 


1740 


GATCAATTAG 


AGAAAATACT 






TTTAATTGAT AAAATTGAAG 


1800 


GTGCATCGTT 


AATTAATGAA 


GATTCTAT*TA 


n X 1 n X X A\j A 


ACAATTTGAT AATGCTGTTA 


1860 


TTTTATTTAT 


GGGTGCAGGT 


GAT ATTCA A A 


A ATTAP A Ji a a. 


TGCATATTTA GATAAATTAG 


1920 


GCATGAAAAA 


TGCGTTTTAA 


TATGTTTATA 

X^^XVJX X l.n. 


A T A n 2i P*T* JV i*"T» 


ATGGGTATTT ATTATTAATG 


1980 


ACATTATTAC 


ATGTTAATTA 


GGAGGrGTTT 


X XMM.XUV9AX X 


GGATTTTACC AATTGCTGGA 


2040 


ATTATCGCTG 


CGATTG r* A TT 


X X AAi 1 1 X A 


TGT A TPfli^ TT & 
X Vj 1 M X Uvnj X A 


TCGTAGCTGT ATTAAATTCT 


2100 


GTTAAGaAAA 


ACTTAGATTA 


TGTTGCAAAA 


ACACTTGACG 


GTGTAGAAGG TCAAGTTCAA 


2160 


GGTATTACTC 


GTGAAACAAC 


AGATTTACTT 


CATAAAGTAA 


ACCGTTTAAC TGAGGATATC 


2220 


CAAGGTAAAG 


TAGATCGTTT 


AAACTCAGTT 


GTAGATGCTG 


TTAAAGGTAT CGGTGACTCA 


2280 


GTACAAACGT 


TAAACAGCTC 


TGTAGATCGT 


GTAACAAATT 


CAATTACACA TAATATTTCT 


2340 


CAAAATGAAG 


ATAAAATCTC 


ACAAGTTGTT 


CAATGGTCAA 


ATGTTGCAAT GGAAATTGCA 


2400 


GACAAATGGC 


AAAATAGACA 


CTAC 






2424 


(2) INFORMATION FOR SEQ ID NO: 393: 







(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 

45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 
CnATAATATT CnTCAAnCCT TTAAATAGAG GAATAGACTG CCGACAGAGT CCGAGACTTT 
GTGGGTAGTT TTTTTAGTTT TGATAACGGA AGTTAGAGGC TCTCTGTCAA ATTGGGCAGA 
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AGTGTAAAGT 


TTTCAACATA 


ATACTATTAG 


TTCGGTCATG 


TATCGGACTG 


ATGGAAAAGC 


240 




GTTTCACTTT 


TAATGACTCA 


TTAAGAACGG 


CCTGAAAATG 


TTTGGCGTAT 


TAAGTGCAAT 


300 


5 


GATAGTlTIXi 


ACATTTAGTT 


TCTAATTGGT 


CATTACTGCC 


GAGCAAATCT 


AGTAGAGTAA 


360 




TCATGTAAAT 


CTTTAATGTG 


CCATTTGATT 


CACTAGCGGT 


GTTAATAACT 


ACGGAAATTG 


420 


10 


CATTTCCGAC 


TGAAATTTTT 


GAAAAATATC 


AACGTACGCT 


ACAAATAAAA 


TTTTAAACTG 


480 


TTATAAATGT 


GTCTCAATTT 


CATATGTTCA 


TCGACGATAT 


GAAGCGTATT 


ATGGTAAAAT 


540 




GAAGAAATAA 


TAAACTTGTT 


AATAAATAAA 


ACATCACGAT 


TTGACTAAAG 


CACTTTATTA 


600 


IS 


TTGTGTAGAT 


AATAGTTTTT 


TAACGAAATA 


AAAATGGCGA 


CTGGTTTTAA 


TAAATCAGCT 


660 




AATGAATCAC 


TACACCTATA 


AGTATGAATA 


TAGTGATTAG 


AATGCTTTGT 


ATAGTTGGAT 


720 




TTTGCAAAAT 


TGATGTTA 










738 


20 


(2) INFORMATION FOR SEQ ID NO: 3 94: 








25 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 










(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394: 






30 


AAAAGTTGTA 


ATTAAAAGTG 


GGATTTTACT 


TAAGnnAGAA 


GGAAACTATT 


TATATGACTA 


60 




ATAAAAGAGn 


AGATGTCCGC 


AATATAGCAA 


TTATTGCTCA 


CGTTGACCAT 


GGTAAAACAA 


120 


36 


CTTTAGTAGA 


TGAGTTGTTA 


AAACAATCTG 


GTATATTCAG 


AGAAAATGAA 


CATGTCGATG 


180 


AACGTGCAAT 


GGACTCTAAC 


GATATCGAAA 


GAGAGCGTGG 


AATTACGATT 


CTAGCCAAAA 


240 




ATACGGCTGT 


TGATTATAAA 


GGTACACGTA 


TTAATATTTT 


GGATACACCA 


GGACATGCAG 


300 


40 


ACTTTGGTGG 


AGAAGTAGAA 


CGTATTATGA 


AAATGGTTGA 


TGGGGTTGTC 


TTA6TAGTAG 


360 




ATGCGTATGA 


AGGTACAATG 


CCTCAAACAC 


GTTTTGTACT 


TAAAAAAGCG 


CTAGAACAAA 


420 




ACCTGAAACC 


TGTTGTTGTT 


GTTAATAAAA 


TTGATAAACC 


ATCAGCACGT 


CCAGAGGGTG 


480 


45 


TTGTAGATGA 


AGTTTTAGAT 


TTATTTATTG 


AATTAGAAGC 


AAACGnTGAA 


CAATTAGAAT 


•540 




TCCCTGTTGT 


TTATGCTTCA 


GCAGTAAATG 


GTACAGCTAG 


CTTAGATCCT 


GAAAAGCAAG 


600 




ATGATAATTT 


ACAATCATTA 


TATGAAACAA 


TTATTGATTC 


ATGTACCAGC 


TCCAATTGAT 


660 


SO 


AACAGTGATG 


AGCCCATTAC 


AATTTCCAAG 


TAGCATTGTT 


GGACTACAAT 


GATTATGTTG 


720 




GACGTATTGG 


TATTGGTCGT 


GTATTCAGAG 


GTAAAATGCG 


TGTCGGAGAT 


AATGTATCAC 


780 
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GATTAAAACG TTTAGAAATT GAAGAAGCAC AAGCTGGAGA TTTAATTCCT GTTTCAGGTA 
TGGAAGACAT TAATGTTGGT GAAaCTGTAA CACCACATGA CCATCAAGAA GCATTGCCAG 
TTCTACGTAT TGATGAGCCT ACTCTTGAAA TCACATTTAA AGTTAACAAT TCTCCATTTG 
CTGGCCGTGA AGGTGACTTT GTAACAGCAC GTCAAATTCA AGAACGTTTA AATCAACAAT 
TAGAAACAGA TGTATCTTTG AAAGTTTCTA ACACAGATTC TCCAGATACA TCGGTAGTrG 
CTGGTCGCGG TGAATTGCAT TTATCAATCC TTATTGAAAA TATGCGTCGT GAAGGTTATG 
AATTACAAGT TTCAAAACCA CAAGTAATTA TTAAAGAAAT AGATGGTGTA ATGTCTGAAC 



(2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: X365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



60 
120 
180 



25 ' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 

AAGACCAGGA GAACAmGTAA AACAATATAC AGTTGaAATC GCTCGTAAAT TAATGG/UVTT 
TGATATAAAA TGCTCGTGAT TGCTTGTAAA TACGCAACTG CTGTnGCTTT AGAATATTTA 
CAAAAGACCT TATCAATCCC CAGTGATTGG CGTAATTGAc CAGGTGCTAG AaCAGCAATA 
ATGACTACTA GAAATCAAAA TGTATTAGTA CTAGGAACGG AAGGCACAAT TAAATCTGAA 24 0 

GCATATCGTA CGCATATTAA ACGTATCAAT CCACATGTAG AGGTACATGg CGTTGCCTGT 300 
CCAGGTTTTG TGCCACTTGT AGAACAAATG AGATATAGTG ATCCAACAAT TACAAGCATT 3 60 

GTCATTCATC AAACACTGAA ACGTTGGCGT AATAGTGAGT CTGATACTGT CAriTTAGGA 
TGTACCCACT ATCCATTGCT CTATAAACCT ATCTATGATT ATTTTGGTGG TAAAAAGACA 
GTGATTTCGT CTGGATTAGA AACGGCTCGT GAAGTTAGTG CATTGCTAAC ATTTAGTAAT 
GAACATGCAft- GTTATACTGA ACATCCAGAT CATCGATTTT TTGCAACAGG TCATCCTACT 600 
CACATTACTA ACATTATCAA AGAGTGGTTA AATTTATCTG TCAATGTGGA ACGTATATCA 660 
GTGAATGACT AGGAGGATTT TTAATGAAAG AGATTGTTAT TGCATCGAAT AATCAAGGGA 720 
AAATAAATGA CTTTAAAGTA ATATTTCCAG ATTACCACGT AATAGGTATT TCAGAACTAA 780 
50 TACCAGATTT TGATGTGGAA GAAACAGGAT CAACATTTGA AGAAAATGCT ATATTAAAAT 84 0 

CAGAAGCTGC TGCAAAAGCA TTGAATAAAA CGGTCATAGC TGATCACAGT GGACTAGAAG 



420 
480 
540 



900 
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GCGATGAAGC AAATATTGAA AAATTATTAA ATAAGCTTGG TAATACAACT GATCGTCX5TG 1020 

CGCAATTtGT TTGTGTCATA AGTATGAGTG GCCCTGATAT GGAAACAAAA GTATTTAAAG 108 0 

GTACTGTTTC AGGTGAAATT GCAGATGGAA AATATGGCX3A AAATGGTTTC GGATATGATC 114 0 

CGATATTTTA TGTACCGAAA TTAGATAAAA CCATGGCTCA ACTTTCAAAA GAACAAAAAG 1200 

GGCAAATTAG TCATAGACGA AATGCGATTA ATTTACTTCA AGCTTTTCTT GAAGGTGATA 126 0 

AAAATGTCTA AATGGATTAT TGTGAGTGAT AACCATACTG AATCAGGCGT TTTATATCAA 1320 

ATTTATGAAA TGCACCCAGA TGCAGATGTA TATTTACATT TAGGA 13 6 5 



(2) INFORMATION FOR SEQ ID NO: 396: 

IS 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 13 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 



2S 



4S 



SO 



AATTCCTGGT 


GCAATAATAA 


ATAGGATGAA 


AAAGATTCGG 


AAAATATGAT 


AACTCGTAAT 


60 


CATAGCAACA 


TCGGCACCAG 


TAGCTAATGC 


AACTAAAACT 


ATCTGATTAA 


CCCCTCCTGG 


120 


TGCTGCACCA 


AGaAACAATT 


CATTAATAGG 


ATTATTATCA 


AAGAAATGTA 


TGATATAAAC 


180 


CATGATTAGC 


GCACCAATTA 


TCAACATAAT 


ATTTTGAATT 


GTAATTGCGA 


TTGCTAGTCT 


240 


ACCTTTTAAA 


TCTGACAATA 


AATGCGCAAT 


TTGAACTCCA 


ATTCTAATCA 


TATATATTAG 


300 


TTGTGCCATG 


TTCAACAACC 


AATGATCTAG 


TGTAAATGTT 


AAACCTGTAG 


AAAAATTCCA 


360 


AACAATTAAT 


ACAATGAGTG 


GTGCTAATAA 


TTGAAATGTT 


GGAAACTTTA 


TTTTAGACAT 


420 


AATTAGATAA 


ACTATAAAGA 


TAGCTATCGC 


TAAAATAACT 


ATTTGCCCTA 


TGTTTAATAC 


480 


TTGTGATAAA 


C?GCAAGACTT 


TTGTTAACTT 


TCCATTCGCA 


TGCATGTTAC 


CATCATGAAA 


54 0 


AAAATATGAA 


ATGAACGGTA 


CTAAAACAAC 


AACAAATATA 


ATTCGTGATG 


TTTGCGTTAA 


600 


GCTAACAACT 


AACAAATTAG 


CACGTTTGTC 


TTGTTCAGCC 


ATGACCAGCA 


TTTOTGITAG 


660 


TGCTCCTGGT 


ATAACACTTA 


AAATAGCTGT 


TTCTGTATTA 


ATACGTGCAA 


ivrrriTAAA 


720 


AACAAATGCC 


ATTACTATTG 


CAATTAATAA 


TATCGAAATA 


GATACAACAA 


TAATCGAAAG 


780 


CCAATTGTTT 


TTAATATCCA 


TAACGACATT 


TTTCGTAAAC 


GTTGATCCGA 


TTTGCACACC 


640 


TAATAGTACA 


ATACCTAATT 


CACTAAGTAA 


GAATGGCCAT 


TTAATATCAA 


GTTTGAAAAC 


900 


TTTTACACAA 


ATGATTGATG 


CGATAATAGG 


ACCAAACATA 


AATGGAAGTA 


ATACGTGCGA 


960 
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TATCATTGCC ATGTTTTCCA CTTCTTTCAA TAAAAAATAA AATGACTAAA rTGCTGCTTG 
AGCTTCACGT TTGTTAAGAT AACAATATCC GCTAGCAGTT tTGACTACAA AGCATATATG 
GaCrrrCACT ATCAAGTCGC CGCCCATCCC TTATATACAT TTAAAAnGAG CCTGAACAAA 
GTTCAGGCTC TCAATTTGTC CGTATATTTA TTTTACAATA CGACTTAAAG CCGTATCAAA 
TGCTTGAATC GTTTTTCAAT ATCTTCTTTC GTGTGTGCCG TAGATAAGAA TGTACCTTCA 
AATTGAGATG GnGGnAAAAA CACACCTCTT TGnCATTCTC GGTACATTTC TGCAATAATT 
TCC 

(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1080 
1140 
1200 
1260 
1320 
1380 
1383 



60 
120 
180 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 397: 

25 TCCACTAAnA TGATTACAAT TGCATTAGTT TGGTGGAGTG CATTTACrtAT CTTAACGGGT 

ATGATTAAGA ACCACGGTTT AATTaTTTAG TGAGATTCTT ATTTGGTGTA GGTGAGGCGC 

CAATGTACCC TTCTAATGCT GTGTTTAATT CATTTTGGTT CTCTT^AAAAT GAAAAAGGTA 

GAGCATCAAG TGCATTATTA GCAGGATCAT ATTTCGGACC TGTATTAGCA CCAATAGTTA 24 0 

CAATTGCTAT TGTTAACGCA TTTAACTGGC AAGCAGTATT TTACATTTTT GGTGCAGTAG 300 

GTATTTTAAT GGCkGTATtA TGGGCGATTA TTGCCAAAGA CTTACCTGaG CrACATAGwa 

TGGTTAATGA AGCGGAGAAA CGTTTCATTA TGGAAAATCG TGATATCGTA GCTAC 

(2) INFORMATION FOR SEQ ID NO: 3 98: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
" <D} TOPOLOGY: linear 



360 
415 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 

TTTTAGaTaA aTyCAATTnT CyATaCTaAA TgATTnTCTT ATTaCGTCAA TTcGCCTTTT 60 

aTTTTATCGT AATCTTTCCa CTGCAAAGCT AAAGCTTCTC CTATTCTAAG ACCAGAATAA 120 

AATAACAGTC TAGTTAGCTG ACGAGAAGTA TCATTTGTGA TTTGTTCTAC TTTTTCATCA 180 
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5 



20 



25 



AATGTGGGGT 


CGTATAAGAG 


CTTGTAATGC 


riTTTGGCGT 


AATTGATAAC 


TGCTTTAAAA 


300 


CCTGCCCACA 


CAGATCGTGC 


ATAGTCAACA 


GAAAGACCTG 


CATCGTTTAA 


CAAATAATTC 


360 


CTGAAAGCAG 


TACATTGCGT 


AGTAGTGATT 


TTGCCAATAG 


GGATATTTCC 


GAACCTTTCT 


420 


TTTATGTGAG 


TATTATATTC 


TGTAGTTCGC 


TTTTCTATTG 


AGCGTGCAGA 


AAGATTTTCA 


480 


rrrrin'AAAC 


GATCAAAAAA 


TATATATTCA 


AAGGGTTGAT 


TGTCCGAGTA 


TCCATATTTA 


540 


ACATTTTGTA 


TAAATTCGCT 


TTCAGCTAGT 


TTGGCATCTT 


TCTTACGTTC 


AAACCCACGC 


600 


TTCA'rrm'c 


GTTTGTTATT 


ACCGTATACA 


TCTTTATATC 


TAATGGAAAA 


ATACCATTTA 


660 


CCTGTATTAT 


CATCCTTATA 


TACTGGCATT 


TTGCTTCTCC 


CTCCTCAAAA 


TTGGCAAAAA 


720 


ATAATAAGGG 


TAGGCGGGCT 


ACCCGAAATT 


TAGTACTAGG 


TACTAAATGT 


GATATAATAA 


780 


AATAAAAAGT 


AGGTGATGTT 


ATGACATTTA 


AAAACAATCA 


TAATTTCAAT 


GAATTAGTTT 


B40 


TAACGAATGA 


AGACATTAGA 


ATTTTAAAAA 


ATGTCTTAGA 


AGATGCAGTC 


AGTGTTTATG 


900 


ATGAATATTC 


GGTATGTAAT 


GAAGAATCCG 


ATTTTGCTTA 


ctgtttatta 


AGAGACTTAT 


960 


ATACATTAGA 


CAGCTTAGCT 


ATTTCGTCAA 


ATAATGTTTG 


'aattatcgaa 


TTGTACTCTT 


1020 


CGATTTTAAT 


ACCATGCATA 


ATAGAGTTTC 


TGTGTTCAAT 


AGCAGCTTTG 


ACTGAATGTk 


lOBO 


TTAAATGTTC 


TTCTATTAAA 


TCGTTGTTTk 


CCAtTTCGtk 


TAaAAATGTt 


CyTATATTCC 


1140 



T 1141 



30 <2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 

40 

ACTGTGGTAG GTTTTTTATT TTGAAGTATT AATCATAACA GACTAATAAT CATGAGGTAA 60 

CTAATAACAC ATATTTAACT TGTATTCTTA AACTGGTATA ATAAATTTAT GTTGAAATGA 12 0 

ATATTGTATG ACAGGGTATT CACTTTTATT AAAAGGTAAA ATTAAATAAA GGTTTTATAG 180 

45 

AACGTATTTA AATATATGAG GAGTAAACAA ATGGCTGATA GAACGAATAA AGAAATTAAA 24 0 

ACAGGACGCT TTATTGCAAC TGCATCAATC GTATTCTCAA TATTATTGAT TATTCATTAC 300 

TTTGTTTCGT TGGATAATGC GACTGCCAAA GCATTACTTA ATTTAACGAA TCAAAACACT 360 

SO 

TCAGATAAAG CGATTGATTA CATTTTAAAC AGCTTTAGAT TCACTGGTAT TATGTATATT 420 



55 



BNSOOaO: <eP 0786519A2J_> 



1293 



10 



IS 



4S 



SO 



EP 0 786 519 A2 

ATGTTTGCAG TTTATGTATC AAATAGTTrG TTTACGTTGA TTAATTTATC AATCACAATt" 
CMKGCPiATAA AAGCTGCACA CGGTGCGTAC TTAACATTGC CAATTTTAAT TCTTATTATA 
GGTTCGGTTG CATTAGCGAT TTATATGCTT GTTGTTTCTA TCAAACGTAA AAGTACATTT 
AATCGCTAGA AAATTGATTT TAACAATAAA AATATGAAAA AAAAnn 
(2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



540 
600 
660 
706 



60 
120 
180 
240 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 
20 ACACAATCTG AAGATTCACG TTGTGGTGCT GGACATGATC GAAAAATTAG AGCTGAACAA 

ATGAnAGAAA TCAGTGATTT TGTTAAAAAG AAAAATATCC CTAAAGATGA AACGGTATAT 

ATAGGTGGCG ACCTTAATGT CAATAAAGGC ACTCCAGAGT TCAAAGATAT GCTPAArAAC 

2S 

TTGAATGTAA ATGATGTTCT ATATGCAGGT CATAATAGCA CATGGGACCC TCAATCAAAT 

TCAATTGCX3A AATATAATTA CCCTAATGGT AAACCAGAAC ATTTAGACTA TATATTTACA 300 

^ GATAAAGATC ATAAACAACC AAAACAATTA GTCAATGAAG TTOTGACTGA AAAACCTAAG 3 SO 

CCATGGGATG TATATGCGTT CCCATATTAy aCGTTTACT^ TCATTTTTCA GATCATTACC 420 

CAATCAAAGC CTATAGTAAA TAGTGCTCAA CTAACTAATA ACTTGCTtCG TTCTAAAAGG 430 

^ ACGAAGCGAG TTATATTGTT AAAATTTGAA TTGACTTACA TTTTAATAAA ATCATCTTAA 540 

CAACTTTAAT TTTTCaTTAA TACAaGTCTT TACTCTACAC TCAAACnAGA TTCATACACT 600 

GCACGTCATA ATAAATCTAT CTATTCAAAT ATAAATAAAA GTTACCTACT ACATTCTATG 660 

40 TAGCAGGCAA CTTTTATTAC TTATTTCTTT TCATTATCAT TAAGTACTTT TACAAACTTC 720 

ACATTATGTG TCTTCCAATC AACTTCATAT AATGCTGATA ATTTTTCTTC TTTTTTATCT 780 

ACATGGTTTT CACCAGACCA ATAGCCCCAG AAACCATGGC GATTCCAATC TATTTTAAAC 840 

TCATCCATTG ATCTTTTATA ATGAACAACA AATTGTGATT TACCTTTGTC TTTTTTATCA 900 

TGTGACATAA CAGCTAAAAA TTCTGGATTA AACCCTTCAG ACACAGTTAC AGGCATTTTG 960 

TCTTTAGGTG TGAAATTATC TTTCGCCCAT AAATTTCCAT TTCGTGTTAA AGAAAAAATT 1020 

TCACTTTTAG TTCTATTATC ACTATCATTA GTTAATTGTC TCX5TATGOTC ATGTCCCATA 1080 
TTATTTATCA AATGTGCTTC TACTTTCCAA CCTACACCTT TATGTGACGT AGATTGATCA 



1140 
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(2) INFORMATION FOR SEQ ID NO: 40X: 

<i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 847 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doub 1 e 

(D) TOPOLOGY: linear 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 
CCAGAATTAT TTTTTCAAAA AGGACAATTT AACAATGTCG ATAACGTTAT CATAAGCAAA 60 
CCGATGAAAG GGACAATGCC TAGAGGTAAA ACGGAaGCTG AAGATCAACA GTATTATAAA 120 
ACATTGCAAA CTTCTTCGAA AGATCGTGCA GAAAATGTCA TGATTGTTGA TTTACTAAGA 180 
AACGATATAG GGAGAATATC ACAGAGTGGC TCAATTAAGG TGTATAAACT ATTTTTTATT 24 0 

20 GAGGCATATA AAACTGTATT TCAAATGACT TCGATGGTAA GTGGAACTTT AAAAAATAAT 3 00 

ACAGACTTAA CTCAAATTTT AACATCGTTA TTTCCTTGTG GTTCGATTAC AGGTGCACCG 3 60 

AAACTGAATA CAATGAAATA TATTAAACAA TTAGAAAGTT CACCTCGTGG TATATACTGC 4 20 

GGACAATTGG ACTATTACTT CCAACTGAAG ATGATAAAAT GATTTTTAAT ATTCCGATTC 4 80 

GCACTATTGA GTATAAATAT GGACAAGCGA TTTATGGAGT CGGAGCAGGT ATTACAATTG 540 
ATTCTAAGCC AAAAGATGAA GTGAATGAAT TTTACGCAAA AACCAAGATT TTGGAGATGT 600 
TATAATGCAA TTATTTGAAA CAATGAAAAT TGATAATGGA CATATCCCTA GACTTACTTA 660 
TCATACTAAT CGCATAAAAT GTTCTTCTGa GCGATTAAAC TTTAAATTTG ATGAACATGC 720 
^5 ATGGCGAAAT GAATTAAACG ATGTAACAAC AAAGTATCAC AGTGGTCAAT ATAGACTTAA 780 
. AATCGTATTA AATGCTGAAA GCAAATTTGA AACGATAGTG TCACCTTTAC CTGAGAAAAG 840 
TAGTTTT 847 
"^0 (2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 

(B) TYPE: nucleic acid 
4S (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



25 



30 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 
TGAAGATGAA GCAQAAGCAG AAGACAATCT GCTACGAGTA CAATCGAAGA AAGAAGAAGT 6 0 

GTATCGTCGA TTACTTGCTT CTAACTTAAC AAGCGTTCCT GAAAGtTTAT CATCATGAAA 12 0 
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GTTAAACAGT TAAAAGATAA AGTGTCTAAA ATTGTGATTC AAATGAATAC ATTTGAAGAT 
GAAGCAAATG ATGrrCTTGT TAATGCTGTT TATGCAGAGA AATTAATTCA ATATGGAAAT 
AGATATCGTA AGGACTATAG CAATGTTGAT AAGAGCTTAA ATGAAGCTGA ACGATTATTT 
AAAAATAATC GCTATAAGCg TGCGATTGAA ATTGCAGAGC AAGCTCTTGA AAGTGTTGAG 
CCAGGTGTTA CTAAACATAT TGAAGAAGAA GTTATTAAGC AATAGAAACT AGTATGTAGT 
TATACTTAAA TAATATGAGC ACTCTGTCAA ATTGGACTGA TGAGTTTAAT AATTGAAGTT 
AGCCAACGAT ACGTTGTCTA GCTTCTTTTT TATATGGATA AATGaAAGGG ACAATAAATA 



GTGAAAAAAG CATAATCACA CAGCTGTTTA AATAGAGTGA AATAGTCTAA TTCTTATTTA 
ATT^GTAGAA ATAAGATTAT 
20 (2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



720 
740 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 

ATGCCAATAA TTAAACCATG TAATAAATAT ACGTATAGCG TACGACTACC AATATAAGTA 60 

TATAATTTTT TCTTTGTTGA CATTAAATTT AGAAACGCAG TCATTGCGAT TAATATAATT 120 

CCATATAATA TAAGTCGTTT AAAAGGACTG AATATACTCT GTCCTTCATT TTCAAGTGAA 180 

GTATATGGTG AACTTCCCAA TAACCAATCT GCATTGATAG GATGAATCAC GTAAACGATA 24 0 

AAAAACAAAA TAAAGGTAAT GATAGATACT GGTATTAGTT TTTTATTTTT AAAAATAGCC 300 

40 GTATGTTTTT TGGTGAAAAT GTAACCTAGA TAAAATATTG GGAAAAATAC GATTGTCCTT 360 

GAAATGCTTA AGTAGCTATC GATGTTATCT GAAAAACCTG CTCCAATAGA TATAATAATT 420 

GAAACTGATA GCACTTTATA TGGATTAAAT CTTCTAACTA TTACTAAAAT GACATGAAAG 480 

AAAAATAGCG TGATCAAAAA CCATAACGCA AATACTGGGT TAAAAGGATC AAGTTGTAAT 540 

TCGTCACTTT TACCTGTTAA GAAATAATAA ATTGAAAAGA ATGCAAAAAA TATCATATAA 600 

GGTACTATCA AACGTTTTGA AATTTTTTCT 

(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6254 base pairs 
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20 



25 



(C) STRANDEDNESS : doxible 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 
ATTTAAACGT TCAGTTTCTA AAAGTGGAAC AATCCAAGAA GTACGTAAAC GTGAATTTTA 60 

10 CGAAAAACCA AGCGTAAAAC GTAAAAAGAA ATCAGAAGCT GCACGTAAAC GTAAATTCAA 120 

ATAATTAATA CCTCTGTTGA CTCCCTCAAC ACGAATATTA ATTATATAAA ACAAACATCA 180 

CAAGTTAGTG TCTGACACTA ATATGTGATG TTTTTTTGTT GTCAATTTTT AATTAAAAAA 24 0 

AGTTATATAG TTTATAAATA ATCAAATTGA TATTCTATAG GTTCTTATAA CTATAAAGTA 3 00 

TATTCAATTT CATGTATAAT TAATGTGAGG GCGAGGTGAA ATTGTGAGTT ATAATAATTT 3 60 

TTTACAAATG ACAACTATCT TGGAATCAAC GGCTGGAGAT ACTTGGGTTG AACAAGTTAG 420 

CAATATAATT GTTCAACCTA TTTTTACGTT AATATTAACC TGTTTGACAT TCTTAGGATT 4 80 

TGTATATCAA CTTTACTCTA AAAAAATCAA TGCAGCTGGT ATTATCGCCA CATTATCATT 54 0 

ACTTATTTTA TTTTTGGGAT TTCTAATCCA AGGAAATGTC AATATGCATT CTATCTTAAT 600 
ATTCTCAATT GGCGTTATAT TAGTTGTAAT TGAATTATTT GTAGTTGGTG CAGTAATTGG 660 
TATTATTGGC ATGATACTGA TAACTATAAG CATAACAACG CTCGGTGATA ATTTGCTATT 72 0 

30 TATGCTTGCG AATGTTATCG TTGCCTTGAT TTTAACGATT GTAGAATGGG TGATATTAGT 780 

GAAGATTTTC AACAGAAAGA TTCCGTTTTT GGATAAAGTT ATCTTAAAAG ATTCAACTAA 840 
TTCTGAGTCA GGTTACAATT CTCATGATAA CCGCTCGCAC CTCGTAGGAA AGACTGCTCA 900 
AACAGTTACA GATCTTCGAC CTGCAGGGAT TATTTTTTGT GAAAATGAAC GTATTGATGC 960 

TGTTTCAGAT GGCAA C TTT A TTTTGCGCAA TAAAAOGGTA AAAATCCTTG AAGTTGAAGG 1020 

AACAAGAGTA GTTGTGAGGG AAGTAGATTA ATTAAAAGGA GCGATACCAT GTTTAGTTTA 1080 

AGTTTTATCG TAATAGCAGT TATTATAGTA GTTGCATTAC TTATTTTATT CTCATTTGTA 1140 

CCCATTGGTT TATGGATTTC AGCGTTAgcA GCTGGCGTTC ATGTTGGTAT AGGTACATTG 1200 

GTTGGTATGC GTTTACGTCG TGTATCTCCA AGAAAAGTTA TAGCGCCATT AATTAAAGCG 1260 

CACAAAGCAG GACTAGCATT AACAACAAAC CAATTAGAAT CGCATTATCT AGCAGGAGGA 1320 

AATGTTGACA GAGTTGTTGA CGCTAATATT GCTGCACAAC GTgcTGACAT TGATCTTCCT 13 80 

TTCGAACGTG CTGCTGCAAT TGaCCTTGCA GGACGTGACG TATTAGAAGC GGTTCAAATG 144 0 

TCTGTTAATC CTAAAGTCAT TGAAACACCA TTTATCGCAG GTGTAGCAAT GaACGGTATT 1500 

GAAGTGAAAG CCAAAGCTCG TATCACAGTT AGAGCTAATA TTGCTCGACT TGTTGGTGGT 1560 
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AGTAAGCATC ATACAGAAGT ACTTGAAAAC CCAGATAATA TTTCTAAAAC AGTTTTAAGC 
AAAGGTTTAG ATTCAGGTAC TGCATTTGAA ATTTTATCAA TTGATATTGC TGACGTTGAT 

5 

ATTAGTAAAA ATATTGGTGC AGACTTACaA ACTGAACAAG CATTAGCAGA CAAAAATATT 
GCACAAGCAA AAGCTGAAGA ACGTAGAGCT ATGGCTGTAG CAACTGAGCA AGAAATGAAA 

GCGCGTGTAC AAGAAATGCA TGCTAAAGTA GTTGAAGCCG AATCTGAAGT ACCATTAGCT 1920 

ATGGCTGAAG CATTACGTTC AGGTAATATC AGTGTTAAAG ATTATTATAA TTTGAAAAAT 1980 

ATCGAAGCTG ATACAGGCAT GAGAAATGCA ATTAATAAAC GAACTGATCA AAGTGATGAT 2040 

^5 GAGTCACCTG AACATTAAGT CGAGAGGTGA TTAAATGAGT GTCGGTATTC TAATTTTTGT 2100 

CATATCAGTG ATCATTTCTA TCATTACTAC TATGCGCGAA AATAGTCATA AAGATAGACA 2160 

AAATCAAAAG CCACCTCAAA AAACATCTAC CGATAATGAA CCAAAAAAAG GTGGCTTTTT 2220 

20 TGAAGAAATT GAGCGAACGT TTAAAGAAAT AAGTGAAGAA TTAAATGAAG AAGAAAAGAA 2280 

ATCATCGAAA CGAAAATATG ATGATACGTT ACCACCTTTA TTCGATGAAC TTCCAAAGGA 2 34 0 

AGAGCCTAAA TCGAAACCTG TTGTAGAACC TATGGCACCT AAAAAACAAC AAGAAACAAA 2400 

ACCGATGACA GAGAAACCAA TCACAGTGCC TAAAGCAGAA CCGGTGGAGC AGAAACATAG 2460 

ACCTTCTAGA CAAGATAATT CTGACGAAAT TAGACGTCAA TTAGAAAAAT CACTTAGAGA 2520 

TGATATTAAA ACGATTCGTA CTGACATTGA TAGAGAAAAA GAAAAGCAAA TTGCTAAAAT 2580 

GGAAAAACGT GCTAGAGATA TTATTGAGGA TAAATACTTA TCTGAACGTA CAAAACGTTT 264 0 

GAAATTAAAG CAGCTGCTTA ATTCTCAAAA TGTCGAAAAA GATTTGACTA AATCAGCGTT 2700 

CCAATTTGAT AAAGATGAAG TAATCAATGG TATGATATGG TCAGAAATTT TAGCTAAACC 2760 

AAAACAATTA TAAAATTTTT TGAAAACAAG CACTATCGTA ATGGTAGTTG CTTGTTTTTT 2820 

TACGTTAAGG AAAATTAAAA AACAAAGAGA ATTTTTCGAG AAATATTAGT TATTTAAATT 2880 

40 ACAGCAAAAA ATTGATTAGT CTAAAATTGA ATCTGCTTTT ATGACAAGGT GAAAAGTATA 294 0 

AATGATTATT TTAAATTAAA GAAAAATGAG TAAGTCAATG CAAAGATGTT TAAATCAATC 3000 

AATTGCATGA TATAATTAAG TAGATATTAA AGCATCATAG AATGAATATA AATGATATAT 3060 

GAAAAGGAGC GCGTGTATGC CTGGAATTAT ACAAATAGAC GATATGAACC AATCTCAAGC 312 0 

TTTAATTGGA AATAATGATG AACATTTAAA AGCAATTGAA GAGAGTTTCG ATGTTGTCAT 3180 

CCATGCAAGA GGACAAGaAG TTGCCGTTAA AGGTACAAAA ATAGAAAACG TAGAAAAAGC 3 24 0 

SO 

GGAATCAGTA TTAATCAATT TGCTGAAGGT TATTGATTTA GGTAATAATA TTACAATTAA 3 3 00 

AGATGTTGAA GCAGCTATTA AAATGGCGCA TAATAACACA ATTCAACATC TGTTAGATTT 3360 
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GCAACGTATA 


TATGTTAATG 


CCATGAAAAA 


TAATGATTTA 


GTATTTGGTA 


TAGGTCCTGC 


3480 






AAGACATTCT 


TAGCTGTAGT 


TTATGCAGCA 


AAGCAACTCC 


GTAAAGGTGC 


3540 


5 


1. Vj JTTAAACGT 


ATTGTATTAA 


CAAGACCTGC 


TGTTGAAGCA 


GGAGAGTCAC 


TTGGATTTTT 


3600 




AC CAGG AG AT 


TTGAAAGAAA 


AGGTAGATCC 


ATATTTAAGA 


CCTTTATATG 


ATGGTCTATA 


3660 


10 


TACTGTTCTT 


GGGCGTGAAC 


AAACAGAGCG 


ATTTATTGAA 


AGAgGCATTA 


TCGAAATAGC 


3720 




GCCACTTGCA 


TATATGCGCG 


GACGAACATT 


AGAAGATGCA 


TTTGTAATTC 


TTGATGAGGC 


3780 




GCAGAATACG 


ACACATGCGC 


AAATGAAAAT 


GTTTTTAACA 


AGACTAGGTT 


TTGGCTCAAA 


3840 


15 


AATGGTAGTT 


ACTGGTX3ACC 


AAACTCAAAT 


CXyiTTTACCT 


AAAGGTGTTA 


AAAGTGGACT 


3900 




TAAGGAAGCG 


GTCAGTAGGT 


TACACAACGT 


TAAAGGTATA 


AGTATATTGA 


AATTAGATCA 


3960 




GAGCGATGTA 


GTCAgACATC 


CATTGGTAAG 


TAAGATCATT 


GAACATTATG 


AAGGAGAGAA 


4020 


20 


TTAAATGTTT 


ACGATAGATT 


TTAGCGATCA 


CACAGGCTTA 


GTTAAAGATG 


CTTGGTATAA 


4080 




ACAAATTGAA 


GATTTATTAG 


AATTTGCTAA 


AAAAGAAGAG 


CATATAGAAG 


ACGATGCTGA 


4140 




GCTTTCTGTT 


ACATTTGTAG 


ATAAACAAGA 


AATACAAGAA 


ATTAATCGAA 


CATATAGAGA 


4200 


25 


TAArGwTAAr 


GTTmCaGATG 


CAaTCyCaTT 


tGCTTTAGrA 


GrAGATGAGC 


CmGaGATJcGA 


4260 




TtTTAGTGGT 


CTTGATATAC 


CACGTGTTTT 


AGGGGATATA 


ATTATCTGtA 


CGGATGTAGC 


4320 


30 


GCAAGrACAA 


GCAAACAATT 


ACGGACATTC 


TTTTGAACGA 


GAATTAGGAT 


TTTTAGCATT 


4380 


ACATGGATTT 


TTGCATCTAT 


TAGGTTATGA 


TCATATGACT 


GAAGCGGATG 


AAAAGGAAAT 


4440 




GTTTGGTCGA 


CAAGATACAA 


TATTAAACGC 


ATATGGATTA 


ACACGAGACT 


AATTATGAAA 


4500 


35 


AGGTTTAAAT 


ATGCACTTGA 


TGGGCTGAAA 


ATCTTAATTC 


AAAAAGACTA 


TAAATTTCTT 


4560 




TTACATGTGT 


TTGCAATGAT 


TGTTGCTATT 


GTCTTTGGTC 


TCGTACTAAA 


TATTAATCGG 


4620 




ATTGAGTGGA 


TATTTATACT 


CATTGCTATT GCATTAGTTC 


TCACTGTTGA 


AGCTTTAAAC 


4680 


40 


ACTGCTATTG 


AATATGTTGT 


CGATTTAGTG 


ACCGTTGAAT 


ATCATGATTT 


AGCTAAATAC 


4740 




GCTAAAGATA 


TTGCGGCTTT 


TAGTGTACTT 


ATAGTTTCAA 


TATTAGCATT 


TATTATAGGT 


4800 




TTAATAGTAT 


TTTTACCACA 


TTTTATAGCG 


TTATTTTAGG 


GAGGCATATA 


TGAGTTATCA 


4860 


45 


AC CTCATTAT 


TTTCAAGAAG 


TTAGAAAAGC 


ACAACAAGAA 


TCATATTCGC 


CATACAGTCA 


4920 




ATTTAAAGTA 


GGGGCTTATT 


TAAAAmCGAA 


AGACgGTAGA 


AcrrrrrATG 


GTACCAATGT 


4980 


SO 


AGAAAATGCT 


TCTTATCCAT 


TATCGATATG 


TGCTGAACGA 


GCTAGTTTGG 


TATCGGCAAT 


5040 


TTCTCAAGGA 


TACAGACCAG 


GTGATTTTGA 


ATCAAtAACT 


GTAACXX3TAG 


ATGCAGATAA 


5100 




ACCGTCATCA 


CCTTGTGGTG 


CATGTCGTCA 


AGTTTTGAAG 


GAATTATGTG 


ATGATGATAT 


5160 
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ACCATTTGGA TTTTCAGGAA AGGATTTAGA ATAAATGACA GAACATAAAT CAGGATTTGT 
TTCAATTATA GGTAGACCAA ATGTAGGAAA GTCAACATTT GTTAATAGAG TGATCGGCCA 
TAAAATAGCA ATCATGTCCG ATAAAGCTCA AACAACTAGA AATAAAATTC AAGGTGTTAT 
GACAAGAGAT GACGCGCAAA TTATATTCAT TGATACX3CCA GGTATTCATA AACCTAAACA 
CAAATTAGGT GACTATATGA TGAAAGTCGC TAAAAATACA TTATCTGAGA TAGATGCAAT 
CATGTTTATG GTTAATGCCA ATGAGGAmAT TGGACGAGGC GATGAATATA TTATAGAAAT 
GTTGAAAAAT GTTAAGACAC CAGTATTTTT AGTATTAAAT AAAATAGATT TAGTGCATCC 
AGATGAATTA ATGCCAAAGA TTGAAGAATA TCAAAGTTAT ATGGACTTTA CAGAGATTGT 
ACCTATTTCA GCATTAGAAG GGCTAAATGT CGATCATTTT ATTGATGTTT TAAAGACGTA 
TTTACCCGAA GnACCTAAAT ATTATCCAGA TGATCAAATT TCAGACCATC CTGAACAATT 
TGTAGTGGGT GAAATCATTC GTGAAAAAAT CCTTCATCTT ACAAGTGAAG AAATCCCTCA 
TGCGATTGGT GTTAATGTGG ACCGTATGGT TAAAGAAAGC GAAGATCGTG TTCATATCGA 
AGCAACTATA TATGTTGAAA GAsGTTCGCA AAAAGGAATT GTCATTGGAA AAGGCGGTAA 
AAAGTTAAAA GAAGTAGGaA AAcGTGCGAG ACGTGAtATA GaAATGCtTC TAGGCTCTAA 
AGTTTACTTA GAATTATGGG TCAAAGTTCA AAGAGACTGG CGAAACAAAG TTAACTTTAT 
TCGCCAAATT GGTTATGTTG AAGACCAAGA TTAATCTTAA AAGTGGTGAA GATAATTGTT 
AATGCGCCAA AAAGGGATTA TCATCAAAGC AGTTGATTAT GGTGAATCTG ATAAAATTAT 
CACGATTTTA AATG 

(2) INFORMATION FOR SEQ ID NO: 4 05: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : do\lble 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 
GTTGTTCTAA ATGTTTCTTG nATGAAACGA GTCAATGTAA ACTGATATTG CTGTATTTGT 
TGCAGCAATT CATATTGGTC TGGTGTTGCA ATAACAGCAG CTTGAGTTGG AGTCGCAGCT 
CTGATGTCTG CAGCAAAATC ACTTAATGTA AAGTCTGTTT CATGACCAAC TGCTGATATA 
ATCGGTGTCT TACAATTATA TATTGCACGG ACGACAGCTT CTTCGTTGAA ATTCCATAAA 
TCTTCTATGG ATCCACCGCC TCGACCTACA ATAATGGTAT CTACACCTAA ACTATCTGCA 
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TGTATTTGTT CAGCTAATGG AAAACGACTA TTTATCGTTG AATGGATATC TCGAATTGCG 420 

GCACCTGTAC TCGCTGTTAA AACTGCAATT TTTTTAGGAA ACTTAGGTAT TGATTTCTTA 4 80 

TTOGCTTTAT CAAAACAACC TTCTTCAGTT AATTTTTTCT TTAATGCTTC TAATTTTTGA 540 

TATAAGTTCC CTATACCATC TAATTGCATT TTATTTACAT AAATTTGATA GTTTCCACGA 600 

CGTTCAAAAA CAGAAACACX; TGCTTCTAAT AAGACTTCAT CTCCTTCTTT AGGTTCGAAG 660 

TTTAATTTAG AAGCACTACC TTTGAACATC ATGGCACTTA TAACGCTTTC TTTATCTTTC 720 

ACATTAAAGT ATAAATGaCC ACTTGaATGc TTTTTGAAAT TTGAAAGCTC ACCTTTAATC 780 

AATACAGATT GGAGATGTGG ATCTTGATCA AATTTATATT TAATATATTT CGTTAAAGCT 84 0 

GAAACACTTA AATAATCTGA CATATAACAT CACTCAATTT TATTTTTTTA TATTACTCAA 900 

TACACCATTT ATAAATTTAT AATGATCATC ATCACTGAAT TGTTTTGTTA ATTCAACTGC 960 

2^ TTCATTCATT ACGACTTTAG CAGGTGTATC ACTGTGTAAT ATTTCATATG TTGCCATTCT 1020 

TAAAATAATA CGATCCGTTT TTAATAAACG TGCAATAGTC CAATCTTTTA AATAAGGACT 1080 

AATTGTCTCG TCTAATACAG GTTCGTGATC TTTAACGCCA GAAACTAGCC AATGAATAAA 114 0 

TTCGAAGTCT AAATCTGGAT TATCGTCTTT AATAAAGCTT ATCGCTTCAT TTATCGTTAA 12 00 

ATCACTGTCC TTCATTTCTA ATTGAAATAA AGTTTGAAAA GCTTGCACTC GGGATTCTTT 1260 

ACGACTCATT TTTAACTCCT TCAAACGTTT GTATTTTTCT TTATTTAATT ACTGAATTAG 13 20 

GTATGACATT ACTTTTCAAT AACGATTTGT GTAATGTGAA TATTAATTTG CTTAGGTTCT 13 80 

ATCGCTGTCA TATTAGAAAT TGAATTAAAA ATTGACGTTT GAATTTTGTT TGCAGTTTTT 1440 

35 GAAATATTAA CACCATGTTT TAATGCACAA TATACATCTA TATATATGCC ATCTTCTTTA 1500 

CTCTCGATTT TTAAATCACG GCTTAAATTT TTACGACTAA CTTTTTCTAA ATTTGTTTCT 1560 

TTTAATTCAG CAAAATGGCC AGTGATGCCT TCGACTTCCG AAOTAOCTAT ACTTGCAATA 1620 

ACAGATAGCA CTTCTGGCGC TATTTCTACT TTACCTAATT TTGAATTTGA ATAATCAGTT 16 80 

ACTTTGACCA TGGATTGACC TCCTATTAAC CTTCATCATT CATAATGCTA TTTTGCTCTA 174 0 

AAAAGTTTGT ATTAAATTTA CCGCTTCTAA ATATATCGTT ATTCAATAAT TTAATATGGA 1800 

ATGGAATAGT TGTATCAATA CCAAGAACCA CAAATTCACT TAGTGCACGA ATGCCAGCCA 1860 

TAATCGCTTC ATCTCGTGTC GGTTCATGTA TGATTAATTT CGCTACCATC GAATCATAAT 192 0 

ATGGCGGTAT CGTATAATTA GTATAACATG CTGACTCTAT TCGAACACCA TATCCACCTG 19 80 

GTGCAAGATA TTGCTCmATT TTACCTGGTG ATGGCATAAA GTTCTTGTAA GGATTTTCAG 204 0 

CATTAATTCT AAATTCAATT GCGTGTCCTG TTAATTTAAT ATCTTCTTGT TTATACGGTA 2100 
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CAGTTACAGG ATGTTCTACT TGAATACGTG TATTCATTTC CATAAAATAA AATTTATTAT 2220 

CATTTAAATC ATATATAAAC TCAATTGTTC CCGCATTTTC ATAATTTACA GCTTTCGCTG 2280 

CACGAACTGC GGCATTTCCC ATTTCACGAC GTGTTTCATC ATCTAAAATT GGGGAAGGTG 2340 

CTTCTTCCAC TAATTTCTGC ATACGTCTTT GAATTGTACA ATCACGTTCT CCTAAATGAA 2400 

TTACATTACC ATAGCTGTCC CCAACAATTT GGATTTCAAT ATGGCGGAAG TTTTCGATGA 24 60 

ATTTCTCCAT ATAAAGTCCA CCATTACCAA ATGCAGTTTG AGCTTCTTGT TCTGTCATTC 2520 

GGAAGCCAGT TTCAAGTTCT TTTTCATCAC GAGCAACACG GATACCTTTT CCGCCACCX3C 25 80 

CAGCAGTAGC TTTAATGATG ACCGGATAGC CAATTTTTTT GGCGATTTTC TTAGCTTCTG 2640 

AGACGTCTTT CATTAAACCG TCACTACCAG GAACAACTGG AACATTGGCT TTGATCATTT 2700 

CTGCCTTAGC AACATCTTTG ATACCCATTT TTTGGATAGA TTGATAACTT GGTCCAATGA 27 60 

20 ACTTCAATTG GcATgctTCG CATAATTCTG CAAAATCAGC ATTTTCAGCT AAAAAGCCAT 2820 

AACCCGGATG AACGCCATCA CAACCTGTAG AAGTTGCAAT AGATAAGATG TTCGGAATAT 2880 

TTAAATATGA ATCTTTAGAC AAAGTGGGAC CTACGCAATA TGCTTCATCA GCAATTTGAG 2 94 0 

TATGTAGCGC ATCTTTATCC CCTTCAGAAT AGATTGCAAC AGTTTGGATG CCTAAATCAC 3000 

GACAAGCGCG AATAATCCTA ACTGCGATTT CACCGCGGTT TGCAATTAAA ACCTTTTTCA 3060 

TTATTTCACC TTAAATAACG GTTGGCCATA CTCTACCATT TGTCCGTCTT CTACTAAGAT 3120 

TTCAACAATT TCACCTGAAA TTTCTGCTTG AATTTCATTA AATAGTTTCA TTGCCTCTAA 3180 

AATACACACT GTTGTTTCAT TTGAAACAGT GTCCCCAACT TGCACATATG CTTCTTCGTC 324 0 

3S TGGAGATGGC GATTTGTAAA ATGTACCTAC CATAGGTGCA TTAATTGTTT TGTGATTATC 3300 

TGAAGTTGGC TTTGGAGCTT CAGTTTTATT GCTATCAGTT GATTGTGCTT GAGGCATAGG 3360 

CATTGCCGCA GCTTCAACTG GCATTTGTGA GATTTGTGGC GTGATAATCT CAGTTTCTTT 3420 

TTCTTTCTTA AGCGTCACTT TGCCTTTAGT ATCTTCAATA TTGATTTCCG TTAAAGTTGA 34 80 

TTTATCCAGA ATTTCAATTA ATTCTTTGAT TTCTTTAAAG TTCATTATTA CTGACTCCTT 3540 

CAGTTTGTTT TCATCTACCC GTCTATTTTA CTTGAGACAA CTCTTCAATT CAAGCATGTT 3600 

CATATTGCTG GCGACATTAT AAGTCTATCC CAAAGTTATA ATAAAACCAC ATTTTAAATT 3660 

AAAAACACTT GTGTATTTAT TACTTAACAT TGACTCATCT TAACTCTTGA 3710 
(2) INFORMATION FOR SEQ ID NO: 406: 



2S 



30 



40 



4$ 



SO 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

^ GCTGATGTTC GTTGCCTTTn TCCACCAGAC AATTCAGAGG GATATTTATC ACTAATATCC 60 

AATATATTTA ATGCTTCTGC TACTTTTTCA TAACGATTTA ACATATGTTC TTTATCTAAC 120 

TTCTGTACCG TTAGTGGTAA CATTATGTTT TCTTTAACAG TCAATGTATG CAGTAAATTA 180 

70 TACTCTTGAA AAATAAAACC AATATCATGC TTGCGTATAT CAGATwATTC CTTGTTTGAA 240 

AGCTTTTCTA ATTTTTTTCC TTTTAATGTA ATAGAACCTT GTGAAATATA ATCAATTGAA 3 00 

CTTAAAACAT TTAATAATGT CGTTTTCCCA GATCCAGAGG GACCCATAAT AGCAATAAAC 3 60 

TCGCC T TCTT CAATAGACAT ATTGATATCT CGCAACACTT CTTGTGCCAT TTTTTTAGTT 420 

CCATATATTT TTGTTAATTG TTTTACTTCT AAAATTGCCA CTTTAACACT CCTATAATTT 4 80 

ATCTTAACTT CATTTCCTTT AGGCTTTGGC ACTTGTATCT TCAATTTAAC ATATGACTAA 54 0 

CATCTATCTT ATTATAACGT TGAAGCTGCA TTGATGTATC AATTCTAAGT AACAAAACGC 600 

ATGTTTAAAA TGACAAATTT GTCACTTCCG ACATGCGTTC AACAATTTCA TTTTGTAATG 660 

2s GGAAAATCAA TCTGACAGTT GTCCCCTTAC CAACAGTCGA CGTGACTTGC AGGTGAATAC 720 

CTAATTGATC CTTTACACTA TTTACTAAAT ATAGACCCAT ACCTGAAGAC GTCGTTTCAT 7*80 

TTCTGTTAGC CGTTGACGTA AATCCTCGTT CAAATATTCG CGGCATATCT TTTTTACTAA S'^^O 

30 TACCTCTGCC ATAGTCTTTA ATATATAACG AAACATGTTG ATCATTTAAT TCTGTCCCAA 900 

TTTCAATATT AAAATTCTCA CTATATTTCA ATGCGTTTGA CAAAATTTGT CTAATAATCA 9 60 

TACGACACCA TTTTATATCT GTATAAACAT AATCATCCAC TTTAAAGTCA ACATCAAAAC 1020 

CAATACCTTT AACCTGACTA ATATGTCTTG TTAATTGTAT TTCATCAATG ACCATGCGTT 10*80 

TAAGTGACAC GTAATCAAAA TACATATCTT TACGTTGAGA TTCTAATCTA GTAATATACA 1140 

GCTGTGTATC TAGCATCGAG TTTATACGAG ACCATTCATA TAGTAATGCT TGTwTTCTTT 1200 

CTTGATTTTT TTCTTGATCA ATTAATAATT TCATAGCTGT CACAGGtGTT TTTATGTCGT 1260 

GCACAAATTC TGTAATGGTT TGTTCATGCA TGTTCAATTG CAACTGTTGC TCAACAACCT 1320 

45 TTTCTTTGTq CGCTGAGATT TGACGATATA AATAATCAAC TGTATGACGT TGAAATGGCG 13 80 

TTTCCGCTAA ATCTTTATGT TTAATTTCTT CTATTTCTTT ATCTTTGTCA AAATGCTTAT 1440 

ATAATTTTAC TTCTTTAAAA TATGTCAATA AAAGAAAAAT CATTGTTAAA CTTAAATTCA 1500 

so 

AAGAAACAAT ATAAAATAAA CTGTCTATTG GAAAATCATA ATCGATTAGA CTArTGCCTA 1560 

ACATAAGGAA GTTTAAAAAC AATATCCAAA AtATCCAGTT CATGCGAGAT TTCAAAAAAT 1620 

SS 
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GCACTATCCA TACTAATTTC AGATA 1705 
(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 

TCAATATATC TTAAAATTCA ATGATTAAAT CnATTATCAC TAGACATnAA ATACATAAAT 60 

CCTATTCCCC ATTTTCATTT nTTAATTCAT AAATGAATCA ATAACCACCT AATAACAAAT 120 

CATATTATAC ACCTTTGTTC TCTATTTTTC TAAGGTTTAA AAAATATTTT TAGGTAAACC 180 

20 TAAAAATAGA TGTAATAAAA ACGCCTCCTC AGATATTTAT ATATCTATGA AGACGTTTAA 240 

ATACATTATA GATGGTCTGG TTCTGGGTGA ACGTATACTG AGGAAAtACC TTTTTTGTGC 3 00 

AAATGATGTT CGACATTGTC ACAAATTTGA TGCGCTTCTA CTAAGGAAAG GTTAGCATCT 360 

ACAACAATTG TGACATCAAT AAACACACTA CTTCCATGGT AACGCCCTTT AATACTTTTA 420 

ACTTCTTGTA CTTCATCAAC TTCTAAAATA TCATTGCGAT ACGCTTCTAA TTCAGTTTCA 4 80 

TTGAAACCAT CACTCAACAT AAAAATTGCT TCTTTAAAAA TACCAAAACC AGTATAAACG 54 0 

ATTAGTAAGC CTAGTAATGT TGCTAAAATA ATATCGACAA TTGGGAAACC GATTTGCGTA 6 00 

AAAATTAATC CTATCGCTGT TCCAATGCTG ACTAAACTAT CCGATAAATT ATCTTTGGCA 6 60 

35 GCCGAATTTA AAGAACTACT TTTCGTTCTT TTCGCTAGTC TTTGATTGAC TGCAAATACA 720 

ATCAACATTA CAAGACCACT GATTAAGCTG ACGATAATTG TTATTGCGTT AGGTACAACG 780 

TCATCTTCTT TGAACAAACG AGGTGCATTT TGAATAACTA CTTGGATACC TACAAACATA 840 

ATGACAAATG ACACCAATAA TGAAGAAATA TTTTCAGACT TCAAATGGCC ATAAGGATGA 900 

TTTCGATCGG CAGGTTTAAT TGAAATTTTC AATCCAATAA TAACAGCTAA AGAAACGATA 960 

ATATCTGTCA TATTGTTTAA TGCATCGGCT CTTACAGCTG CAGAGTTAAA GACAAAACCC 1020 

GTGACATACT TAACAATAGA TAAGATTATA TATACAATTA AACTCAAATA AGCACCGCGT 1080 

TGCGCCAATT TAAGATTTTC ATTATGAGAC ATGCGTTGAA CCACCTTGAA TTAGTATAGT 114 0 

AACAATATTA TGAATGATTC ATTTTAATTT TACAACGTTT TTAATTTTTA TAAATTTTTA 1200 

TAAAATTAAA CTAATTTATT CATTCGCAAC CCCTAAAAAT AATTTTTAGC CTTTCTGCGA 126 0 

ATTTTATGAG CTAGAAAGGC GCCCAACTCT CCCTGTTTGT TAACTTTCGC CTCGAAAGTT 1320 
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CGAAvwTTAT GAGCTAGAAA GGCTTATGCA GTTGACGTTT TACX3TCCAAC TGCGTTCCTC 
CGTCTTCTTC AAATTTATTT GTnAGAAAGG CACCCAACTC TCCCTGTTTG TTAACTTTCG 
CCTCGAAAGT TTCTATGTTA GAACCCTATG CATGAGTTGC GAAnTATCTA ATGTCGTGAA 
CTAATTATAT AGAAGAAAAA GTGCATCAAT GACAAATTAA ATGAGATTTC TACTCTACCA 
AACTCTCTTC GAAAGACAAT TTTCTCCTCT ATTTATTAGC AACTATTGCA TTTCTCCATA 
TAGTACTTCC TTACTTAAAA TACGCTGAAT GTCTGAATTA AA 
(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 
GCGAGACCCC CTGAGGGAGC AGTGCCCAGT CGAAGACCCG AGGCTGAGAC GGCACCCTAG 
GAAAGCGAcC ATTyCAATAC GAaTTgTGAt AAATAGAGAA CAGCAGTAAG ATATTTTCTA 
ATTGAAAATT ATCTTACTGC TGTTTTTTTA GGGATTTATG TCCCAGCCTG TTTTTTGTGA 
TTTTTAATAA TTTGAATATG GAAAATGTAT TATTCTCTCA TTTGTATAGA TTGTATTTAA 
TAAGTTAATG TAATCCTTGA GCTCACGATT AATAAAATTC TATAACCTTA ATTATTTTCT 
CGATACAAAG GGTTATTAAC TTTAATATAA GTATAATGAT GTGCCTCATC TTCAAGACGC 
ATTGTTGTAA CACACTTATA ATCTATAAAT GGCGCGAACA TGGTATCTTT AATTTCaTTT 
ATGCGATTCT CATTTACTTG ATTAGATTTG TGTGTCGAAA GTACAAGTTG ATCaAAAATG 
TTATCTAGTA CATCACGAAC GATATACCAC ATATGTCTTT CTAAGTTTGA ATCATTTGAT 
GCTTTAGAGA TTGTAAGAAT TAATTCGCCT AAATGGTTTT GAACX»TAGA ATAAAAGGCT 
TTGTTAAACA CAGAC G TTTT TGAATCAGTA AGAATTCTTG ATTTTTCATG GAAATGAGAT 
GTACTGTATA CCATTTCATT TAGTTGTGCT TTATCAATTC TTAAACCTTC AAAGTCTCTA 
ATATACATCG TGTCCAATAG ACCATCTTTC CGAAATGTTG CAATAGCATT TTGCAAGTGA 
GCCTCTAATG CAATGCCATA TTTAGTAACT AGTGGGATTA CGAGACCAAG CAATGCTTTA 
CTATAAGTTT CAACCCACGA TTTCGCCGAT GATTCAAAAT CAGACAATGA TGCAGCTGAT 
TGATAACGTT TAATCAATGT CACGATAGGT GATTCATTGT TAAATGGGTA GGTTGCAACT 
AAGCTTGAAG GAATCAATGG TGTGACTTCT TGTGGAATCA TTTGGTATAT ATTTTTTCTA 
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AAAAATGAAT 


ACCAGCAACT 


TCATCAATAA 


TTGTTGATGC 


ATAGGACTTA AATATGACAT 






CTTTCTCCAA 


AATATCATTT 


AAAATACGTG 


TCATTAGTGG ACCATTGTGC GTCGTTTGTT 


1200 


5 


CTGATAATGT 


ACGAATCTCA 


CCTGTAATAT 


GAACGTTTGT 


CGACAATTTG 


ATGTGTGGCG 


' 1260 




ACATAGCTGG 


GTATTTAGGA 


ACTAATGTTC 


TGAAAGATAA ACCAGCATAA 


TAATCCaACG 


1320 


10 


TATGTTTTGC 


TTCAATGATT 


AATTCTTTAT 


CTACTTCTGC 


TTGATAATCA 


GAATGTAATA 


1380 




CGTCATCTAA 


TTGCCATGGA 


TGAACAATCA 


TAATGTGATA ATCATTAAAG 


TTAAACTTTG 


1440 




GCGTAAATTC 


ATTTTCTAAT 


TGTTTAATTA 


AGTCCGGAAA 


TAGTTGATGA 


ACAGTTGTAT 


1500 


IS 


CATAATCTTT 


AGACAGTGAC 


ATAGTACGGC 


TTAATTTACT GTGAATCAGT 


ACTATTTTCA 


^ ^ o u 




ACTTAATAGG 


TTGATTGAAT 


TCTGAAGAAT 


ATAGGAATGT TTGTAATGCA mAAACCTT 


■L O ^ W 




TACGTAATTT 


AGCCCCAGGA 


TGTAGCGGAT 


GACCTTCAAT 


AACGGCTTGC 


TCTGAACGCA 


X 9 O w 


20 


AGTAACTATC 


TTCGCTATTT 


TCGATAATAT 


TAAATAAAGG 


TGCAGAATCA 


TGTTGCATTG 


J. / •» u 




ACAGTGCTTG 


ATAGCTAATT 


GCAAATGTCA 


TATtAGTtGC 


ACTGTTTATT 


AAATCTTGCT 


1800 


25 


GAAATTGATC 


ACTAGCAGCA 


TTTT-rrAAAT 


CTGGTGCTTC 


AATTAAAATA 


CACTCAAGAA 


1860 


TTTCATTTGG 


ATGGTGTACT 


CGTGTAATCG 


TATTTGTAAT 


GTCATCTTTA 


ATGTAGAAAG 


1920 




GGCCTTCAAC 


ATCAATTCGA 


TCAAAGGCGT 


GTTCTCCAGT 


GATAGGAGCA 


TATAATGTTT 


1980 


30 


GCTTAGCTTG 


TGGGAACTGG 


ATTTCTAGTA 


TATGAGTCGT 


TGAGATATCT 


AACATAATCA 


204 0 




AATCACGACT 


CAATATTTTC 


TTACTTTGAG 


TGCGCGCTTT 


AACTAAGTTT 


TCGCGATGCA 


2100 




TTGATGTGAC 


CAATCTCTGA 


GTGACTTTAT 


CTCTTCCTTG 


TAAAATCATC 


TCTTTAAAAA 


2160 


35 


TATTAGCCCA 


ATCGCTATTA 


TGTTGTTGTA 


AAAATAAATA 


TGTTTCTTGT 


TCTTCTTTGT 


2220 




TAAATTTTAA 


TGTCTGTTCT 


TTAAAAATTA 


AGTTCAAGTT 


CATAATTCAC 


CTCTATGAAA 


2280 




TATTTTACAA 


AAGCAAGATA GATTTGTATA ATCCATATTA ATGATAATGA 


yTCTTATTAT 


2340 


40 


CAACAGAATG 


CX3GGTGTAAG 


TTTTATGACA 


AAATATTTTT 


TTAGCAGTTC 


TTTTCTACTA 


2400 




TTTCTAGGTA ATTGGATTGG 


ACAAATAGGG 


CTAAATTGGT 


TTGTACTTAC 


CACTTATCAT 


2460 


45 


AACGCAGTTT 


ATCTGGGGAT 


TGTCAATTTT 


TGCAGACTTG 


TACCAATATT 


ATTACTAAGT 


2520 


GTGTGGGCAG 


GGGCAATTGC 


CX3ATAAATAT 


GATAAAGGGC 


GATTGCTGAG 


AATTACAATT 


2580 




TCATCATCAT 


TTTTAGTAAG 


TGCAATTTTA 


TGTGTGCTCA 


CGTATAGTTC 


ACTGCAATTC 


2640 


50 


CAATTAGCGT 


CATTATTATA 


TATGCGACAT 


TAAGAGGGAT 


TTTAAGTGCG 


GTTGAAACAC 


2700 




CTTTAAGACA 


AGCAATCTTA 


CCAGATTTAT 


CAGATAAAAT 


ATCTACTACA 


CAAGCTGTtnw 


2760 




CATTTCATTC 


ATTCATCATT 


AATATTTGTC 


GTTCAATAGG 


GCCTGCCATT 


GCTGGTGTCA 


2820 
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CAGTTTTATT ATGCTTACCA TTACATTTTA AAGTAACTAA AATACCTGAA GaTGCATCAA 294 0 

GaTACATGCC GTTAAAAGTT ATTATAGATT ACTTCAAATT ACATATGGAA GGTCGACAAA 3000 

TATTTATAAC ATCATTATTG ATTATGGCGA CAGGTTTTTC ATATACGACA CTTTTACCAG 3060 

TTTTGACAAA CAAAGTATTT CCGGGGAAAT CTGAAATATT TGGTATCGCT ATGACGATGT 3120 

GTGCCATTGG TGGTATTATT GCAACGCTAG TTTTACCTAA AGTACTTAAA TATATTGGTA 3180 

TGGTAAATAT GTATTATTTA AGTTCATTTT TATTTGGCAT TGCTTTGTTA GGTGTGGTAT 324 0 

TTCACAATAT TGTCATCATG TTCATTTGTA TTACATTGAT TGGGTTATTT AGTCAATGGG 33 0 0 

CACGTACGAC AAATCGCGTT TATTTTCAAA ATAATGTTAA AGATTATGAA CGTGGTAAAG 336 0 

TACTGAGTAT TaTTATGATG GgATAGAGGT ATGaTTCCAT kGGGAAGTCn ATTAATGAGT 34 20 

ATATGTGCAG ATGTGTTTGG CATTGTTAGA ACTTTTTCAA TAATGGGAAT AAGTACTATA 34 8 0 

20 TGCATTACAA TGGTATTCTA TTTTATAAAT AGAAAGTTGA AGTTAAAGTT GGAGGAAAGT 3 54 0 

AATCATGGTA TATCTTGAAT GGGCAAAGGC AGATAGAAAT ATTCAATATC GTGTAATTAA 3600 

CGCCATTATT AAAGAACGTA TTTACCCCGA GCAAACATTT ATTTCGCAAA AAGGATCTTT 36 60 

AATTGAAATT CAGTATCATA TGCATGTGTT GACTATTGAA GTTGTTAGAA AAAGTGCATT 3720 

AGAACGCTAT GAGTTTACAG GTGATATTAC TTATTTAAAT AAAGGTGAAA CGTCATTAAT 3 7 80 

TATAACTTTA GAAGGTTTAT TAGATGTGTT GAATCATGAC TTTGATATCC CTATTTCAGA 3 84 0 

GCGACTACGC GAAGAGTTAA TACACAGTCG AGATAGTTTA GTTGAAACAT ATAAGCAAAT 3 900 

GTCTCACAGA CAAACGTTAA TAAGTCmAAG TTTTAAATTT TCAAGGTTAC CACAAGATAT 3 960 

TAACTTTTTT TCakGGTtAC AACATGTAAA AGATAGTGAT AAGACAGATG ATTTAACTTA 4 020 

TTCTGAGAGT TTGGTACCAG AGGGGCATCC AACACACCCT TTAACCAAAA CGAAATTGCC 4080 

CTTAACTATG GAAGAAGTAC GAGCATATGC ACCTGAGTTT GAAAAAGAAA TCCCTTTGCA 4140 

40 AATTATGATG ATTGAAAAAG ACCATGTTGT GTGCACAGCT ATGGATGGTA ATGATCAATT 4200 

TATTATTGAT GAAATAATTC CCGAATACTA CAATCAGATT CGTGTGTTTT TAAAGAGTTT 4260 

AGGTTTGAAA AGTGAAGACT ATAGAGCGAT TTTAGTACAT CCTTGGCAAT ATGATCATAC 4320 

GATAGGGAAA TATTTTGAAG CATGGnTTGC TAAAAAAATA TTAATTCCAA CGCCGTTTAC 43 80 

AATACTTcCA AAAGCaACTT aTCatTTaGG ACGATGTCTT TAATTGATAA ACcATACCAT 44 4 0 

GTTAAgTTGC CCgTCGATGC aCAAGCAACA AGTGCCGTTA GAACAGTCTC AACTGTGACT 4 500 

ACTGTAGATG GACCAAAGTT AAGTTATGCT TTACAAAACA TGTTGAATCa ATATCCaGGA 4 560 

TTTAAAGTTG CTATGGAACC GTTCGGTGAA TATGCAAATG TTGATAAAGA TAGGGCACGT 4 620 

55 



30 



35 



45 



SO 



1307 



EP 0 786 519 A2 



AGTGCAAGTC 


TAGTTAATAA 


AAATCCAATA 


GATCAAAAAG 


X. A X \»o X v*vx/\ 


1* & G TT A f^TT A 

X AO X X A V» X X n. 


GAGTGGTTAA 


ATCAAGGAAT 


TACTAAAGAA 


AGTATTACGA 


X X n X X 


A PGAT ACGfT 


CAAGCATTAA 


TCCCGCCTTT 


AATTGCTTTT 


ATTCAAAATT 


ATGGAATTGC 


TTTAGAAGPA 


CACATGCAAA 


ATACAGTAGT 


GAACTTGGGG 


CCACATTTTG 


ACa TTC^AATT* 


TTT AGTG AG A 


GATTTAGGTG 


GTTCTAGAAT 


TGATTTAGAA 


ACATTACAAC 


AT CGTGT ATC 

f\ X X ^7 X X W 


AGATATTAAA 


ATTACAAATG 


ATAGTTTAAT 


AGCTGATTCT 


ATAGATGCAG 


TGATTGC!AAA 


ATTC?C!AA fTA T 


GCTGTTATTC 


AAAATCAAAT 


GG CAGAATTA 


ATCCATCATT 


TTAATCAGTA 


X\Jx\X XV^XVJXX 


GAAGAAACCG 


AATTATTTAA 


CATAGTACAG 


CAAGTAGTAC3 


CG CATGT'P AT 


T A A f^rr* A A ^ A 


CTACCACATG 


CAAATGAGTT 


AAAAGATATT 




PAAr^AATTAr* 


Tf2T*f* A A A rri 


TTGTTAAATA 


TGAGAATGGA 


AAATAAAGTA 


AAGCAATATT 


T AAAT ATTG A 


nTTAr: ATA at 

wX XAunX/vaX 


CCGATAAAAA 


AAGAGGTGTA 


GTACTACATG 


GCACACGTTA 


ACATAAATAT 


ATCGAAGATT 


AAaTATAACG 


CCAAAGTACT 


TCAAACAGTT 


TTTCAAAGTA 


AAAATATGCA 


ATTCACACCA 


GTAATTAAGT 


GCATAGCTGG 


TGACCGTACA 


ATTGTAGAAA 


GCTTAAAAGC 


GTTAGGTATC 


AATCATGTTG 


CAGAATCCAG 


ATTGGATAAC 


ATAATTAGTA 


TTGCAGATAC 


AGGATTTAAC 



A 

(2) INFORMATION FOR SEQ ID NO: 4 09: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 
AGGTCGTCTr GCaGmAGmGm TATTTGACGA TCTTGCTTTC CCAAAACACG ATGATGATTT 
TAACATACTG TCTGATTATA TTGAGACACA TGGTGATTTC aCATTGCCAA TGTCTGTATT 
TGATQATTTA TATGAAGAAT ATACGGAATG GCTAAAATTT TAATATAATT TTTAATAATA 
ATAGTTAGAA CCAGGGTGAT GCAATTCGTT ATCCTGGTTT TAATTTAAAA TAAACTAAGT 
TGTGACTAAA AATTAATCAA TTATAGTGAA ATATGGTGCG CTAT C TTGCA TAAATTGATA 
TGATTAACTA CACAGAATTT AAAAGTACAT AATACATAAT AAGGAAGTGA TACAATGGAT 
GATAAGCAAC ACACATCTTC ATCCGATGAT GAACGCGCTG AAATTGCAAC AAGCAATCAA 
GACCAAGAAA CTAATTCATC GAAACGCGTT CACTTAAAAC GTTGGCAATT CATATCAATA 
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CAAAAAATAA 


GTGGTTTAAA 


CAAAACTGAT 


CAAGCAAACT 


TAAATAAAAT 


TGAAAATGTG 


600 


TATAAAATCT 


TAAATAGTGA 


TTATTACAAA 


AAACAGGACT 


CTGACAAGTT 


AAGTAAAGCT 


660 


GCAATTGATG 


GCATGGTCAA 


AGAATTAAAA 


GATCCTTATT 


CTGAATATTT 


AACAAAAGAA 


720 


CAAACGAAAT 


CCTTTAATGA 


AGGTGTTTCA 


GGTGATTTTG 


TAGGTATTGG 


TGCAGAAATG 


780 


CAAAAGAAAA 


ATGATCAAAT 


TATGGTTACT 


AGTCCTATGA 


AGGGATCTCC 


AGCAGAACGT 


840 


GCTCGCATTC 


GTCCTAAAGA 


TGTCATTACT 


AAAGTAAATG 


GAAAATCAAT 


TAAAGGTAAA 


900 


GCATTAGATG 


AAGTTGTCAA 


AGATGTTCGT 


GGTAAAGAAA 


ACACTGAAGT 


CACTTTAACT 


960 


GTTCAACGAG 


GTAGTGAAGA 


AAAAGACGTT 


AAGATTAAAC 


GTGAAAAAAT 


TCATGTTAAA 


X020 


AGTGTTOAGT 


ATAAGAAAAA AGGTAAAGTT 


GGAGTTATTA 


CTATTAATAA 


ATTCCaGAAT 


1080 


GATACATCAG 


GTGAATTGAA AGATGCAGTT 


CTAAAAGCTC 


ACAAAGATGG 


TTTGAAAAAG 


1140 


ATTGTTTTAG 


ATTTAAGAAA 


TAATCCAGGT 


GGACTACTAG 


ATGAAGCTGT 


TAAAATGGCA 


1200 


AATA'l-rriTA 


TCGATAAAGG 


AAAAACTGTT 


GTTAAACTAG 


AnAAAGGTAA 


AGATACTGAA 


1260 



G 1261 
25 (2) INFORMATION FOR SEQ ID NO: 410 r 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2488 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 

35 



40 



AAATATATTG 


AAnAGAnAAT 


TACTAAGATT 


AAATCnTCTT 


AAAATATCCC 


TGAAATAACG 


60 


TCCTAAAGAT 


TAAAGGAAAG 


AGGTTATAAG 


TTATGCCAAA 


ATTAATTTTA 


TGTCGTCATG 


120 


GACAAAGCGA 


GTGGAATGCT 


AAAAACTTAT 


TTACTGGATG 


GGAAGATGTT 


AATTTATCTG 


180 


AACAAGGTAT 


TAATGAAGCG 


ACTAGAGCAG 


GTGAAAAAGT 


AAGAGAAAAT 


AACATTGCCA 


240 


TCGATGTAGC 


TTTTACATCG 


TTATTAACAC 


GTGCTTTAGA 


TACAACGCAT 


TATATTTTAA 


300 


CTGAATCTAA 


ACAACAATGG 


ATTCCTGTAT 


ATAAAAGCTG 


GCGTTTAAAT 


GAACGCCACT 


360 


ATGGTGGATT 


GCAAGGCTTA 


AATAAAGATG 


ATGCTAGAAA 


AGAATTTGGA 


GAAGAACAAG 


420 


TACATATTTG 


GCGTCGTTCT 


TATGATGTGA 


AACCACCTGC 


TGAAACCGAA 


GAACAACGTG 


480 


AAGCTTACTT 


AGCTGATCGT 


CGATATAATC 


ATTTAGATAA 


ACGTATGATG 


CCTTATTCTG 


S40 


AAAGTCTGAA 


AGATACTTTA 


GTTCGAGTGA 


TACCATTTTG 


GACAGATCAT 


ATTTCACAAT 


600 
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TTAAATATCT 


TGAAGATGTG 


TCAGATGAAG 


ATATf!ATTAA 


TTATGAAATT 




720 




CACCGCTTGT 


TTATGAATTA 


ACGGATGATT 


TAGAAGTTAT 


AGATAAATAC 


T A a A A A 
X X X X 




5 


AaAAGAGCTG 


CATGTACACA 


AGGAGTGAGT 


GTATATGcAG 


CTCTTAAAtA 


TGTGAAGTAA 


O J A 




TGTAAGGAAA 


TAGTTAAGTA 


TAGAGTTTAT 


ATTAACGAGC 


TAGGGATACT 


CGAAAATATA 


Q n A 


10 


GTTAGACATA 


CAATATAGTC 


AAATTAAAAC 


AATTATTTCG 


CTCTTTTATG 


'PfftOT^TA ATA 
X xuwx x/v\x*\ 


OCA 


ATCTTTAAAG 


CACGCTTTCT 


TGTTTTAATG 


TT AGGG err AT 


TTAAATTACG 


ACG AG CA GTP 


1 A *) A 

X u ^ u 




TGTAAATCTA 


ATTTCATCTC 


TATCCCTCCT 


TGTAAATATA 

A A **rV^<A 


TTATGACPOA 

X X X \di** w ^»\JX* 


T A A rr A r"Tr* A 


T A Q A 

X U O u 


IS 


TATX3TAAATA 


GTAATGATTA 


CGTTTTAAAG 


AAATTGTAAT 


AAAGTCGTGC 


X#V^X X X X X Xw 


X X4 u 




GAAAATGGGT 


ATAATTACCX3 


GATATCTAAA 


AATGTGTGTir* 


v> X X X X X xxwatA 


TYVSTf^ A fS/T/^n 
X viO X 


1 *> A A 

x^ uu 




GAAGCTTTAA 


ATGTCGAAGA 


AACAAAAATT 


AACGATGATT 


ATTACTATGC 


TGATGGGTGG 


1260 


20 


A'nTTTTGGA 


TTATTAAATG 


AAACACTATT 


AGTGACGGCT 


TTACCAAC5TA 


TT A TYJ A A A A 


1 1 T A 




TTTTGAAATT 


TCATATACAC 


AAGTTCAATG 




V7Vb X X X X X x/*x 


TV; A r*T A A TVy!! 


1 O A 

X^ oU 


25 


GATTGTTATT 
AGTGGGTATT 


CCTTTGT CCG 


CGCTTGTTAT 


ACAACGTTAT * 


ACAACAAGAP 


AAfSTnTT^TTT 
X o X X X-1 X 


1 A A A 

X*4 u 


TCTATCTTTT 


TCTTAGGTAC 


ATTACTCGGC 


GG CTTGAGTC 


^VJVfXlV* XXX 


X3 uu 




AACATTATTA 


GTTG CTAGAA 


TTATTCAGGC 


GTTAGGCGCA 

\J X X **VS ww\J^>** 


GGTATTATY? A 


Tf^ A TTYl A T 


XsoU 




GATGACAACG 


ATTTTGGATG 


TTTTCCAACC 


ACATGAACGC 


GGTAAATATA 


TGGGGATATT 
X vTVjvjFvxx* X r* X X 


X o z u 


30 


TGGTTTGGTA 


ATTGGTTTAG 


CACCAGCTAT 


TGGACCTACT 


\v XXX ^ainxTV? X X 


nxvW X xv7X Xxjin 


icon 
X o o u 




ATATTTTAAC 


TGGAGATCGC 


TTTTCCATGT 


TGTCGCTCCA 


ATTGCAGCTG 


TGAC!!ATTTTT 

X\jn\«**X X X X X 


X / *t u 


35 


AATTGGaTTT 


AAAACGATAA 


AAAATGTTGG 


AACTACAATT 


AAAoTACCTA 


TTGATTTTAT 

X X V7X* X X X X X* X 


X O U V 


TTCTGTCATT 


TTTTCTGTAC 


TAGGTTTCGG 


cGGGTTATTG 


tATGGAACGA 


fiTTf**A ATTTf* 

X X ^vOW^ XXX w 


1860 




AGAAAAAGGT 


TTTGATAATC 


CTAcGtATTA 


GTATCTATGA 


TTGGAGGCGT 


IXaTTTTAGTC 

X V9 XXX X'^*W X W 


X 7 ^ V/ 


40 


GCATTATTTG 


TAwTACGTCA 


ATATCGGCTA 


TCAACACCAT 


TATXaAATTT 


TG^TGTATTT 

XwWxWX**X X X 


1980 

X 7 O W 




AAAAATAAAC 


AATTTACAGT 


TGGTATCATT 


ATTATGGGTG 


TCaCAATGGT 


ATCGATGATT 

** X \— vjrtX w*\X X 






GGTTCGGAAA 


CGATTTTACC 


TATCTTTGTG 


CAAAATTTAT 


TGCATCGTTC 


AGCTTTAf5AT 


2100 


45 


TCTGGATTAA 


CTTTATTACC 


AGGAGCAATT 


GTTATGGCAT 


TTATGT^GAT 

X X X \9 X W««r* X 


GACfTT C*f3r3f;T 


2160 




GCTTTATATG 


AAAAGTTTGG 


TCCTAGAAAT 


CTTGCTTTAG 


TAGGTATGGC 




2220 




ATTACTACGG 


CTTATTTTGT 


TGTAATGGAT 


GAACAAACAT 


CAACAATCAT 


GTTGGCAACA 


2280 


50 


GTTTATGCGA 


TTCGAATGGT 


TGGTATCGCG 


TTAGGATTAA 


TTCCAGTAAT 


GACCCATACG 


2340 




ATGAATCAAT 


TAAAGCCAGA 


AATGAATGCA 


CATGGTTCAT 


CTATGACAAA 


CACAGTACAA 


2400 



55 



8NSCXX:iO: <EP 0786S19A2_L> 



1310 



EP 0 786 519 A2 



AACTTTTCTC CAACTATGTC AGACTATA 
(2) INFORMATION FOR SEQ ID NO: 411: 

^ (i) SEQUENCE CHARACTERISTICS": 

(A) LENGTH: 1105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 



15 


TTTTACAATT 


TCA(3ATATCT 


CTAAAACACA 


TACCGATCCA 


ATGGCCAAAA 


TTATTCGTCA 


60 




AAAATTGAAA 


AAACTAGGAA 


TTCGTAAAGG 


GATTCCAGTT 


GTATTTTCAG 


ATGAAAGTCC 


120 




AATTGTCATA 


AGAGAAGATG 


TAAAAGATAT 


AGTTGGAGAT 


AAAAATGCTA 


TCAATCGAAA 


180 


20 


AGGACAAATG 


CCACCTTCTT 


CAAATGCCTT 


TGTGCCAAGT 


GTTGTTGGAT 


TAATTTGTQC 


240 




AAGTTATGTG 


GTGAATGACG 


TATTAAAAGA 


TATTCCAGTT 


CGTCGCATTA 


AAGACAAAGG 


300 




GCAATAATTC 


ATTTTGAAAG 


GGATAATTTT 


CAACGTAAGG 


CAAGTGTAAC 


CACACCATAA 


360 


25 


AACTAAAACT 


GACTAGTTCG 


CAAGCATAGT 


AGAATATGCC 


TCGTGTACTA 


GTCAGTTTTG 


420 




ATTTGATTAT 


AGCTAAAATA 


TCAAAGGTAA 


TCGATTTAAT 


CTTATTTTTA 


CGGCCTTTGT 


480 




TTAAGTAAGT 


TGTTATATAT 


TTCTTTGAGT 


TGTTGTTCAC 


TTTTAGATGT 


CGTTTTTGGT 


540 


30 


TCGTAATAAA 


TTTTGTTTTT 


TAGTTTATCA 


GGTAAATATT 


GCTGTGAAAC 


ATAGCCATTA 


600 




ACATATTGGT 


GTGGATATTT 


GTAACCAATA 


GATCGGCCTA 


GATCTTTAGC 


ACCTTGATAA 


660 


35 


TGTCCATCTT 


TTAAATGGTT 


TGGAATTTGG 


CCCACATGAC 


CGTTTCTAAT 


ATCGGACAAT 


720 


GCACTATCAA 


TGGCACTCAT 


TGCTGAATTT 


GACTTAGGTG 


ATAAGCATAG 


TTCGATTACT 


780 




GCTTGGCTTA 


GTGGAATTCT 


AGCTTCTGGT 


AAACCTAGAC 


GTTCTGCTGA 


TTCAATAGCA 


840 


40 


GCAAGTGTTC 


TCTGACCAGC 


ATTAGGTGAG 


GCTAAGCCTA 


TATCCTCATA 


GCTAATTACA 


900 




AGTAATCGTC 


GAACTATTGT 


AGGTAAATCT 


CCAGCTTCAA 


TTAATCGTGC 


T AAATAATGT 


960 




AAAGCGGCAT 


TGACGTCGCT 


ACCACGGATA 


GATTTTTGGA 


AAGCGCTCAT 


AACATCGTAA 


1020 


45 


TGCATGTCAC 
TGCAATGTAA 


CATCCTTGTC 
CATGTCGATA 


ACTTACAAAT 
ACCGT 


GCACCTTTTT 


GTAAACAGTC 


TTTAGCATCT 


1080 
1105 



(2) INFORMATION FOR SEQ ID NO: 412: 

SO 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 

^ TAACGTCGAA CTTGAGCTGT TACGTTATGA CTCATAATTA TTTTAGCATA GTCATTTAAA sO 

TMiACTTCTG TTCTGTCTGT TGGATAAGCA AATTCAAGCA ATTGACTGTA ACTATCATTA 12 o 

ATGACTTCTT GATCAACATG ACTATCAAAA TATACAGCAT AATAATAAGT ACCATCAACC 180 

,0 ATATATAACA AATCTTCAAA CTCTGTAGTT ATTGGATTGC TATGATATGC ATAATTAATA 240 

ACATCTTCTA AATCGTTAAA TTTCACAATG ATTGTTCTTG TATTTTTACG TGCTGAAGAC 300 

TTTTGACGTT TAGAACCTTG AGCTTCTTTT TCTTTTGTTC GTTGCTCGAA TAATTCTTCT 360 

^5 AATTGATCTT CACCTTCTAA TGTTTGAGCT AACAATTCTT GAACTTGTTC ATCAAATkGA 420 

TCAGTTGCAT CATCATCAGA CATATTCATC ATATCTTCAT TTTTAGATTT AGAAATTGTG 4 80 

ACTTCGACAC CTTTTTCAAA GGCATGTACT TGAATCCATA ATGGAGCTnC TACAACAAAA 54 0 

TCTTCTACTT CGTTAATTTC ATCCATCATT GAnCAAAAG 579 

(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : dotible 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 

GTGGTGAAAC TTCTTGCTTT GTAATTTTAT AAAGTGGATC AATATCACTC TGACTTACAT 60 

3S CAGGTTGAAC TGTCATCTTT TTAGTTACTT TGTTTTCAAG CGTAATATTT GCTAATTcTA 120 

GGtTTTACAG AATAATGAAT GTCATCAGCT AmCCCTTTmm CtTGATACTy CTCTTGATAT 180 

TTTCCAGCTT TAGCATTTGA TAAATCAATC ACTACTCTTA AATCTTCTGG ATTTTCAATT 240 

40 TTTATTATCT TTGATTGTGG TCCTGAAATA GTCACATTAA CTGTTTCAGG CGCTTTTGTT 300 

AAATGCAAAT CTTTAGTGTT ATAAAGAATT TCAACGGGTA CATCTTGAAT CGTTTTACTA 360 

GACTTTTGAC CAAGATTACC AGTGTTAAAG ATATTTCCAA AAACATTGTT AACAQATAAA 420 

AAGAAAAACA ATGCCAAAAG AAAGGCAATA TJ^TCTCAAGC CCCATTTACT TTCTAGCATA 4 80 

TTATTTCACA CCTTTCTTTT GAAAGCGTGT GCCAAACCAA TGTTCAGCAA GCAACTCTTC 54 0 

AAAAATTTCG TTTGAAATGT CTCGTCGTAA TTTTCCATCA AATGTTACCG AAATATCACC 600 

AGTTTCTTCA GATACAATAA CGGTAAATGC ATCAGATACT TCTGAAATAC CAACCGCAGC 660 



45 



SO 
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TGCTGCTGCT GCAATCTTCG TGCCTTGAAT AATCATTGCA CCATCATGTA AAGGTGTGTT 780 

AGGTATAAAG ACATTAATTA AAAGTTCTTG CGAAATATTT GAATCCATTG CAATACCTGT 84 0 

^ TTCAATATAA TCTTGAAGAC CTGTTTCTTT TTCAAAGACA ATTAATGCAC CTATACGTCT 900 

TTTAGCCATA TATTGCACAG CCTTTGAAAC CGATTGAATC AATTTCTCTT CATCTTTACT 960 

ATACGTATTA GAAGTATAGC G TTTTAAAAA GCTACCTCTA CCAAGTTGTT CTAACGCACG 1020 

10 

TCTAATTTCT GGTTGGAATA TTACTATTAA AGCTAATACC CCCCATTGAA TAACGATATC 1080 

GAATAATTTA GATGTTGCAG TCAAGTTCAA TATCATACTT ATCTGCTGAC CAATAACAAT 114 0 

TACTAATATC CCTTTAAGTA ATTGTATCGC TTTAGTTCCC TTAAAGACCG TGATGAGAAG 1200 

IS 

ATAAAGTACA TACCAAACTA TCAGTAAATC AAGGATACTC GTTACAATrT TTAACGTACT 1260 

GAGGmTGA AAAAAGTTGG AAAAATCCAT AACATCTCCT CCGGGTAATA TTTTTCCATA 13 20 

20 ATACCCATTA TACCAATCAT TT 1342 

(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 
25 (B) TYPE: nucleic acid 

<C) STRANDEDNESS : double 
( D ) TOPOLOGY : 1 i ne ar 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 



35 



40 



45 



CTGTATAAAG 


ATGGAGGTGT 


TTTGCATGGT 


AAAACGTACT 


TATCAACCAA 


ATAAACGTAA 


60 


ACATAGTAAA 


GTTCATGGTT 


TCAGAAAACG 


CATGAGCACA 


AAAAATGGCC 


GTAAAG'rrrr 


120 


AGCGCGCCGT 


CGTCGTAAAG 


GCCGTAAAGT 


TTTATCTGCA 


TAAGATCACT 


GACTCATCAG 


180 


TGAtCTkTTT 


TTTCGTTTAA 


ATTAAGAATA 


AATAGAAATT 


TATGTTATAA 


GCTCAATAGA 


240 


AGTITAAATA 


TAGCTTCAnA 


TAAAAACGAT 


AnATAAGCGA 


GTGATGTTAT 


TGGAAAAAGC 


300 


TTACCGAATT 


AAAAAGAATG 


CAGATTTTCA 


GAGAATATAT 


AAAAAAGGTC 


ATTCTGTAGC 


360 


CAACAGACAA 


TTTGTTGTAT 


ACACTTGTAA 


TAATAAAGAA 


ATAGACCATT 


TTCGCTTAGG 


420 


TATTAGTGTT 


TCTAAAAAAC 


TAGGTAATGC 


AGTGTTAAGA 


AACAAGATTA 


AnAQAGCAAT 


480 


ACGTGAAAAT 


TTCAAAGTAC 


ATAAGTCGCA 


TATATTGGCC 


AAAGATATTA 


TTGTAATAGC 


540 


AAGACAGCCA 


GCTAAAGATA 


TGACGACTTT 


ACAAATACAG 


AATAGTCTTG 


AGCACGTACT 


600 


TAAAATTGCC 


AAAGTTTTTA 


ATAAAnAGAT 


TAAGTAAGGn 


TAGGGTAGGG 


GAAGGAAAAC 


660 


ATTAACCACT 


CAACACATCC 


CGAAGTCTTA 


CCTCAGACAA 


ACGTAAGACT 


GACCTTAGGG 


720 
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840 
900 

960 
1020 



TTTAGATACA ATTACGAGTA TTTCAACACC AATGGGTGAA GGGGCAATTG GAATTGTTCG 
ATTGTCTGGA CCGCAAGCCG TTGAAATTGC TGACAAATTA TATAAAGGAA AACATCTTTT 

5 

AAATGATGTT CCATCACATA CGATTAATTA CGGTCATATT ATTGATCCAG tCTAAAGAAG 

TGGTTGAAGA AGTTATGGTG TCTGTGTtAA GAGCGCCAAr AACATTTACA CGCGArGATA 

TTATAGAgAT TAATTGTCAT GGTGGtATTk TAACGATTAA TAGAGTGCTG GGA i073 

(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3176 base pairs 
^5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: '415: 

CTTTACCAAT GCCAAATCCG AAGTAAAGTA TAGCAATAAA GATTACTAAT ACAATTCTGT 
AAATGGCAAA TGGAATTAGT TTGATTTTGT TAATTAGATG CAAGAATGTT TTQATTGCAA 
TTAGTCCAAC AGTAAATGCA GCTAAAAAGC CTAAAATATA AAAAGGTATA TCAGCAATCT 
GAATATCTTG ATAATGTTTT AATAAAGATA AACCACTAGC TGCTAACATA ATTGGAACAG 
CCATAATAAA TGTAAAGTCC GATGCTGCTT TATGATTTAA TTTCATTAAT ACCCCAGTTG 
AAATTGTTGA GCCTGAACGG CTGAAACCAG GCCACATAGC TACTGCTTGA gAAATACCAA 
TTACAAATGC TTGGAAATAA CTGATTTGAT CTACTGTTTG TGGGTTTTTA ACTTTAGCTG 
AGTATTTATC AGCAATAATC ATATAGATAG CACCTACGAA TAAGCCAATC ATAACAGTTG 
GCACACTAAA TAAATGTTCT TCGATGAAAT CATCAAATAG TAAGCCTAAA ATACCTGCTG 54 0 

GCACCATACC CACTAATACA TGTAATAAAT TTAAACGTCT TGG C TT TG AA CGTCTTTGTT 600 
GATCGTTATC TCCTTCAACA TGTTTGTGTT TACCAATATG TAAAATCTCT AAGAAGCGTT 660 
CGCGGAACAC CCATGCTGCT GCAAAGACGG ATCCTAATTG GATGACGATT TTAAATGTAA 720 
ATGCTGACTG AGAACCTAAA AATTCAGATG ATTTTAACCA CATATCATCA ACTAGGATCA 780 
45 TATGTCCAGT AGAGGAAACA GGTGCAAATT CTGTTAATCC TTCGACGACC CCTAAGATAA 84 0 

TACCTTTTaT TAATTCAATG ATAAACATAA TGTACCCACT TTCATTACTC AATTTAATTT 900 
ATTTAAATAT CAAAATTACC ATATCATGAT AGCATATTCA TTTAAAGACA TGCTAGTTAT 96 0 

AGTTATAATA CTAGACTAAA GATGTATATA TTCATTTTCT TTTACATGTA AAACTACAAT 1020 
ATTTTATTGA GCTATTTAAT TTGATTTTAA GGAAAACCTT TTATAATAGG TTTAGGTGAT 10 80 



60 
120- 
180 
240 
300 
360 
420 
480 
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TTCTTGGTCA GTACTGGTCT CGGCATAATC GTTATAACGC AAAATATTTT AATAGCAGAT 1200 

TTTTTAGCTA AAATTATAAG ACATCAATTT CAAGGTTTAT GGATTGTATT ATTTATTTTA 1260 

TTAGGTGTTT TACTTTTAAG AGCAACTGTG CAATTTCTAA ATCAATGGTT AGGTGATACA 1320 

TTAGCATTTA AAGTTAAGCA TATGCTTAGA CAGCGGGTTA TTTATAAAAA TAATGGTCAT 1380 

CCAATCGGTG AACAAATGAC TATACTCACA GAAAACATTG ATGGTCTAGC ACCTTTTTAT 1440 

AAGAGTTATT TGCCTCAAGT GTTCAAATCA ATGATGGTTC CGCTCATCAT AATCATTGCA 1500 

ATGTTTTTCA TCCATTTCAA TACCGCATTA ATTATGTTAA TAACTGCACC ATTTATTCCT 156 0 

TTGTTTTATA TTATTTTCGG TTTGAAAACG CGAGATGAGT CAAAAGATCA AATGACTTAT 1620 

TTGAATCAAT TTAGTCAACG GTTTTTAAAT ATTGCTAAAG GTTTAGTGAC GTTAAAGCTA 1680 

TTTAATCGTA CAGAGCAAAC AGAGAAGCaT ATTTaCGACG ATAGTACTCa GTTTAGAACT 174 0 

TTAACAATGC GCATTTTaCG CAGTGCTTTT TTATCGGGAT TAATGCTCGA ATTTATAAGT 1800 

ATGTTAGGTA TTGGATTGGT TGCATTGGAA GCAACGGTAA GCTTAGTAGT ATTTCATAAT 1860 

ATTGATTTTA AAACTGCGGC AATTGCGATT ATTTTAGCGC CTGAATTTTA TAATGCAATT 1920 

25 AAGGACTTAG GGCAAGCGTT CCATACTGGA AAACAAAGTG AAGGTGCCAG TGACX3TTGTG 1980 

TTTGAGTTTT TAGAACAACC GAACTATAAT AATGAATTTC TATTAAAGTA TGAGGAAAAC 204 0 

CAAAAGCCAT TTATTCAGTT AACAGACATA TCATTTCGAT ATGATGATTC TGATAGATTG 2100 

GTATTAAATG ATTTAAATTT GGAAATATTT AAAGGTGATC AAATTGCACT TGTAGGTCCA 2160 

AGCGGGGCAG GTAAATCCAC TTTGACACAT CTTATTGCAG GTGTTTATCA GCCAACAATA 2220 

GGTACTATAA GTACAAACCA GCGTGATTTA AATATAGGAA TACTTAGTCA ACAGCCATAT 22 80 

ATTTTCAGTG CTTCTATAAA AGAGAATATT ACGATGTTTA AAGATATAGA AAATAATACT 234 0 

ATTGAAGAAG TGCTAGACX3A AGTAGGTTTA TTAGACAAAG TGCAATCTTT CACAAAAGGC 24 00 

ATTAACACAA TAATAGGTGA AGGAGGCGAA ATGTTATCTG GTGGACAGAT GAGACGCATA 2460 

GAACTTTGCC GTCTTTTAGT TATGAAGCCA GATCTCGTTA TATTTGATGA GCCTGCAACT 2520 

GGTTTAGATA TTCAAACAGA ACACATGATT CAGAACGTTC TGTTTCAACA TTTTAAAGAT 2580 

ACAACGATGA TTGTCATTGC ACATAGAGAT AATACAATTC GCCATTTACA ACGACGCTTG 264 0 

TATATAGAAA ATGGAAGACT GATTGCTGAT GATCGCAATA TTTCAGTAAA TATAACAGAA 2700 

AATGGTGATG ACTTATGAAA ACACGACTAA AATTTCAAGT AGATAAGGAT TTATTGTTAG 2760 

50 CTATAGTTGT TGGTGTTTGT GGAAGTTTAG TTGCGCTCGC CATGTTTTTC TTAAGTGGTT 282 0 

ATATGGTGAC ACAAAGTGCA CTTGGTGCGC CACTATACGC TCTGATGATT TTAGTCGTTA 288 0 
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ATAAAGCTAC ATTTACAATG CTACGTGATA TTCGGGTACA GTTTTTCGGT AAATTAGTAA 3000 

ATGTCATTCC TAATGTTTAC CGTAAACTGA GTTCTAGTGA TTTAATTTCA CGTATGATTA 3 060 

GTCGTGTTGA GGCATTACAA AATATAkATT TACGTGTTTA TTATCCACCA GTCGTCATCG 3120 

GTTTGACAGC GCTAGTTACA GTCATAGTTT TGGCGTTCAT TTCAATCGGC CATGCG 3175 
(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 416: 



20 


TCrrTTAT*l*T c 


rtrt 1X1 ^\v.rv\ 1 


l^jnAl 1 X X X 1 


TATTATTTAC 


GCATAGCTCT 


TAAAATTAAC 


60 






r* A ATT 2V A fS aT 


MAX X\3J\J\K^\JJ\ 


AX i/\Al\7i_X\3 


GCAAGATGTA 


AATACtTCCT 


120 




AATTCAGGAC 


V— l 1 1 


W7>v\X/av7X X 


C wiL.v. X Av..C_(J 


ATCaATCCAAT 


AATACCTGCG 


180 


25 


ATAATATTGC 


CTAAAATACd 


AfT'TYWIATA 


^P^^^p^^^W* ^^^^^^^ 


TAATAGCACC 


AGCAGCCCAT 


240 




CCAATTAAGC 


CACCCACLAAT 


TAAPATTrT'A 
■1 X X ^ ^B-A 


AXAAAX L.L.nA. 


ACATAATTTT 


CAGTCTCCTT 


300 




TTTCTATTTA 


TTTTGCGTTA 


TTCTAAGTAG 


TACCCCTTAT 


TTACAATTCT 


AAAACAAATT 


360 


30 


CAAATTATTT 


TTATCCAAAT 


ATTTTTAAAA 


GTAGTAATTG 


AATATCAATT 


TTATTCAATG 


420 




TAGCTATCGT 


TATTTAAAGT 


CTCTGTACCG 


ATAATATCAT 


ATACATTTAC 


ATTATTTTa-r 


480 


35 


CTGCCGAATT 


CATAGCTTGA 


TTATTTTATG 


TTATAGGACT 


AgAATATACA 


CATATTATTA 


540 


GAGCATCTTT 


GAATTTTAAA 


TCAAGAAGCG 


AGGTTAATGA 


ACAATGAATA 


TGCATATTTT 


600 




ATATaACTTA 


CGAACTAAAC 


ATAATTTAGA 


AATTGACGAA 


TTAGCACAGC 


AATTAAATGA 


660 


40 


GAAATATGGT 


ACTAAATATG 


AAGCACATCA 


AATTTGGGAA 


TGGGAGAATC 


ATCACCATGA 


720 




ACCTAAATTT 


AAAGATGCCA 


TGCATTTAGC 


TGACTtCTTT 


GATGCACCAT 


ATGAAATGTT 


780 




TTTAGAAAGT 


AAGGTTAAAG 


AATATCAGAA 


ACATTTAGAA 


GAAGTCGATA 


TTCGCATGGA 


840 


45 


TAAATAGATG 


CAAATAAACC 


CTCACAACAC 


GTTTGGCATA 


TATCCTTTCA 


AATCTATACT 


900 




GGATATATTA 


CATTACGTTG 


TGAGGGTATT 


TTATTAATTA 


ATATGAATTA 


AGACATTTTA 


960 




CAAGCGTTAA 


TGCAACGAAT 


Cri-i-iTAGTG 


ATCTTGCTCA 


CTCTTTAATA 


CTTTACCGTT 


1020 


SO 


CTTAGCATCA 


ACAGTAACTT 


CTTGTTTTTT 


ATTACCTTTT 


TTCAAATCGA 


TATTGTAAAC 


1080 




AAGTTTGCCA 


TCATCTTTTT 


CAAGTGACCA 


TTCTTTAATA 


TCACCATCAA 


ATTCTTTTTG 


1140 
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5 



20 



ATTCATTGTA 


TCTTCTTTTT 


CAGTCTTTTT 


GTTAATCACT 


TTTTTATTTT 


TATCAGCAAC 


1260 


AAGTACTTCT 


GACTCTTCAC 


CAGATTTTTG 


TTGCGTCACT 


TTATAAGCCC 


ATTCACCATT 


1320 


AGAATTTTCA 


AATGAAATTC 


CTTTCAACTT 


TTGGCCTTTG 


TAAGTTTCTT 


CAGCTTTTra 


1380 


CACAGCATCT 


TCTGGGCTTG 


TTTTAACATC 


TTTTAAAGCA 


ATAACATCTT 


TTGTTGTATT 


1440 


AGTGTCTTGA 


TTAGTATTTG 


ACTCTGTTGA 


TTTTGTTTCA 


TL-rx-nx3GAG 


TATCATTGCC 


1500 


ACATGCAGTA 


AGCACCACCG 


CTGACATTGA 


TAACACTGCT 


AATGATTTTA 


ATTTCATAAT 


1560 


ATCACTCTCT 


CTTCCTATTT 


TTGAAACTCA 


TAACAAAAGC 


TTATATGCTA 


TATAGATTGT 


1620 


ATTACCCCTT 


G'n"rrrAATT 


TTATTCATAA 


TTATTACAAA 


TATTTTTAAA 


TTAATCGTCA 


1680 


TGCGTTACTT 


TCGTTCGTAC 


TCCTTTTATA 


AATGAACCAT 


GTAATATAAG 


CATGCTATTA 


1740 


TCGACTCAGT 


TTGTCTAATG 


CTTTCTTTGG 


TACTTCTTCC 


TTTTCAACTT 


CTTCAAAAGT 


1800 


TTCTACATGA 


TGACCTTTAT 


GTGTAATTTT 


CAAGTATCTA 


TGCGGTTTAA 


CATCAAATGT 


1860 


AGCAGTATAC 


ATTAATTGCG 


TCTCTTTCCC 


TTCTTTATTA 


AAAACACTTT 


GACTATAACT 


1920 


TCGGAGTTGA 


TCATCCATTC 


TAGTAGACAC 


CTCTGTCGTT 


TTAACATAAG 


AATCATCCTT 


1980 


CTTTACTAAT 


GGGTTAAATT 


GATCTGTTAT 


ACCATGGGAG 


TCTATTGTTT 


TTAAAATGAA 


2040 


TAAAGCAgCA 


TAAGCGCCAA 


TGATAGTAAG 


TACAAGATAt 


GTkATTGTTT 


TTAAAATCGT 


2100 


TTCAAAAGA 












2109 



(2) INFORMATION FOR SEQ ID NO: 417: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

40 

GTTACATAAA TGAAAACAAA AAAGATAATT TTACTGCTTA TGCTACACCA- GAACATAATT 60 

ATCAATTTGG TGGTGCTATG ATAGAAAGTG AAAAATTAAG CGAGTTACTA AAGCCAGCCA 120 

ATCAGTTAAA ATCACCAGAT GATATAAAAA AAGAACTAAA TAAAAAGAAG AGTCACTAAA 180 

GTTAGGAGTT ACTTIAATGT CCAAAAAACA TGTTTTTATA ATTATTGGTG TCATATTGTG 24 0 

TATATGTACA GTTTCTACGG TCATGCATTT TAAAATGAAA TATGATGAAA AAGAAAAACA 300 

50 AAAAGCGATT TACTACAAAG AACAACAAGA ACGTATTACA CTCTATCTTA AGCATAATAC 360 

TAAAGAAACG AACACGATTA AATCTGTACA TTTCACAAAC TTGGAAACAA GTCCTATGGG 4 20 
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ATCGCCTGAA CATAATTATC AATTTGGTGG CGCTATGATA AAAAGTGAAG GAGTAGATT^ 54 0 

ATTATTAAAA CCAGCACATG AAAGAAAATC ACCAGAAAAA ATCAAAGAAG AATTAGATAA 600 

5 

AAAAGAAGGC CACTAGGGTC TTCTTTATTT TTGATTTAAT CTTCCAATAA TCTATGTCAT 660 

TGCTATCGAA GGTGTTTCGC AATTAATATA AATCACTTCA TCATCACCAA TACTTCCCCA 720 

GTTTTGTACA GTACATTAAC ACAAACGAAC CACGTTAATT TAAATGGAwT AaTAGTTTGG 780 

10 

CCATTATAAG AACAATATAT ATCGAnTAAC AAT 813 
(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: €40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 



25 



30 



40 



AAAAGCAATC 


GTTGGTGGTG 


CTAAATTCAT 


CGGCAACTCA 


TATGTAAAAG 


CTGGTCAAAA 


60^ 


TACACTTTAC 


AAAATGAGAT 


GGAATCCTGC 


ACATCCAGGA 


ACACACCAAT 


ATGCTACAGA 


120 


TGTAGATTGG 


GCTAACATCA 


ATGCTAAAAT 


CATCAAAGGC 


TACTATGATA 


AAATTGGCGA 


180 


AGTCGGCAAA 


TACTTCGACA 


TCCCACAATA 


TAAATAAGCA 


ACATGAACAT 


AGGATCAAAA 


240 


GTCATCCCCC 


ACTATCAATC 


ATGGGGGATG 


ACCTTTGATC 


CCTTTTTTAT 


ACATACACAA 


300 


GCAAAAATAG 


CGGTGATTGT 


TTACCATCAA 


TTTTAACAAT 


CACCGCTACT 


TTTGCTTGTA 


360 


ATTCATGATT 


CAATTTTTGT 


TGTGTGCACA 


ACGACACTAA 


ATTATGTGTT 


TGCTATTGTC 


420 


GTGTTACAAC 


GATATGCGTC 


GTTGATTTAA 


CTTATCAAGT 


AATTGATTTA 


AATTGTCTAA 


480 


TTCGACTTCC 


GATAAACATT 


GACATCTTGC 


TTCAATCAAT 


TCGCAACGTG 


CATTATTTAT 


540 


TTGTGAAATT 


AATGTACGTG 


CTTGATCAGT 


CAAAATTAAT 


TCTTTACATC 


TTAAATCTTC 


600 


GCTAGATTGT 


TGACITGTGA 


TGTACCCnnT 


CAAAACTAAT 






640 


(2) INFORMATION FOR SEQ ID NO: 419: 









45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 



BNSOOaO: <£P 078651 9A2J_> 
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TTCTTGGATA CGTAATAAAT TACCTTTTGT CATAGGATTA AACATAGATC CAGGTAAAAC 120 

ATAAACATTT CTATTTTGCT CTAATGCAAA ATCTATCGTG ATGTGACTGC CACTTTGTTC 180 

CTTAGCCTCA GTAATTAAAA CCCCTTTTGA CAAACCGCTG ATAATTCTAT TGCGCTCAGG 24 0 

AAATCTATAT TTAGCGATTG GTGTATGTGG TGGATATTCA GATATAACTA AACCTTTTTC 300 

TTCTATTTTA TTTCTTAATG CTAATGTACT TTTGGGATAA TGTGTTTGAT GGCCAAAGGC 360 

TAAAACTGCA ATTGTAGGGA GATTGTATTT TAAAGCTATT TGATGTGCCA TTGCATCAGC 420 

TCCTTGAGCA AGGCCGGAAA CAATTGTTAA ATATTTGCTT TTATCATTTG ATAATAAAAA 4 80 

TTCTAAAGAC TGTTGGGTAT AACTTGTAGA ATCTCTTGCA CCTACTACTG CCAAATGATG 54 0 

CATATTATTT ATTAATTTGA TGTTCCCTTT ATAGAAAAGA AGTAATGGAA ATTGATATAT 600 

TTCTTTTAAT AGCACTGGGT ATTCAGAATC CATATATGTA ATGTAACTTA CTTTTAATTT 660 

CTGCAGTTCT TTAATAATAT CGTTATGATC AAGTTTTATA AAAGCATAGT ATTTACGTAA 720 

TAAATGAACA TTTTCTTCCC TATTCACCCA TTCACATAAA TAACTATCTT TTTTTCTTCC 780 

CTCCTCTTTA ATTACATTAG GATATGCCAT TAAAAATTGA TGAATTTGTT TAGTCGAAAA 840 

25 GTGTGCCCAG TATAACTTAA GCAAAAATAG TTTAATCAAT AAATCAACK: CTTTTTGTAA 900 

TCATACAAAA TCATATTCTA TTTTTGTTTT ACATTTCTAA TACAAAAACA TTGTCGATGT 960 

AATGTTATTT TAAGGAGTAA AAATACTGAC TTAAAAAGTG AAAAGTATGT TGGAAAGAAT 1020 

^0 TTAAAATTTC AATTTCCAAC ATACTTTATA ATTAAACCTT ATAAATAAGT TTTGCAAATT 1080 

TATATAGAAT TGGTCTTACT GGTTTGATGA AATCACCAAT TAATTCTTCA ACATGTGCAT 114 0 

TAAAACCCTT TTTAAATTG X159 
(2) INFORMATION FOR SEQ ID NO: 420: 



20 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1879 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 

TCAATCAGCG ACTACAAGAA GTGCGGGTCT TCAAACAATT GATGTGACAA CACTAAGTGA 60 

CCCCACTAAT ATTATCATGG GTATTTTAAT GTTTATAGGA TCTTCGCCAA GTTCGGTTGG 120 

so TGGCGGTATT CGTACAACAA CTTTCGCTAT TTTAATTTTG TTTTTAATTA ACTTTAGTAA 180 

TAATGCCGAT AAAACATCCA TTAAAGTTTA CAATAGAGAA GTACACATTA TGGATATTCA 24 0 



55 
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ATCAGCTACT 


GAAAATGGTA AGCTTACATT 


1 1 lACAAGTA 


T TTl"riXiAAG 


TCATGTCTGC 


360 




ATTTGGAACT 


TGTGGACTAT 


CGCTTGGTGT 




ATTAGTGATA 


TTTCTAAGGT 


420 


5 


CX3TACTAATG 


ATACTCATGT 


TTATAGGACG 


i \j 1 i/GGCTTA 


ATATCATTTA 


TCATTATGAT 


48Q 




AGCAGGACGT 


CGAGAACCAG 


ATAAATTCCA 


'PTHT^^R R IV TV 

1 1 AI L.w\AAA 


GAACGTATTC 


AAATAGGATA 


54 0 


10 


ATATAATAGC 


AATCTAAGTT 


TAGTTAATGT 


AvaA i\l 1 lAAC 


TGGAACTTAG 


ATTGC'i"!"!"!"!' 


600 


TAGTTTGTAT 


TTTTAACTTA 


TTTTATAAGA 


r^r^ j\ fT>nv^/ "i**i>»i* 
l-oAl i\j<j l 1 1 


CGAAAATGGT 


AAACTAGTAA 


660 




CAATGAGAGG 


TGTAACATGA 


TGGAAAAAAA 




AATGTAGAGA 


TTTTAACTAC 


720 


IS 


GTCAGATATG 


CATAGTCATT 


TCTTAAATGG 


TGATTATGGT 


TCAAATATTT 


ATAGAGCTGG 


780 




TACTTATGTT 


AACCAAGTAA 


GAGCACAAAA 


TCATCGCGTC 


ATTTTATTAG 


ATAGTGGCGG 


84 0 




AAGTTTAGCT 


GGCTCGTTAG 


CGGCCTATTA 


TTATGCTATT 


GTTGCACCTT 


ATAAACGACA 


900 


20 


TCCAATGATA 


AAGTTAATGA 


ACAGAATGCA 


1 lAlTjATGCT 


AGCGGTGTGA 


GTCCAAGTGA 


960 




ATTCAAGTTT 


GGTTTATCAT 


TTTTAACTCG 


TTCAATTGCT 


TTGGCACGTT 


TTCCATGGTT 


1020 




ATCAGCAAAT 


ATTGAATACA 


ATGTTACTAA 


GGAGCCTTAT 


I'lU^rCAACTC 


CATATTGTAT 


1080 


25 


TAAACATTTT 


GGTGACTTAA 


AAATTGCTAT 


CGTAGGCGTC 


ACAGCAG^TG 


GTTTAATGGA 


1140 




AAATGAGTAT 


TCTGAAATGG 


AGCAAGATGT 


ATCTATTGAA 


AAGACATTAG 


TGGCATCAAA 


1200 




ACGTTGGATT 


AGATATATCC 


ATGAAGTTGA 


AG AG C CAG AT 


l'-l~i~i~X'GATTG 


TAATTTATCA 


1260 


30 


TGGTGGATTG 


AATAAAATTA 


GTAATAGTAC 


/^2V211VafVt*JVTVTV 

GAAAAATAAA 


AAGGCAAGTT 


CGAATGAAGC 


1320 




TGAAAAATTA 


ATGGAAGAAC 


TCGGTGTTAT 


AGAi 1 TAAXG 


ATTACAGCTC 


ATCAGCATCA 


1380 


3S 


AACAATAGTA 


GGTCAAGATC 


ATGAAACGTA 


1 1 A i Gil GAG 


GCTGGTCAAG 


ATGCCAAAGA 


1440 


GCTTGTACAT 


CTTTCGATTA 


ATTTTAAAAA 




ACTTATGATG 


TTGAAAGCAT 


1500 




TGATTCTAAA 


GTGATTGACT 


TAAATGAGTA 


i GAAGAGG AT 


^ IV TV TV TV TWfWTV «V« 

CAAGAATTAT 


TAGATTTAAC 


1560 


40 


ATTCTATGAT 


AGAAAAGCAG 


TGGCTTATTG 


GX V.AVJAGGAA 


1V1^^TVT*TV TV i wiw 1 

ATCATAACvrG 


ATAAAGGTTT 


1620 




GATGTTATCA 


GTAAATGGGT 


TACAAGATTT 


AGTLrrGTCAA 


ACACATCCAT 


TTTCGCAATT 


1660 






GCAATTCACC 


TTGCATTTGA 


TAATGATATA 


ACATGTGTCC 


ACGTGCCTAT 


1740 


4S 


GaACGGAGAG 


AAGGGGTTGA 


GTGGACAGAT 


TCGAAATGAA 


GrTTTGTATC 


aTGCATACCC 


1800 




ATaTCCAGAT 


aAGCCAtGGG 


tATGACAATT 


aGTGGTCAAA 


ATaTCAAGGT 


ATnTTGGGGT 


1860 




ATAGTTATTC 


ACCATTAGG 










1879 


SO 


(2) INFORMATION FOR SEQ ID NO: 421: 









<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2710 base pairs 

55 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

5 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 

AATTCCCCCT TTTTCCCCAT CCATTTTnCC ATCCACTGGT CCTTACGGGA CCATATTATT 60 

TnAAATTGGA nAAAATTATt TTAAAGaTTA TwACtACTCT TtAATCATTT TrGTGAATTA 120 

10 

AAAAAAGTAG TGCAAAAAGC AAAATATACT TTATACACTA CAAATCATTT ATTTATAATA 180 

AAGTTTCACC AAAAAATGTT CCAACTAATG AAACCGCTTG TTCAGCAGTA TGATTATTAC 24 0 

;5 TGTCAATCAA TGGATTTACT TCAACTAAAT CCATTGAGGA AATTAAATGT GATTGATGCA 300 

GTAATTCCAA TGCAAAATGG CTTTCTCTAT AACTAAGACC ACCCAAAACT CTAGTACCAG 36 0 

TGCCTGGCGT TTCAAGCGGA TCTAAAGCAT CAACATCTAA AGATAAATGA ACGCCATCAA 42 0 

20 CATTGCGTGA CTTCAAATAT TCTATTGTAT TTTCAATTAC TTCCTTTATC CCCAATTTAT 48 0 

CAATATCTGA CATAGTAAAT GTTTTAATAT TATGATCTTT GATAAATTGT CTTTCACCTT 54 0 

TATCTAAATC TCTCATACCA ATTAGTACGA TGnTTTCTGG CTTGATTACA TTACnATTTA 600 

ATTCTAAAAG TTCTTTGGGA CCTTCGCCTG TCAAAATCCT TAGAGGCATA CCATGAATAT 660 

TTCCACTTGG TGACTCTTCA GGTATATTTA AATCACCATG TGCATCATAC CAAATAACAC 720 

CTAAATTATT ATAATGTTTA CTTATTGCTG ATACTGAACC TACCGCAATA GAATGATCTC 780 

30 

CACCAAGAAC TAGAGGAAAT CTGTTATTTT CAATTGATGC TGAAACCTCT TTATTTAATT 84 0 

TTTGATTAAC ATCTATAATT TCATCATAAT TTCTTAATCC TTTTTGTTCA CTATGAAATT 900 

TTTCAATGTT CACAGCAGGt ACCTTAATAT CCCCCTTGTC ATATACATCA AGGTCTAATT 960 

35 

GCTTTAATCT TGAAATTAAT CCAGCATATC TAATTGCTGT TGGTCCTAAA TCAACACCTA 1020 

ATTTTCTTTG TCCAAATGTT GATGGTGCAC CTATAATATC AATTGCTTTT GTCTTTGTCA 1080 

TAAGCGTCCC CCTTTGCTCT ATGTAATTAA AGAATAATGT ATGCGCTTAC CATTATCAAG 1140 

40 

CAATAGCTAC ACATATAATC TGTTTATCTT ATTACTTCAT AAAAAAAGGT TCTTCATCTT 1200 

TTATGGTGGG AAGGTAAAAC TTCCTGCTTT TTTTAATACA CAAAAAGCGC AATTGCCTCT 1260 

4S ATAATTTAAA GTGACCAAAC CCAAACTAAA GGAGACAAGT GCGCCTATGT GTAATGATAC 13 20 

CTTAGAATTA CTAAGAATAA AAGATGAAAA TATAAAATAT ATAAACCAAG AAATTGACGT 13 80 

CATTATCAAA GGAAAAAAAG CAACAGTGGT TAATGCTGTA CTAACGTATA AGCCTTCGGC 144 0 

50 CTGTTATTGT TGTGGAGTTA AAAATGAAGG ACAAATTCAT AAACATGGTA AGCGTGTTTC 1500 

TCGTATTACT TTACTTAAAA CTCAAGGGTA TAACACATAC CTCAACTTAG CTAAACAACG 1560 

55 



,3DOCID: <EP 0786519A2.I.> 
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5 



10 



25 



30 



GTGTTTTATC 


TCAAGATGTG 


TTACTCAAAA 


AflTTATAHAA 




A AOTT^A A A AO 

AACvl l AAAAw 


1680 


AGAGATTGAT 


ACTGCAGAAG 


ATAACTGTAT 


>v X ^ X V'^'^X w X 


r\.\- xvj 1 AAWy 1 


kjlAx lA^AAL. 


1740 


TAAAGCGGCT 


AATTCATTAC 


GAATTAAACC 


%v XXX X X v9 X 




A O AT" OOOT* AT* 
AL^l 1 Al 


1800 


GGATGAATTT 


AAAAGCX3TTA 


AAAATGTAAC 


tggatcaaty; 


riVj 111 \ 111 


TT» AT'AG ATA A 
X lAlAvjAlAA 


1860 


TGATACTCAT 


GATGTTATAG 


ATATTTTAGA 


AAAT AG A A PT 
****** X *wxn/\\„ X 




T*0 OOTV* O OT' Jl 


1920 


TTTCGAGCGA 


TTCGATTTAA 


AAAATCGACIA 


APAAHTTZiar* 


AU(>Oi lAC lA 


X IX^ACATGTA 


1980 


TGAACCCTAT 


GTCCGATTAT 


TTCGCGAf CTP 


** X X X x*\nX 


uWiljIvlAl lA 


1111 1 GACAG 


2040 


ATTCCATATC 


GTTCAACATT 


T AAAT AG ACIA 


A f^*T*T* H li T a a/^ 
X X^viXAAVj 


i AXt-GTGTAC 


AAG l i ATGAA 


2100 


TGAATACCGT 


AATAAAAAAG 




Xi<lkii.A/VX 1111 


A A^ A A 'PA HOn^ 

AACyAA TAACT 


GGAAAGTCCT 


2160 


ATTGATGGAT 


ACTAGTAAAA 


C CAT ATTT Ar; 


T A A HT]^ r* HG 2V 


TY^O A A*PA A AT* 

1 vii^AA 1 AAA 1 


Cl"i"l~rAAGGC 


2220 


TTATAAACGC 


TCATCTGACA 


X X vj X AO.rv\ X X 




A A AO AO/*»7V*T»1k. 

AAAGACGATA 


TACTACGACA 


2280 


CTCCTACGAA 


CTTGTCCAAG 


GATTAPC3AAA 




1 lAlwjiAATT 


GGCCTAAATT 


2340 


TATTAATCGT 


TTAAATTC!AG 


TTAGTAAAAA 

X X AVJ X ^xrvvv^ 


0 1 W 1 Vj 1 vjJWj 1 




GGaAAGTGGT 


2400 


TAAATATTAT 


AGAAAACATC 


AAAGGATfY5TT 




ATTTATTACC 


CAGCATTTAA 


2460 


TAATGGTGCT 


ATAGAAGGAA 


TTAATAATAA 


GATAAAATTA 


ATCAAGTGAA 


TTTCTTTTGG 


2520 


TTACAGAAAT 


TTCAACAACT 


TTAAAGCACG 


TATAATGATG 


aTTTTCAGCT 


TGTACAAAGG 


2580 


AGaAAAAAAG 


AAGACAACCA 


AGCCCAATAA 


TGGACTGGCC 


GCCTAATAwA 


nGGGskCTAA 


2640 


AAGTTgTATT 


TTTAAAAATA 


GTTCCTTTAA 


ATTATATACC 


CnCCACATTT 


GGGGGAGGAC 


2700 



CTAAAAAAGC 

35 (2) INFORMATION FOR SEQ ID NO: 4 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(b) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

CATTTTAATT GtTaAAATTC CAAAAAAtGT ArGTGGATTw AAAGrAAACC CtGTGTTTTT 6 0 

AAAAGGCACC ATTaAAATAg TTCCGATTGT ACCATCCCAC GTGAAAtTTT TAgTAtTGCC 12 0 

GGTGaGAGAA AATGCCAATG CAATCATCGC AGTTCATAAT CATCCATCCG GTGATGTAAC 180 

GCCCTCACAA GAAGATATCA TAACAACAAT GAGGTTGAAG GAGTGTGGTT TGATTTTAGG 24 0 
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5 



10 



IS 



20 



GGGTTACTTT 


GATGAAAATG 


ATTGAAGTTG 


ATAGAATTAA 


TGACGTATCT 


TGTGTATAAT 


360 


ACCTACGAAG 


TACTTTCATT 


GGAGGAAAAA 


TAGTGACTCT 


ATTTATTATT 


ATCGGGGTTC 


420 


TCGTGCCAAT 


GGTTTATACC 


ATGCAGTTAA 


ATATTAAAAA 


TGAACCTGTa 


ACAAAGCGCA 


480 


ATCTTTTAAT 


AACATTAGCT 


TTATCTACGT 


TAGGTATTTT 


AGTAACCGCG 


TTAGCAGGTG 


540 


TAATCGTTAC 


GAAACAAGCT 


TTTCCTTTAT 


TAAGTGTAGC 


AATTGGCTCA 


ATTTTTACTG 


600 


GAATCGTTTG 


GGGCCTTTTA 


CTAAGTGGTA 


GcTACGCGCT 


GATACGATTT 


TTATCTAACG 


660 


CATTTGGGCG 


TAAGTAATAG 


TAATCTGTTC 


ATCAAGTAGT 


ATCCGTGCTT 


GAAAACAAAC 


720 


TAAAACTCCT 


AATGTGGAAC 


TAGTTAAAAA 


ATTCTAAACT 


AGTACATTAG 


GAGTTTTGTT 


780 


ATGCAGAATA 


AATTTAATTG 


TTAAATTGAA 


AGTGCGGTAA 


AAATCCACTA 


TTTATTTGAA 


840 


AAAAATCGAG 


CAACCAAATT 


AAACCTTGAT 


ATCCTAAGTA 


AATACATAAT 


AAAACAAGTC 


900 


CGACATAAAT 


TAAAAATCGC 


AAAATAGAAA 


GTCCAACTCT 


AAAAAGGAGG 


ATGACTAATA 


960 


GTGCTATTAA 


AATAATTGTt 


AGTATACTCA 


ATGCTACAAA 


CCTCCTArTA 


CGCTTTTAAA 


1020 



TCCATAA 1027 
(2) INFORMATION FOR SEQ ID NO: 4 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 



45 



SO 



CGGCACATTT 


TTAATTTATA 


CATATTTTAA 


AACTAAGTAA 


CAGTTTGAAG 


AAATCGTAGT 


60 


TCAATAATGT 


TAATTGTGAA 


AATGTATATA 


AACATAAAAA 


AATCATGTAT AATATATGTT 


120 


GTTAATTAAA 


CAGTTCGAAA 


GCGAGATGAC 


ATTATGGGAC 


GTAAATGGAA 


TAACATTAAA 


180 


GAAAAAAAGG 


CCCAAAAAGA 


TAAAAACACA 


AGTAGAATAT 


ATGCGAAATT 


TGGTAAGGAG 


240 


ATTTATGTTG 


CAGCAAAATC 


TGGTGAACCC 


AATCCAGAAT 


CTAACCAAGC 


TTTAAGGTTG 


300 


GTGCTTGAAC 


GCGCTAAGAC 


ATATTCAGTG 


CCGAATCATA 


TTATTGAAAA 


AGCAATAGAT 


360 


AAAGCTAAGG 


GTGCTGGAGA 


CGAAAACTXT 


GATCACCTAA 


GATATGAAGG 


ATTTGGCCCA 


420 


AGCGGATCAA 


TGCTAATTGT 


TGATGCGTTA 


ACAAATAATG 


TAAATCGTAC 


TGCCTCTGAT 


480 


GTGCGAGCTG 


Crm'GGTAA 


AAACGGCGGT 


AATATGGGTG 


TATCTGGATC 


AGTTGCTTAT 


540 


ATGTTGTCAT 


GTGGCCACAT 


TTGGTATTG 








569 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 
5 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

TGGACCTATA AAATGGATGC ATCCAAATAT GGATGnGGGG TAnGGCGGGa AAATAGGaGG 6 0 

TTATATAGTA AGTCATTTAA GTCGAATAGC CATCCTTTTT AAAATGAAAA AAATAGAAAG 12 0 

CTCAATAGTT TGTTAAAGCC TTTAAATAGC GTCGTTACGC AATTTTAGAA TGCTAAAAAT 180 

TGTCACAAAT TTCAAATTAC GTGCTAAAGC TTGAGATATC AATATTTATT GGCGATAGAG 24 0 

TGTAATTTGA CTCCGCTTAC ATCTCAGTTT TATGTTTGTA ATTTGGTAGC ATAATATTAT 300 

AATAAAATAA AATTGTTAAT CTTTAATTTC AGTATAGATA TTTTTACGTG TAGTCACGTG 360 

20 

TAAAATAAAT TCAATTAGGT TAGGAGACAT AATTATGAAA ACATTTGGTA AAAAGGTTGT 42 0 

ATTAATCGGA GATGGATCTG TAGGATCAAG CTATGCCTTT GCAATGGTTA CGCAAGGTGT 4 80 

TGCTGATGAA TTTGTAATTA TTGACATTGC AAAAGACAAA GTAAAAGCAG ATGTTCAAGA 54 0 

25 

TTTAAACCAT GGTACAGTCC ACAGTCCTTC ACCAGTTGAT GTGAAAGCAG GTGAATACGA 600 

aGaCTGTAAA GATGCAGATT TAGTTGTTAT TACAGCTGGT GCACCTCAAA AGCCAGGTGA 660 

AACACGTTTA CAATTAGTTG AAAAAAATAC TAAGATTATG AAGAGCATCG TTAAGAGTGT 720 

30 

TATGGATAGT GGCTTTGATG GATATTTCTT AATCGCGGCA AACCCTGTAG ACATTTTAAC 780 

AAGATTTGTA AAAGAATATA CTGGATTACC AGCAGAGCGT GTTATCGGTT CAGGTACTGT 84 0 

35 ATTGGACAGT GCACGTTTAC AATATTTAAT TAGCCAAGAA CTTGGTGTTG CACCTTCAAG 900 

TGTTGACGCT AGTATTATTG GCGAGCATGG TGATACTGAA CTTGCAGTTT GGTCACAAGG 960 

AAATGTAGCA GGTATTTCAG TATATGACAC ATTAAAAGAA CAAACTGGTA GCGAAgcTAA 1020 

40 AGCGGAAGAA ATTTATGTGA ATACACGTGA CGCTGCTTAT GAAATTATCC AAGCTAAAGG 1080 

GTCAACATAC TATGGTATTG CATTAGCATT GATGCGCATT TCAAAAGCCA TTTTAAATAA 114 0 

TGAAAATAAT GTCTTAAATG TTTCTATACA ATTAGATGGT CAATATGGTG GTCACAAAGG 1200 

45 CGTTTACCTA GGTGTACCAA CATTAGTTAA CCAACATGGC GCAGTTAAAA TTTATGAAAT 1260 

GCCATTAAGT GCCGAAGAAC AAGCGTTGTT CGATAAATCT GTTAAAACAT TAGAAGATAC 1320 

ATTTGATTCA ATTAAATATT TATTAGAAGA CTAAGCCTAT TTTAAGTATT AATTAGAAAT 13 80 

SO 

ATATTAATGG TAAGAGGATC TATTAGTGTT GCAGTAACAC GTGGCACTGA TAGGTCCATT 14 4 0 
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TATGACTCGT 

GAAAGTAGTT 

ATGGTTTTGG 

CTTACTAGAA 

AACGTTAACA 

GAGCATAAAG 

AATTTTAAAG 

CAAATTAAAA 

TTTAAACATA 

GATTCAGCAC 

TTTTTTACAC 

GATGATGAAA 

GAGATACAAA 

AAAGCGAAAA 

ATAATTAAAA 

TTTTATATTA 

TATCAAGATA 

TCGCTTCGAC 

CGTCGGGATA 

ACCCCTCTGC 

TTTCAGGATT 

CCAAGTTAAC 

CGATTTGACr 

GTAATTGATA 

CAACTGCTTG 

CTAAAATGAT 

GTTGATGTGT 

CTTTCACAAC 

CGTGTCTCCC 



GAAAATTTAG 
CTGCATTAAT 
AATGACGAtG 
GGCACAGCTA 
GGGTCTGATG 
AATTTATAGA 
CGAGTAAGAC 
ATGAAATGTT 
TGCATGTACG 
GCAGACAACC 
CAGAATTATT 
GAGCTTATGG 
ACTTTGAAAC 
TAGACTATAA 
GACCTTAGCG 
TATATGTGCT 
TGGGGTGTCA 
TTCAAGAGGT 
ACCTTGATAT 
CTGTGCAACA 
ATGCATGATT 
TAACAATTTA 
GTCTTGGACC 
ATCTCTGAAG 
GCATACGTTT 
GAGCGTGTCC 
TTTGACTGCA 
AATATCTTGT 
GATAAGCGTT 



TGACATGAAT 
CACGAaGTAG 
TCTTGTATCA 
CAATTAATGA 
GCGAAtAATA 
ATTAAAAGGC 
ATTTCCATTG 
AAGTGAGAAT 
AATGATGCCT 
TGAGGAAAAA 
TCATGGCGTA 
TGGACATGTT 
ATTCCAACAA 
AGATGTGGCA 
ATATTAGGAA 
TACATATTTT 
AGAATGTGTT 
TGCTGATGCG 
ATAGTCATAA 
CGACAACCAT 
GCAACGGTCT 
TACCAAATAG 
AAATCTCTAA 
TGCGTAACAA 
TTAAATGAAA 
TCATGTGCAA 
ATAATAATAA 
GCAGGTGCAT 
GTATGAGGCA 



AAAAATGTTG 
AGCATGACmA 
ACATGGTACA 
ATTGTTAGAA 
TTTTTAGACG 
GATGAGAAAG 
CAACAATTAT 
TTATTTTCGG 
GCTCAGCAAC 
AGACAAGATA 
GGGTCTGCTG 
CTTGACTTTG 
CATTTCCCGG 
GAAGAAATTA 
CAGGTGGTTC 
GTTGATACGC 
CACGTGCACG 
CAATGTCGTA 
TAGTATCGAC 
CTAGTAATAG 
GTCTACCGAG 
CTTGTTGAAT 
ATTGTCGAGT 
CATCGCCTTT 
TATGTTCGAG 
GATGAGTTAA 
CATCAAATGT 
GTGGTACAGT 
ATGATTGTTG 



AGGCGTCATT 


GAGTATAAAG 


1560 


CGAGGAATAA 


CTATAGGGaG 


1620 


TTAGGTACGT 


TAATGGCTGG 


1680 


CATGGGAATT 


TAGGGATTGC 


1740 


GAAAGGCATA 


TCATGCTAAC 


1800 


TACCGTATGC 


ATCGATTACT 


1860 


CACAAGATGA 


TGTATTTGCA 


1920 


CTGTTAAAAT 


TTATGGCACA 


1980 


CGCCATATAC 


ACGTTTGATT 


2040 


TTCGTGGTGC 


CATTGTTGGA 


2100 


GTTTTCATAT 


ACATTTTGCG 


2160 


AAGTGGATGA 


CGTTGTCGTT 


2220 


TAAATAACGA 


GACGTTTGTT 


2280 


GA6AAGCTGA 


ATAGTCTAAT 


2340 


TAAATGCATC 


GTTAAGGTCT 


2400 


GCGTAAAAAG 


CTATAAATAG 


2460 


TCGATAAATA 


AAGCCTTGAA 


2520 


ATACATGCTC 


GTTCCCATTT 


2580 


AGTTTGTTCT 


GAAAAGTTTA 


2640 


CTGTCTACAC 


AGTGTACGTA 


2700 


TGCTGTGATA 


GAATTAATGC 


2760 


ATTTGCTTCT 


AATACAATGT 


2820 


TAATGCATTA 


TCTTGTATAC 


2880 


CTTTTGACCA 


CTTATATAGA 


2940 


TTGACCATAA 


CCATTTTGGG 


3000 


ATGAGGAATA 


ACTGCATCAA 


3060 


GTTTGTGACA 


TCTTCATAAC 


3120 


ATAATATGTT 


ATTGTTTTGG 


3180 


TAATTCATAG 


GCAATAGTTG 


3240 
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(2) INFORMATION FOR SEQ ID NO: 425: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 



IS 



25 



30 



35 



AGAACG AA C C 


T*TT A A'P A f^n*T* 




v_AC-GAACTGT 


CATGACAAAT 


TGATAAAACG 


60 


A. A T* ArSTTTTTHT 
nfxxfwj X X X X 


r* A TT A n TTT* A 




AATuACl 1 1"! 


GTkTCACCTT 


TAATCACAGC 


120 


A T'TTTTf^ A'^r' A 

X X X VvAX 


TA A ATATTA h. 




X IGATCAGTG 


TTAGTAATTA 


TAATTGGTGA 


IBO 




TTAGCGTTAT 


TATTAATATA 


TTCAAGGTTG 


AATCTTACTA 


ATGGATCTCC 


240 


GACGTTAACT 


TCGTCACCAC 


TAGACACTAA 


CACTTCAAAT 


CCTTCACCGT 


CTAATTGAAC 


300 


TGTGTCTAAA 


CCGATATGAA 


CTAATAATTC 


TAATCCGTTA 


TCTGCTTTTA 


ACCCAATCGC 


360 


ATGCTTAGTT 


GGAAAGACAT 


TGTCAACACG 


TCCTGCAATT 


GGAGACACAA 


CTTCTCCTTC 


4 20 


AGTTGGATTA 


ATACCAAAAC 


CTTCGCCCAT 




GCGAATACAG 


GATCTGGAAT 


480 


ATCTTCAATT 


TTCACGAATT 


CTCCAGTTAA 


TGGTGCATAA 


ATTGCGATAT 


CTTTCTGAAC 


540 


TTCTTTGCCT 


TTTCCGAATA 


ATTTTTTAAA 


CATACTTTCC 


ACTCCTACTT 


ATCAAAATGT 


600 


GATATTAAAT 


CGCCATAACC 


CAATTCTTCT 


AACTTTTCAT 


ATGGAATAAA 


TTGAATTGCA 


660 


GCGGAATTGA 


TACAGTATCT 


TAAGCCGCCA 


CTTTCTTTAG 


GTCCATCATT 


AAAGACATGT 


720 


CCTAAATGAC 


TATTTGATTC 


TTCTGAACGC 


ACTTCAGTTC 


TCAACATACC 


AAATGATTTG 


780 


TCGACTAATT 


CTATAA 










796 



<2) INFORMATION FOR SEQ ID NO: 426: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 93 base pairs 
40 <B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxxble 
- (D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 

ATCCAGCAAA TCTTTCTTAT CACGTTCTGT AATTTTCTTA GTATCCATCT TGATTAGCTT 60 

TGATAACTTT TCAGCCGTAT CCAACATTTC CGATTGTGTT GTTTTTCGAC CCCTAGTATA 12 0 

50 

TGTAATAGCC ATTTTAGAAG CATTATCAAC TAAAACTTTC CCATTTCTGT CTAAAATACG 180 
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ATGTGAGCCT TGTGCGATTT GTAAATAACC TAAACGTAGT ACTAGTACTG CAAAAATAAA 300 

TACAATCACA CCAAATATAA AGTTAATTCT CTTGTTAATT GTATTTTGAA CGATTTCATC 360 

ATTTGATTTT TCTTTTAGTC TTTTTAACAA AACTACCTAC CTCTATTCAA AGTCTTTCAC 420 

TTTAAATCAT ATATGAATTT AGAAATTATT TCTATCTTTT TGACAAAAAA ATAACGGTCT 4 BO 

CATTtAAGAG ACCGAACAAG TAATCATACT TTATTTTGTT GCATTATATA ATTCGTCAAC 54 0 

TTTTTCCCAG TTAACTACAT TCCAAAATGC GCCAATGTAG TCAGGGCGTT TGTTTTGATA 600 

TTTTAGGTAA TAAGCGTGTT CCCATACGTC TAAACCTAAA ATAGGTGTTT TACCCTCAGT 66 0 

TAATGGATTA TCTTGGTTTG GTGTAGTCAC AATTTCTAAC TGGCCATTGT TTACGACTAA 720 

CCAAGCCCAA CCTGAACCAA AGCGTGCAGC TGCTTTGTCA GCAAATTCTT TTTTAAATTC 7B0 

TTCTAAAGAA CCCCATTGTT CTTTAATm TTCTACTACA GTACCTTTTT CTTCTGAGTT 84 0 

TGGTGAAAGT AACTCCCAGA ATAATGAATG GTTTAAATGT CCACCGCCAT TATTACGTAC 900 

AGCAGTTTGG ATGTTAGCTG GTACACTGTC TAAATTAGCA ACAATTTCTT CAATAGATTT 960 

AGATTCTAAA TCTGTACCTT CTACTGCAGC ATTTAATTTC GTAACATACG TGTTATGATG 102 0 

TCTGTCATGG TGAATTTCCA TAGTTTCTTT GTCAAAATGT GGTTCTAATG CATCAAATGC 1080 

GTATGGTAAT TTTGGTAATT CAAAAGCCAT AAATAATCAT CCTCCTAAAA TGTCTGTAAG 1140 

TAAATAATAA CAAGCAGTGA CTGGTTCAAC AAAGAATTTG CTTAAATTCT ACTACTTATT 12 00 

ATTTTCTCTA CTCATTTAAT ATAACTCAAA TCAAAAATAA TTAAACATTT TGTATATAAA 1260 

AAGTTAACAG ATTTGCCATA AAATCATACG AACGGAGTAT GAAATGAACC TTTATCTTCT 1320 

ATAATTTAAA AAATGAGATT TATGCATACA TCGGACCAAA TGTGCATAAA TCTCATTTCT 13 80 

3S TTATATTAAT CTTGGcAAGA CTCACATACA CCATAAACTT CAAGTTTGTG TTTGTGAATA 144 0 

TTAACACCAG GTAGTGATAA TTTTATCTGA TCTATTGGAC AATAATCTAT TACCTTTGTA 1500 

TCTCCACACT TTTCACAGAT AAAATGATGA TGATGATGGT TTGTACAAGC GATTCTAAAC 1560 

TTCATTTCAC CATCAAGTTC TGTATTTTCA ATAATTCCTA AATCTTTAAA TAAGTGCAGG 1620 

TTTCTATATA TTGTGTCGAA TGAAATTCCA GGATAATTTT CATCCATAAC TTGTTGTATA 1680 

TACTTTGCGT TTATATACTT ATCTTCTTCG ACAAAAATAT CTAACATATC TTTACGTTTA 174 0 

TCTGTATATT TTAAACCGTT CTCTTTTAAA ATTTTAATAG CATCATTTGT ATTCATTGAT 1800 

ATTAGCTCCC TTTTTAAACT TCATTCGCAT TTTCTGATAA GCCATTGTAA TCATAAGTAA 1860 

AATAACAAGT AGAACTACAA TTACACCACC CGGAGAAATG TCCATATAGA AAGCTAGGAC 1920 

TAAGCCTAAT ATTACTGATA ATTCACCTAA AAATACACTT AGTAATATCA ATTGCTTAAA 1980 
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TATCCCTACA ACACGCATTG AGGCAGAAAT AACCATCX3CT ACAATAACAA TAAATAAAAA 2100 

TTGAATCCAT TTAGGAATGC CAATGACTTT ACTATATTCC TCATCAAATG ACAATATAAA 2160 

TAATTCTTTA TAAAACAATG TAATAAACAG AACAACTATG ATGGCAATGA CAATAATCGT 2220 

TGTTAAATCA CTTATATTCA CTGCGCTTAT TGAGCCAAAT AGCAATCCAA CAATTTCTTG 2280 

ATTGAACCCA TCAGCTAATG AAATGAAGAT TGCACTCAAG GCGATACCAG CACTCATTAT 234 0 

AATTGGAATA GCAATTTCTT GGTAAGCAGT GTATGACGTT CTTAATTTTT CAATTAGAAG 2400 

CGCACCTACT ATTGCX3AATA AGATTCCAAA CCACATTGGA TTAATAAATA CTAGTGTTGG 2460 

CATAATAGTA AGTAAAAACA TACCGAAAGA TATACCACCT AAAGTTACAT GACTTAGAGC 2520 

ATCAGCTWTA AGTGATAGTC GTCTAACAAc GATAAAAGCA CCGaTTAGAG GCGCAATAAA 2580 

ACCTAtCAAG ATAcCACTAA TTAAAGAGTA CCTCATAAAA TCAAAATTCA ATAATGCATC 2640 

TATCAATTGT GACACGCCTT TCCATTTTAA ATAAACTCAA ATCTTTATTA ATTACAACAT 2700 

TCTCGATTAT GCTGATGATC GACAAAACGT ACAGGATGTC CATAAATTTT TGAAATTTCA 2760 

ACTTCATCAA GTGATTTAAA CTCATCAGTT GTACCATGGA AATGCAAATG CTTATTTAAA 2820 

CATGCTACTT CAGTAGCAGT ATCTGCTACA ACACCGATAT CATGAGTAAC TAAGATAATG 2880 

GTGATACCTT CTTGTTTTAA TTGATCTAAA GTATTATAAA ATTCACTTAC ATGTTTTGCA 2940 

TCAATACCAT TCGTTGGTTC ATCAAGTACT AATACTGcAG GTTCTGAAAT CAATGCTCGA 3000 

GCAATCATTa CACX3TTGTTG TTGACCACCT GATAATTCTG CTAT A TTTTT ATGAATTAAA 3060 

TCACTTATAT TCAGTCTTTC TAGTACTTTA ATCACTTTTT CATTATCTTT GCTATTAAAT 3120 

GTTTGGAAAA GACGTTTTGT CTTTGTTAAT CCGCTTAAAA CAACTTCTTT AACACTTGCT 3180 

55 GGGAAACCTG AATTAAAGGC ATTTGCTTTT TGTGATACAT AGCTTAATTT AATTQATGTT 3240 

TcTTATTTTT AAAATCAATA CCTTCAACAA AAATCTCACC ACTTTGTAAA GGTAATAACC 3300 

CTAGAATCAA CTTCAATAAT GkTGATTTAC CAGCACCATT TGGkCCAACA AvrTGCtAAAA 3360 

40 ATTCACCTTT ATTTATTTTG AATGnnnTAT ATT 3393 

(2) INFORMATION FOR SEQ ID NO: 427: 



25 



30 



4S 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 
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CCCTGTTTCT 


TAGCCTTTTC 


GTATATGCGT 


TCATAAGAAG 


AATTCTTTTT 


GATTTCAGAT 


120 


TGTAAAGCAC 


TGTTTTCACT 


TGATTGTTTA 


TCTATTTTAT 


AATCTAAATC 


TGCAATCTTT 


180 


CCTCGCGTAT 


CATACGCATC 


CA'rrnTAAA 


GATAGCATAT 


AAATACTTAA 


CATAGCAATT 


240 


ACAGTAATCA 


AAGTTATGTA 


TAAAACTTTT 


TCAAATTTAG 


TTAATTGTAC 


AACCACTTTT 


300 


CTCGAAACAG 


TCTTCTTTTC 


GGGCTTAGTT 


TGTGGTTGTT 


GCTTCGGTAT 


ACTATTATAA 


360 


ACTTGTTCGT 


CATATGGTTG 


GTACACTTTT 


-TCTACAGCCA 


TTATAAATTG 


CTCCTTATTT 


420 


AAGTATTTCA 


GCTACACGTA 


ATTTCX3CGCT 


TCGTGCTCTG 


TTATTGTCAT 


CTAAATCTTC 


480 


TTCTGTAGCG 


GTAATCGGTT 


TACGATTAAC 


ACGCTTTAAC 


TTAGGTGTAT 


ATGCTTCTGG 


540 


TATAACTGGT 


AATCCTCTTG 


GTACCTCTGG 


ACCTTTTTCA 


TATTCTTGGA 


ACACCTGTTT 


600 


ACATAAACGA 


TCTTCTAAAG 


AATGGAAAGT 


GATTACCGAA 


ATCCTGCCAT 


CTACTTTCAC 


660 


TAATTCAATC 


GCTTGTTCTA 


TTGAATCTTC 


AAAAGCTGAC 


AATTCATCGT 


TTACTGCAAT 


720 


TCGTAGTGCT 


TGAAATACTG 


GTTTTGCAGG 


ATGTCCGCCT 


TTTCTTCTTG 


CTTTTGCAGG 


780 


AATACCTTCT 


TTTATAATGT 


CAACTAATTC 


TAATGTTGTT 


GTTATTGGTT 


GTTGTTCGCG 


840 


ATGTGCTTCG 


ATTCTTCGAG 


CTATCTGTTT 


TGAAAATTTC 


TCCTCGCCAT 


AG CGATAAAA 


900 


AATCTTCACT 


AACGCTTCAT 


ATGACCAATT 


GTTAACAATT 


TCATATGCTG 


TTAGTTCTTG 


960 


TGTTTGGTCC 


ATACGCATGT 


CTAATGTTGC 


GTCATGGTGA 


TAACTGAATC 


CTCGTTCTGG 


1020 


AATGTCGAGT 


TGTGGGCTTG 


AAACACCCAA 


GTCGTAATAA 


ATTCCATCTA 


crrrrrcAAT 


1080 


GTTTAAGTCT 


TTTAATATTT 


GAGTTAATTC 


ACGGAAGTTG 


CTA 




1123 



(2) INFORMATION FOR SEQ ID NO: 428: 



^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 
TATATCCTCC AnACCATAAA AATGGAATTA TAGTGAAGAC CGCTATGATC CAGGATGACC 60 

4S 

ACGTATTATA GAATCTTTCg ATAGAGAAAC GAATCaACGT ATCCgACATC aCGAAATGAA 120 
ATTAGAAGAT TATAAAGATG AGTTAAGAAG AGAATATCTA AAACAATCTG ACAGAATTGA 180 
AGGAGATGAA TAAGCGTGGG ACTTGATTTT AGTGGTTTAC CAGATTTAGC AGTATTGGAA 240 

50 

CAAATGAAGG AAAAAGAACA GATTAGTGAG GTTATTGCGC CTGAACATGT TCGTATGCAT 3 00 
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CATTTCAAAA AATTTGAAGA TGATTTTAAA AATGCGGCAC AAGGGGCTTG GGTGAAAAAT 42 0 

GCCACAGACG AATTAAAAGA TATTAGTAAT GATTTAGAAA AAATTCAAGA TATTAAAGTA 4 80 

TAAAAAGGTA TTAAGAAACA CAATAAGTAT ATAATCCATT TAATAATAAA TGAATTATAT 540 

AGTTCATAAT TTCGACTATA AGTGGCTATT AGCCATTATT TTTCGGGATC TATGTCAAAT 600 

CGGACTAATG AATTCAATAA TGGAAGTTAA GCAACCAATC TTTGTTTAAC TTCTTTTTTA 66 0 

TTTTTTGGAA AATAAAGTTT TGAACATAAT AAAATTTGAT TATGTTTTAA CGAATTTTGA 720 

TGTTTCTTGA ACTATATATC ATCTAGTCGT CATTTACAAT TGGTAAATAT GACTTCAAAC 780 

TGkATGAAGG TGATGGCGAT TAAAAGGCTC ATCCGTAGGT TCTAAAGAAC TAGAnTTT 83 8 
(2) INFORMATION FOR SEQ ID NO: 429: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2S 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 

nGTGTTCAAA TAATAGAATG GCTAATACCC CAACAATAAA TAGGAATAAA TATGTTGATG 60 

TCGTAATTTT AGTAAATAAT GAAAATCTAA AATTAGTAAC CCTATTTTGA ATATAAGCTT 120 

TAACTTCTAA TAATACTGGG AAGCCGATTG ATCCTAAAAT TATTAAAAAT ATAACTATCG 180 

CTTGTACAAA ATAATCATGT GCATAAGGAA TTAATGACTT ACCTGTAATA TCTAATCCAC 24 0 

35 CATTGGTAGT GGCAGAAATA GAAACAAAAA CACCTTGCAT GATTGCATAT TTTAAATCTG 3 00 

GATTATCTCG ATAAAAGTAA AATGCTAATA ACATGGCTCC TACGAGTTCG ATTACAAATA 360 

TTGATTTTAC AATATCAATA ATCAATTTGA CGGTACCACT CATTGTGTTT TTGTTATTAT 420 

CTAACATAAT TAACTGACGT TCTCTAATTC CAATATGTTT ACCTAGTACC ACCCATAACA 4 80 

TCGTACGAAT TGCCATGACG CCAATTCCAC CAATATTTAA TATCACGAGG ATAATTAATT 540 

GTCCAAATGT AGAATAGGTA TCGACAATAC TTATCGGAGA CAATCCAGTA ACACTAATTC 600 

CGGATACGGC AACAAATAAT GTGTCAATTG GATTTACTTC TACACCTGGT TTATGaACAT 660 

ACGGTAAGTT TAATAATAAA AATGCAATGA CAATTGCGAC GATATAGTAC AATACAATAC 720 

CTTGTTGAGG GCTTGATCTT TTTAAAAACT GGCTAAAAAT TGACACGTTG TTCACGTCAA 7 80 

CATTATTTTA GTTTAATATT AATTTCTTTC TCTTTACCAT CTCTATAAAT CTTCGCTGTA 84 0 

ATTGATTTCA AGTCATCTTT ATGACTAAAT ATAATCTGCC TAAAGCGTAA ATCATCTTCT 900 
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AAACCATTGT TC5TCAACTTG ATCTACAACA ACACCGTTCT TAACTTTTCC TGGCAATTTA ' 1020 
ACTGCTTGTC TTTCAAAACT ATTTAGACTG ACAATATTCT TCATTTTAAC ACCTACATCG 1080 
GGATAGTCAA TTTTACCTTT tGTTTCTAAA TCTTTTACAA TCTTTtGTAC TTCATTAaCA 114 0 

GGtATTGCAA 

(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 

20 AAGAATATTT AGGATTTTAT GTTTCGCAAC ACCCAGTAGA TAAAAAGTTT GTTGCCAAAC 60 

AATATTTAAC GATATTTAAA TTGAGTAACG CGCAGAATTA TAAACCTATA TTAGTACAGT 120 

TTGATAAAGT TAAACAAATT CGAACTAAAA ATGGTCAAAA TATGGCATTC GTCACATTAA 180 

ATGATGGCAT TGAAACTTTA GATGGTGTGA TTTTCCCTAA TCAGTTTAAA AAGTACGAAG 240 

AGTTGTTATC ACATAATGAC TTGTTTATAG TTAGCGGGAA ATTTGACCAT AGAAAGCAAC 300 

AACGTCAACT AATTATAAAT GAGATTCAGA CATTAGCCAC TTTTGAAGAA CAAAAATTAG 360 

CATTTGCCAA ACAAATTATA ATTAGAAATA AATCACAAAT AGATATGTTT GAAGAGATGA 420 

TTAAAGCTAC GAAAGAGAAT GCTAATGATG TTGTGTTATC CTTTTATGAT GAAACGATTA 4 80 

^5 AACAAATGAC TACTTTAGGC TATATTAATC AAAAAGATAG TATGTTTAAT AATTTTATAC 54 0 

AATCCTTTAA CCCTAGTGAT ATTAGGCTTA TATAACTTTy ATCAACTAAT AAATTATGAT 600 

ATAGTAAACT GATGGTTAGA TATTTyTyAA CCATCAGTTT GCGTktATAT TAGTTTTTTA 660 

TGCTTATTAT TTTTATGAGT TTCACTTTAC ATTATTGATT AATCCATAAG AATAATTAGC 720 

ACAAAAAGCA GTATACATAA ATTGAGTAAA GAATTTTGTC GATATAACCG nOCGGAAAAA 780 

TAATAACnTT TTGGATn 797 
(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TTTCCAAGAG AAGTTGCGGA AGTAATTAAT AAAACGCATC ATAATAAATT GGTCATTTCG ~ €0 

ATGATTTCGk CACAAATCGA TGCGGATAGA ATGGATTATT TACAACGTGA TGCGTATTTC 120 

ACAGGTGTAT CATATGGTGC TTTTGATATG GAGCGTATTT TAAGATTAAT GCGACCTTCT 180 

AAAGATGAAG TACTAATCAA AGAAAGTGGT ATGCATGCAG TTGAAAACTT TATTATGAGT 240 

CGTTATCAAA TGTATTGGCA AATTTACTTC CACCCAGTTA GTCGTGGTGG AGAAGTGCTG 300 

CTTAATAATT GtTTGAAACG CGCAAAACAG CTTTATAATG AAGGCTATGA ATTTAAGTTG 360 

CATCCACATG ATTTTATTCC ATTTTTTGAA GAGACAGTTA CGATTGAACA ATATGTTGAA 420 

CTCGATGAAG CGGTAGTTAC GTATTATTTG GAAAAATGGA CAAAAGAAGA TGATGCTATT 4 80 

TTAAGTGATT TAGCAAGTCG ATTTATTAAT CGAGACTTAT TTAAATATAT TCCATTTGAT 54 0 

GGCTCAATTA TTACAATATC AGAACTGCAA GAACTGTTTG AAGCAGGTGG TATTAATCCA 600 

20 GATTATTATT TTGTGAGTGA AGCATTTTCT GATTTGCCAT ATGACTATGA TCGACCGGGG 660 

TCTU^TCGCA AACCGATTGA TTTATTAAGA CAAGATGGTA CGATTAGAGA AATAAGCAAT 720 

CAATCATTAG TCATTCATAG TATTACAGGC ATTAATCGCC AAGACTATAA ATTATATTAT 780 

CCTAGAGAAA TGGTTGCAAA GATTAAAGAT AAGACAATTA GAGAAGCTAT TGAAAATTTG 84 0 

ATTAATGAGC TTAATTAAAC AGGGCTAAAA TTGTTATCGT TAAATATGGA GGTTATATCA 900 

TTGTCTGAGA AAAAAGGCTT TAATTTTAAT ATCATAAAAA ATGACCCTCT AGATGGTCAT 960 

AAAGGTACAA ATATTGGTTC AATTAGCTTA GACAATATTG CACCAGTTTT TATCGATGTT 1020 

GCTAACAAAG AAGCATTTAT TGATATTGGA GGCATGCATG CTCGTGCCAA AGTTGAAAAA 1080 

GGTGTGAAAT GGATTACTGA TAAAGCTGCT GTTGAAGGCG ATGAAGCTAA AGaATATTGG 1140 

TTGTGTTGGG TAACAACAGA ACGTAATGAA CAAGGACCAT ATTACGCTGG TTTAACAGCX5 1200 

TGCTATTTAT TAGTGAATAA AGCAATTCGT CGTGGTTATA AAAGTATGCC TGAACATGTT 1260 

AATATGATGG ATAAATCAAT GAAACATCAT ATTATCATAG ATCAAATTGG TGACGAGAAT 1320 

AAAGCTATTT TAAAAGACTT TTTAATGAAC CATGATGAAG GTATGTGGAA GCATTCTTCT 1380 

GATGCTTTAC ATCAAGCATT TAATTAAATA TTAGAAACTA AAATTTCCCA ATTAATCTAT 1440 

AAAGATATGA TTCATTTCTC AATGAC 14 66 
(2) INFORMATION FOR SEQ ID NO: 4 32: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2304 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doub 1 e 

(D) TOPOLOGY: linear 
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TCATTCTCAT CATATTCTAG GTTGTTTTTA ACMiACTIiAA TATAGTGAAT GCAAATCAAC " 1900 

TATTATTTAA ATTATGAATT ATTTTAATTC TTTCTTCTAC GAGCCAATAA CATTAATCCA i860 

GCAATTCCAA TTATACTACT AAAGATCAAA CCTTTTTGCG TGCTTTCTAA ACCTGTTTTT 1920 

GGTAATTCTG CTCGTTTTTT CTCTTGATTA GCTACTGATT CTTTAGCAAT TTTAGATTTT 1930 

TTAACTTTAT CATTTTTATC CATTGAATGA ACTGGGCCAT TTGGTTTTGC TCTGTCTTTC 204 0 

GATAATCCTG GATTGTTAGG ATTTACTGGG CCACTTGGAT GAGTTGGTCT GCTCGGCTTC 2100 

TCTGGGTTTT CAGGTCCTTT TGGATCTTTT GGTTTCTCTC CACCGAACTC TACAATCTTA 216 0 

TCTACTGGTT GTTTtGTGAT CTCTTCTGTT GGTTGACCCT CGCCAACTTT TTCACCTGTT 222 0 

AATGGGkTCA CTGTGAwTGG TGkTGTGAwT GTCyTACTTC CTGGkTGTCC TTCTTGTTTC 2280 

ACTCGCTCTT CACCAGGTTG TAAT 23 04 
20 (2) INFORMATION FOR SEQ ID NO: 4 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 

GTCTTAATAT TTTACAAGAT ACAGCCTGAC AATATTTATA TTACAAACCA ATATGGTAAA 60 

GAGGCTGTGT CTCGATTAAA AAGAATTTTA GAGCAAAGTA TACCTAATAA AGAATATATT 120 

35 GATTTATCAG AGATATCATT AATTACATTT TTTAAACCAG AATATAATAA ACAACATGTG 18 0 

AATGAGGAGT TTAATTCTTT ATCTTCTTCG AAAATAAAGA ATTTAGCCAA AGAAAATGAT 24 0 

GGTATTATTA TTTATTTAAA TTTTAAGGAT GTTAATTATC AACTGaTAaG TGAAGGCAAT 300 

ACTTTTTTTT CTGCAAAACC ATATTTGCAT TGTGTGTTCA ATAAAGATAT TTTAAATATG 360 

GATAAAGTTG ATATAGAAAA TTTTTTCAAA AGTATAGAAG TCAAATACAG TAAATAAGAT 420 

CAATTATTGA ATCCACCGAC CAGGAGATTT TTTTAAAAAA TTAATACCCC GTTAATACCC 4 80 

TTTGCTTCAA TTTGATGAAA ATCAATGAAA TTCAAAATTG AAGAAATCCT TTAATATCAA 54 0 

GGTGTACGAC AGTCTATATA ATCATGCGAA ATTCTAAAAT TTTCtGAcGT AAAAAAACAA 600 

ATTCyTAAAG CAGCTCGTCG TTCACCTCAA TTCTCAAAAC GTTAATTGTC GGACGATATA 660 

TATACAAAAC ACCTCGATGT TATGTCGAGG TGTTTTTTTG CGTTTGTGTG GGGAATATGG 720 

AATAGAGTAT AAATGAATTA ATGTCTCAGG TATAGAATTA ATTCAACTAT GAATTATTGA 7 80 
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TTAGCTTGCG GAGACCGTAC GATAGTGTTC TAATAATAGA TATTCAAC3GG AAAGAGCTAT " 900 

CGGATGATAA CCAGATGATA ACTCAAGAAC CATGrTTTAA GAGTGTTATC MJATPJiGGlK 96 0 

GTGACTTAAT TGAAAGATTC TATATTTTGG AAGAAAGCTT TTATTyCTGT TTATTTTATT 1020 

GTTGsGATGC TGGTGTTTCT ACTTTTTAGG TTTTATATTA AaACAGATAA CTTTTCTATA 1080 

TATTTAATGA GTATCTTCTT AATTTGTTTA GGAACTGCTT CTATCATTTA TAACTATAAA 114 0 

ACCAATCGAT AAATATAAGT ATGAAATGTA TAAGTCACAG TTAGATCTAA GTCTTGCTGT 1200 

GGCTTTTTAT ATCTCCATAA AACGACAAAT TCAAGCCCGA CATAAAACAG CATTTTCAGC 1260 

CCACCATAAA ACGACAATTT CAGCCCGCCA TTGACTAAAC ACCACATCCC AAAAATATCG 1320 

TAACAATCCT CTACATCAAT CAATCCAACA TCCCTCATAA TCACAACGCA CAAAATCTAT 138 0 

. TCATGCATTT TTGGAATACT TAGTATTACA AATAACGATT TTTATTCATC TAACAAAGGT 14 4 0 

20 TATATAATGT ACTGAAGGCA ATTTTTATGT ATTACAAATC TAATCGTACA TGTAAAATTT 150 0 

TGATAAACAT CATTAATTTT GCGTAACTAT CATTAGATTA CAAATCACAA AGTAATTACA 1560 

TGTAATACAC ATCTATACAT CACATTTGAA GGGAAATGaA TATAAATGAC TGATAAAAAG 1620 

TACACTGCAG CCGATATGGT TATTGATACT TTGAAAAATA ATGGGGTAGA ATATGTTTTT 1680 

GGTATTCCGG GTGCAAAGAT TGACTATCTA TTTAATGCTT TAATTGATGA TGGTCCTGAA 174 0 

CTTATTGTCA CTCGTCATGA ACAAAACGCC GCAATGATGG CGCAAGGTAT TGGAAGATTA 180 0 

ACAGGCAAAC CGGGTGTAGT ACTTGTTACA AGTGGTCCTG GTGTAAGTAA TTTAACTACT 1860 

GGATTATTAA CCGCAACATC TGAAGGGGAT CCTGTATTAG CGTTAGGTGG CCAAGTGAAa 1920 

CGnAnATGAT TTATTACGAT TAACGCATCA AAGTATTGAT AATGCTGCGC TATTAAAATA 1980 

TTCATCAAAA TATAGTGAAG AAGTACAAGA TCCTGAATCA TTATCAGAAG TTATGACAAA 2040 

TGCAATTCGA ATTGCTACTT CAGGAAAAAA TGGCGCAAGT TTTATTAGTA TTCCGCAAGA 2100 

40 CGTTATTTCT TCACCAGTTG AATCTAAAGC TATATCACTT TGCCAAAAAA CAAATTTAGG 2160 

AGTACCGAGT GAACAAGATA TTAATGATGT CATTGAAGCG ATTAAAAATG CATCATTTCC 2220 

TGTTTTATTA GCTGGTATGA GAAGTTCAAG TGCAGAAGAA ACAAATGCCA TTCGCAAATT 2280 

AGTTGAGCGC ACGAATTTAC CAGTTGTAGA AACATTCCAA GGTGCAGGTG TAATTAGTCG 234 0 

TGAATTAGAA AATCATTTCT TCGGTCGTGT GGGCTTATTC CGCAATCAAG TTGGTGATGA 24 00 

ATTATTACGT AAAAGTGATT TAGTTGTTAC AATCGGTTAT GATCCAATTG AATACGAAGC 24 60 

TAGTAACTGG AATAAAGAAT TAGAAACACA AATTATCAAT ATTGACGAGT TCAAGCTCAA 2520 

ATTACTAATT ATATGCAACC GAAAAAAGAG TTGATTGGTA ATATTGCTAA AACGATTGAA 2580 
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CAATTAAGAA CACATATTGA TGAAGAAACT 
CTACATCCAG TGGAAATTAT TGAATCTATG 
^ ACAGTTGATG TTGGAAGTCA CTATATTTGG 

AGACATTTAT TATTTAGCAA TGGTATGCAA 
TCAGCTGCAC TTGTGCGCCC TAATACGCAA 

10 

TTATTTTCAT CACAAGATTT AGAAACGGCC 
ATTTGGAATG ATGGAAAATA TAACATGGTT 
TCGTCAGGTG TAGACTTCGG TCCTGTAGAT 
AAAGGTTTAC GAGTTACTAA TCAAGAAGAA 
ACAGATGGTC CAGTATTAAT TGATATACCT 
20 ACAAATATGT TACCTGACGT ATTTAACTAA 

GATTTTATGA CTAATGTTTT ATACCAACAT 
TTAAAAGGAA CTGCATCAAT AAATGAATTA 

2S 

CTAACAGGTT CAAACGGTGA GGTAATCTTT 
CATAAAGAAT TTGTAGAATT AAAAGGTGAC 
TTTGTAGCAG ATACAAGCTA TGAAACGAAA 
ATTAAGGAAA AGATGTTGAG TGAAAATTTA 
AAAAAAATGC ATGTACGTAT GATGCCGGCT 
55 TCAGCTAGAA GACAACCTGA ACAAACTGAA 

TTTACACCAG AATTATTCCA TGGTATCGGA 
GATGATCGTA ACTTTGGTGG ACATGTCTTA 
ATCCAAAATA TAGAAACATT TGAACAGCAT 
GCAAATA3TG ACTATAAAGA TATTGCAGAC 
AAATATAATG ACGGTTTATG AAAATTGACT 

4S 

GTAAATATGA TTAACCATGA CTACAATAGA 
TAAAATAAAT CCCTTCACTA TTAGCAGTAG 
AGATTTGCTG TTATGTTTAG ATTATAAATT 
GTATAGGCCA ATCTGTTACT ACGAGAAGCA 
AAAAGACAAA CGCCAATTTT TCAGCGCTTG 

55 
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GGTATTAAAG CGACGCATGA AGAAGGAATT 2700 

CAAAAGGTAT TAACTGATGA TACTACTGTA 2760 

ATGGCACGTA ATTTCAGAAG TTACAATCCA 2020 

ACGCTTGGTG TAGCATTACC GTGGGCAATT 2880 

GTTGTGTCCG TTGCTGGCGA TGGTGGCTTT 294 0 

GTACGTAAAA ATTTAAATAT CATCCAGCTT 3 000 

GAATTCCAAG AAGAAATGAA ATATAAACGT 3 060 

TTTGTAAAAT ATGCAGAATC ATTTGGCGCG 312 0 

TTAGAAGCGG CAATTAAAGA GGGCTATGAA 3180 

GTAAATTACA AAGATAATAT CAAACTTTCA 324 0 

AATAAAGATA AATGTTAAAG AGGAGTGGGA 33 00 

GGTACATTAG GCACATTAAT GGCAGGATTA 33 60 

TTGCAACATG GTGACTTAGG TATCGCTACA 3420 

TTAGATGGAA AAGCTTACCA TGCAAATGAA 34 80 

GAGTTAACAC CATATGCAAC TGTAACTAAA 354 0 

GATAAATCTT CAGAAGCAGT TTTTGCAGAA 3600 

TTTTCAGCAG TAAAAATTTC AGGCTTATTT 3660 

CAAGAACCAC CTTATACACG TTTAATCGAT 372 0 

ACGTATGTCA AAGGTTCAGT CGTTGGTTTC 3780 

TCAGCAGGAT TTCATGTACA CTTTGCGAAT 384 0 

GATTTTGAAG TAGAAGATGT TAAAGTAGAA 3900 

TTTCCAATTC AAGATAAAGA TTTCACTAAA 3960 

GAAATTAGAG AAGCTGAATA ATGAATCCAG 4 020 

TCATAATGCG CGATTTAGAA ATGATAGTTT 4080 

ACAAATATAT TTATAATTAC GTCTAAGTAA 414 0 

TGAGGGGATT TATTAGGTTC CAGATATTTG 4200 

GTGGTACACA CTCATATAAA ATTTACTATT 4 26 0 

AACAACAATA ATTTACAAGT TCAATAACTA 4 320 

CCCTATAAAA CTATTTTCAA ATTATTATTT 43 8 0 
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TCGAATGAAA TCACTACATT TCCCCCTAAA ACTAATATCA ACATTTTAAT AAGATAAACC 12 00 

AATTTCAAAA CTAGTTCGAT ATTTAAAATG TATTATGGAT GGnTAAAGTT TGTATCGCAT X260 

TATCGCGAAG TTGnATAAAT ATAT 3^284 
(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

IS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 5: 

ACAGCTTTTG GTAArGGAGA AAwTCAaTAt GAAACAGTAT AArGCGTATT TaATCGATTT 60 

2^ AGATGGCACA ATGTATATGG GAACAGATGA GATTGATGGA GCAAnACAAT TCATCGATTA 120 

TTTAAATGTA AAAGGCATTC CTCATTTATA CGTAACTAAT AATTCAACAA AAACACCTGA 180 

GCAAGTAACT GAAAAATTAC GTGAAATGCA CATTGATGCT AAACCAGAAG AGGTTGTAAC 240 

GTCAGCGTTA GCCACTGCTG ATTATATTTC AGAACAATCA CCAGGAGCAT CAGTATATAT 300 

GTTAGGTGGG AGTGGTTTAA ATACTGCGTT AACCGAAGCG GGACTTGTCA TTAAAAATGA 360 

CGAGCATGTT GATTATGTAG TTATTGGACT TGACGAACAA GTTACATATG AAAAGCTTGC 420 

GATTGCAACG TTAGGTGTAA GAAATGGtGC AACATTTATT TCTACAAATC CTGATGTATC 480 

AATTCCTAAA GAGCGTGGTT TATTACCTGG TAATGGTGCT ATTACAAGTG TTGTAAGTGT 540 

3S ATCGACAGGT GTATCGCCAC AATTTATTGG TAAACCAGAA CCGATTATTA TGGTTAAAGC 600 

ATTAGAAATT TTAGGATTAG ATAAATCCGA AGTTGCTATG GTAGGCGATT TGTACGATAC 660 

CGATATTATG TCTGGTATTA ACGTAGGTAT GGATACGATT CATGTACAAA CAGGTGTATC 720 

TACGTTAGAA GATGTGCAAA ATAAAAATGT GCCACCAACG TATTCTTTTA AAGATTTAAA 780 

TGAAGCAATA OCTGAATTAG AAAAATAGAT ATAGTCATTT TATAAAGTAG GTGAATTGAT 840 

TTGGTAAAAA TAGTTGTTTC GAGGAAAATT CCAGATAAAT TTTATCAACA ATTAAGTAAA 900 

CTTGGTGACG TTGTTATGTG GCAAAAATCA TTAGTGCCTA TGCCTAAAGA TCAATTTGTG 960 

ACaGCcTTCG TGACGCAGAT GCTTGTTTTA TTACATTAAG TGAACAGATC GATGCAGAAA 1020 

TTTTAGCGCA ATCACCAAAT TTAAAAgTAA TTGCGAATAT GGCTGTAGGA TA 1072 

(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3271 base pairs 

SS 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 





TAAAAACTTA 


CTTTAACACC 


ATTccrrrri' 


AAcrrn-iTC 


GTGTTTCnTT 


TnCTTAAGTC 


60 


10 


CATCCATATT 


TTTAATGATG 


TCATCTGCTG 


TTTTATCTTT 


TAAATCTAAC 


ACTGaGTGAT 


120 




AmCGAATTTG 


TAGCACAGGA 


TCAAATCCTT 


TATGGAATCC 


AGTATGTTCA 


AATCCTAAGT 


180 




TACTCATTTT 


ATCAAAGAAC 


CAATCATTAC 


CAGCATTACC 


TGTAATCTCG 


CCATCATGAT 


240 


IS 


TCAAGTATTG 


ATATGGTAAA 


TATGGATCGA 


TATGTAGGTA 


TAGACAACGA 


TGTTTTTTAA 


300 




CATATTTTGA 


TAATTCATTA 


AAGAAAAAGT 


GTACGAGTTC 


TTGATTTTCA 


TAATCAATCA 


360 


20 


CTGGACCGCG 


ATTTGAATAA 


AAATACTTGA 


ACACTTTCAT 


AACAGGTACA 


GCAGTAAGtA 


420 


AGCAAGCTGC 


AATGACCTCG 


TTATTATTGT 


TTTTTATTCC 


CACTAAATGT 


GTTTCATAAC 


480 




CTTCAGCAAG 


CTTTAACTCA 


TAGTGGCCAA 


CAGTTTGCGT 


G^aAATGACTG 


TATGGCATGC 


540 


25 


TATCTGTAAa 


GGCACCAAAC 


TCTTTAGCTG 


TTAAATTTGT 


AAACTTCAXT 


ATCATTACTC 


600 




CTATTTGTCT 


CTCGTTAATT 


AATTTCATTT 


CCGTATTTGC 


AGTTTTTCTA 


TTTCCCCTCT 


660 




GCAAATGgCA AAAATAATAA ATCTAATCTA AATAAGTATA CAATAGTTAA TGTTAAAACT 


720 


30 


AAAACATAAA 


CGCTTTAATT 


GCGTATACTT 


TTATAGTAAT 


ATTTAGATTT 


TTGAATACAA 


780 




TTTCAAAAAA 


AGTAATATGA 


ACGTTTGGGT 


TTGCTCATAT 


TACTTTTTTT 


GAAATTGTAT 


840 




TCAATTTTAT 


AATTCACCGT 


TTTTCACTTT 


TTCAAACAGT 


ATTCGCCTAA 


rrrTTTTAAA 


900 


35 


TCAAGTAAAC 


TTAATTATTC 


AATGTTTGTT 


GGATAGATTG 


TAAATATTTA 


ATGATTTCCT 


960 




CACGCGTGTT 


AGATTTAAAT 


CGCTTAACGA 


TTTCGCTACC 


AATGACAATG 


CCATCTGCAA 


1020 


40 


CCTCTTTTAT 


ATCTGCAACA 


TGTTGTGGTG 


TTCTTATACC 


AAATCCTGCG 


ACAACTGGCA 


1080 


CATTGGCTAT 


CGCTTTAATT 


GACTCAATTT 


TTCGTTTTAA 


TTCTGGATGA 


AACGCACCGT 






TTTGCCCTGT 


TGTCGCATTC 


ATCGTCACAG 


TATAAATAAA 


GCCTTCCGCA 


TGGGATACGA 


1200 


45 


TATCTTTTAT 


ACGTTTGTCA 


TCAGTAGTCA 


TCGCAACTAA 


CGATATGATT 


TTGACGCCAT 


1260 




AGTGACTAAA 


TTGTTGTTTT 


AAACGCTGCG 


ATAATTCATA 


TGGTAAATCA 


GGAATAATTA 


1320 




AGCCGTAGAC 


ACCAGTATCT 


CGACATTTTT 


CAAAAAACGC 


TTGTTCTCCA 


TAATGACAAA 


1380 


SO 


TAATATTATA 


ATACGTCATT 


AATACATAGT 


TACACTTAAT 


TTGATCACCA 


TGTTTTTCTA 


1440 




ATTGATTGAA 


AATATAATCT 


ATCGTGATGC 


CTTGTTTAAT 


CGCTTGTTGA 


CCTGCTTCCA 


1500 




TGATAACTGG 


ACCATCAGCA 


ACCGGATCAG 


AGAAAGGTAC 


TCCAATTTCA 


ATTATATCTG 


1560 
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GTATAAATAA TTTAGTCATT 


TGCAAGACCT 


CGCTCTACCA TATATTGTCT 


AATTGTTTCC 


1 £ fi rv 
-1 o o U 




ATATCTTTAT 


CGCCACGTCC 


AGAAATAGTT 


ACTACAATAA TATCTTCTTT 


CGACATCGTA 


1 lA ft 


5 


GGCGCTAGTC 


TTTCAACATA 


ACTCAGTGCA 


TGTGCACTTT 


CAATTGCAGG 


TATAATACCT 


1 ft nft 

J- o w u 




TCATGTTTTG 


TAAAGTTGAT 


TAAAGCATTC 


ATTGCTTGTG 


TATCACTAGC 


ATTTTCAAAA 


1860 


10 


GTTACTCTAC 


CAATGTCGTG 


GTAATAAGAA 


TGTTCTGGTC 


CAATACCAGG 


ATAATCAAGT 






CCTGCTGAAA 


TAGAATGTGC 


TAGTTGCACT 


TGCCCATCTT 


CATCTTGAAT 


TAAATACATT 


1980 




TTAGTACCAT 


GTAATACGCC 


AGGTGATCCT 


TTGCCAATTG 


CAAGTGCATG 


TTTATCAGTA 


2040 


IS 


TCATCGCCTT GACCTGCGGC 


TTCAACACCG 


TATAATGCAA 


CATCATCTTT 


AATAAATGGA 


^ ^ \j \j 




TAAAATGTAC 


CGATTGCATT 


TGAGCCACCA 


CCGATACATG 


CTACAATTGC 


ATCCGGAAGT 


2160 

A X V w 




CGACCTTCTT 


TCTTCAATAT 


CTGTGATTTT 


ATTTCTTTAC 


CAATCACACT 


CTGAAAATCT 


^5 ^O 


20 


CTAACAATCG 


TTGGGAACGG 


GTCTGGACCT 


AATGCAGAAC 


CTAATAAATA 


ATGTGTATCA 


29 so 
^ ^ o u 




TCTACATGAC 


TTACCCAATA 


TTGCAATGCT 


TTATTAACTG 


CATCCGATAA 


AGTCCCTTGA 


A ^ *X V 


2S 


CCATCTTCAA 


CTGCCACAAC 


CTTTGCACCA 


AGTAATTCCA 


TTCTAAATAC 


ATTAAGTTGT 


24 on 


TGTCTTTTAA 


TATCTTCACT 


TCCCATAAAG 


ACAACAAGTT 


CCATATC5U^ 


TAATGCAGCA 


2460 
A *s o w 




ACCGTAGCAC 


TAGCTACACC 


ATGTTGACCC 


GCACCAGTTT 


CAGCAACAAG 


CTTCTTCTTG 


2520 


30 


CCCATTCTTT 


TAGCAAGCAA 


CGCTTGACCT 


AACGCATTAT 


TAATTTTATG 


GGCGCCTGTA 


2580 




TGATTTAGAT 


CCTCTCGTTT 


CAAATATATT 


TTAGCGCCAC 


CTAGGCTTTC 


AGTATATGAT 


2640 




GCAGCATATG 


TAAGTGGTGT 


CGCGCGTCCT 


ACATACTCTG 


ATAAATAGTA 


TTCCAGTTCT 


2700 


3S 


CTTTGAAACT 


CTGGGTCTGC 


TTTTGCCTCT 


TTATAAGCTT 


TTTTCAACTC 


AATAATTGCT 


2760 




GGCATTAATG 


TTTCTGGAAC 


ATATTGCCCT 


CCATATTCAC 


CAAAGAAACC 


TAATTCATCT 


2820 




GCTTCTGTTT 


GTATTTGTTT 


ATTCATTGTC 


TCTATCTCCT 


TTCACAATAT 


TTACAATTGC 


2880 


40 


TGTCATTTTT 


TCTATATCTT 


TTCGCCCATT 


TACTTCTATA 


CCTGATGCAA 


GATCATAACC 


2940 




TTGATGTGAT 


AATTTAAGTT 


GATTAACTGT 


TTGAATATTT 


TCAGAGTTAA 


TGCCTCCTGC 


3000 


45 


TATCAAATAA GGTATGTCTT 


TTATGTGCTT 


CAAAATAGTC 




TTTGACCGGT 


3060 


ACCACCATAC 


GACACTGAGG 


GTGTGTCGAT 


AATAAATAAA 


TCTACGAACC 


CTTTATATTT 


3120 




ATTTATGTTT 


TGGATTATGT 


TTTCATCTnG 


CAGtAAAGCT 


TTAGTGATTT 


TAATGCTTGA 


3180 


SO 


ATATkTCTTT 


TTAATTTCCT 


GTAtAAAAtC 


AATAGATTCt 


GtGTAACTGT 


ATTGTGkTAA 


3240 




wTGaCGwATg 


CtTAAwACgT GTGCCAATGG 


T 






3271 




(2) INFORMATION FOR SEQ ID NO: 4 37: 









BNSOOaO: <£P 078651 9A2_I_> 



1340 



EP0 786 519 A2 



IS 



20 



2S 



(A) LENGTH: 1553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLCX3Y: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 
^0 ATAATGAAAT AATACTGTGT TTTATCTGCG AAATGTAtCA TTTTCTAATc GTTTCACAGT 60 

AAAATGAAAA GATAAAGTGT GTTTTTACTT GAATTTTGAC TAAAATTACT CTATATTTAT 120 

TAATTGAGCT ATGCTTATTA TTACAATTTG ATTACAAATT TTAAATTTGT TAATTGAATG 180 

ATAATATTAA ATAAAGAAAC TTACACAAGC AAATATGAGT TGTAGCCCAA AATACTTGTT 24 0 

AAATCAAAGT TGAAAGCTAC AAATAATGAA AATTATAAAC TTGAATCTGA AAGTAATTAC 300 

TATAATTATG ACAATGTTAA CTTTTAAACG CACTTATTAA TTAACTACAT AATGTTAATA 360 

TCTAATTTAT TCAAGTACTT TCGCAAGATT TATTATCTAA ATAACGGGGG AAAGAATCAT 420 

GAGTtCACAA AAAAgAAAAT TAGTCTTTTT GCGTTCTTCT TATTAACCGT AATAACGATT 4 80 

ACCTTGAAGA CGTATTTTTC TTATTATGTT GATTTTTCTT TAGGTGTTAA AGGTTTAGTA 54 0 

CAAAACTTAA TATTATTGAT GAATCCTTAT AGTTTAGTAG CACTGGTTTT AAGTGTGTTC 600 

CTATTCTTTA AAGGCAAAAA AGCATTTTGG TTCATGTTCA TAGGCGGCTT CTTATTGACG 660 

30 TTCCTATTAT ATGCCAATGT TGTGTACTTT AGATTCTTCT CTGATTTTTT AACGTTTAGT 720 

ACTTTAAACC AAGTAGGTAA CGTAGAATCT ATGGGTGGTG CGGTTAGTGC ATCATTCAAA 78 0 

TGGTATGACT TTGTTTATTT CATTGATACG TTAGTTTACT TATTCATTTT AATATTTAAA 84 0 

ACAAAATGGT TAGACACAAA AGCATTTAGT AAGAAATTTG TTCCTGTCGT AATGGCGGCT 900 

TCAGTAGCAT TATTCTTCTT AAACTTAGCT TTTGCTGAAA CTGACAGACC AGAATTATTA 960 

ACACGTACAT TTGACCATAA ATATTTAGTG AAATATTTAG GACCTTATAA CTTTACAC3TA 1020 

TACGaTGGTG TTAAAACTAT CGAAAATAAT CAACAAAAAg CGCTAgCATC TGAAGATGAC 1080 

TTAACaAAAG TATTAAAtTA TACGAAAcAA CGTCaAACmG AGCCTAACCC rGAwTATTAT 114 0 

GGGGTGGcAA rOAAGAAAAA TATTATTArG ATTCATTTAG AAAGTTTCCA AACCTTCTTA 1200 

ATTAATAAAA AGGTTAATGG TAAAGAAgTA ACACCGTTTT TAAACAAATT ATCAAGTGGG 1260 

AAAGAGCAAT TCACATACTT CCCTAACTTT TtCCATCAAA CAGGTCAAGG TAAAACATCT 1320 

SO GACTCTGAAT TTACAATGGA TAACAGTTTA TACGGTTTAC CGCAAGgTTC TGCCTTTTCA 13 80 

TtaaaaGGAG ATAATACGTA TCAGTCATTA CCAGCAATTT TAGATCAAAA GCAAGGCTAC 14 4 0 

AAATCTGATG TCATGCACGG TGACTATAAA ACATTCTGGA ACAGAGACCA AGTATATAAA 1500 
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(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 





n a R R R T^r^ Ti A 
AC-AAAAI V-^AA 


TCAGCAAAAT 


gattggatta 


AGACGCAAAT 


TGAGCGTTCA 


ATGGAAGGCG 


60 


15 


AAACAGTTGG 


CATTAATGAT 


CAAAATATAG 


AAATATATAG 


TGAAcGtTgA 


GATTTATACC 


120 




ATACACT CGT 


ACCTTTAAAT 


caagaattgc 


ATAAGTTGCG 


ACTTAAAACT 


CAAAACTTAA 


180 




UuAATGAAAA 


TTATAATATT 


AATGATGTGA 


AAGTTAAAAA 


GATTATTGAA 


GATGAACGTC 


240 


20 


AAAGACTA^C 


ACGAGAACTT 


cacgattctg 


TTAGTCAGCA 


ACTTTTTGCG 


GCmAGTATGA 


300 




1 Vat, i AXCTGC 


TATCAAAGAA 


ACGAAtTAGA 


ACCACCATTA 


GACCAACAAA 


TTCCTATTTT 


360 




2i/^ R R R 11 1/ < 

AUAVtAAAATG 


GTTCAAGATT 


CGCAGTTAGA 


AATGCGTGCT 


TTGCTGTTAC 


ATTTAAGACG 


420 


25 


V3V» X Iwijl 1 1 A 


R R R TV IS fry 

AAAGACAAAT 


CTTTAGGTGA 


GGGTATTAAA 


GATTTAGTTA 


TTGATTTACA 


480 




& A A A a 2V A<^TV^ 


^^R R TV^ R R R 

v-t-AATGAAAG 


TTGTGCATGA 


AATACAAGAT 


TTTAAAGTGC 


CTAAAGGTAT 


540 


30 


X WAMon X t»r\X 


11^11 vJAC? AA 


TTACACAGGA 


AGCAATTTCG 


AATACATTGC 


GTCATTCAAA 


600 




V7 1 vjACJAvjTAG 


AATTGTTTAA 


TAAAGACGAT 


TATTTATTGT 


TGAGAATTCA 


660 




A AT A a. TYSr^T 
x.rv\. X VjVj X 


AAAGGTTTTA 


^ ^^^^^^^^^^ % n 

ATGTTGATGA 


AAAATTAGAA 


CAAAGTTATG 


GACTTAAAAA 


720 


35 


TATV^OTTYS A A 


AGAG CTTTGG 


AAATTGGTGC 


AACGTTCCAT 


ATTGTATCAT 


TGCCAGATTC 


780 




AGGTACACGT 


ATCGAGGTGA 


AAGCACCTTT 


AAATAAGGAG 


GATTCGTATG 


ACGATTAAAG 


840 




TATTGTTTGT 


GGATGATCAT 


GAAATGGTAC 


GTATAGGAAT 


TTCAAGTTAT 


CTATCAACGC 


900 


40 


AAAGTGATAT 


TGAAGTAGTT 


GGTGAAGGCG 


CTTCTGGTAA 


AGAAGCAATT 


GCCAAAGCCC 


960 




ATGAGTTGAA 


GCCAGATTTA 


ATTTTAATGG 


ATTTACTTAT 


GGATGACATG 


GATGGTGTAG 


1020 




AAGCGACGAC 


TCAGATTAAA 


AAAGATTTAC 


CGCAAATTAA 


AGTATTAATG 


TTAACTAGTT 


1080 


45 


TTATTGAAOA 


TAAAGAGGTA 


TATCGTGCAT 


TAGATGCAGG 


TGTCGATAGT 


TACATTTTAA 


1140 




AAACAACAAG 


TGCAAAAQAT 


ATCGCCGATG 


CAGTTCGTAA 


AacTTCTAGA 


GGAGAATCTG 


1200 


SO 


TTTTTGAACC 


GGAAGTTTTA 


GTGAAAATGC 


GTAACCGTAT 


GAAAAAGCGC 


GCAGAGTTAT 


1260 


ATGAAATGCT 


TACAGAACGA 


GAAATGGAAA 


TATTATTATT 


GATTGCGAAA 


GGTTACTCAA 


1320 




ATCAAGAAAT 


TGCTAGTGCA 


TCGCATATTA 


CTATTAAAAC 


GGTTAAGACA 


CATGTGAGTA 


1380 
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(2) INFORMATION FOR SEQ ID NO: 43 9: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 608 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 
ATGTnGGCGG ACGTAAAGGA CATGTTTATA CTGATGATCG AGCATTAGAT ATTGATATCG 
TTCCGCCTGC TCAAGCAGAT GGTAAAGCTA CTAACCCCGA ACAATTATTT GCAGCAGGTT 
ATGCATCTTG CTTCAACGGT GCTTTCGACC TAATTTTAAA GCAAAACAAA GTGCGTGATG 
CTCATCCAGA AGTAACACTA ACAGTGAGAC TAGAAGATGA TTCAGACTCA GAAAGTCCTA 
AATTAAGTGT TTCAATTGAT GCGACAATTA AAAATGTTAT ATCTCAAGAA GAAGCTGAAA 
AATATTTACA AATGGCTCAT GAATTTTGTC CATATTCAAA AGCGACTCAA GGAAATATTA 
ATGTCGATTT AAATGTAAAT GTTGTAGATT AGCATTAACT TAAAGAGATT ATTCAACGTT 
ATTAATAAAA TTCACATAAA ATTCAAATTt StCrAcCAAA AATTTTTGGT TGGyTATTTT 
TTCTATTCGT GATTGAAATT TCTGGCAATA TTTAACTGAA AATGATTGTA CCTTAGTCAT 
CATAAATGTG ACCGGTTCCA ATACTGGCTT GACTTCTTCG CATACCGTCT ACAAATAAAA 
GTCCAGTG 

(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 
CCTTTCAAtT TCTCCCAAgA TTTACGCATG TCTGACCAAA TGTCAATACA ATACCTGAAT 
AATGATTTCT GGAGTAATTG CGAAAAATAT TTAAAAGTTG CTATAGATCA ATTTTCAAAT 
TATAGTATCT CATCTCAAGT TTCTAACTAT CATTTCACAG TATTACTTGG AGATCGCCAA 
AAACCACTTA TGTATCTAAA TAAAAATCGC GGTGGTGATG GTGGCATACC AGGTTATATT 
ATGATTTATT TAGTGCCGAG TACAAGTACA ATTAATTCTA TGAAAAGCTT AATTGCACAT 
GAAGTAAATC ATAATATGCG CTATCAATAT ATTGATTGGG ATGGCGGAAG TTTGATTGAA 
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10 



15 



20 



25 



30 



35 



45 



SO 



ATTGGACCGT GGGTAACAAA TACTAATTGA AGTCGTGATA ACGTAAAAAT TAAAAATACT 430 

ATTTATAATC ATTTACATTT AAAGCATATA TTTGAATCGA tGCCTTATCT CTATGGTGAT 540 

GATATTAATA AACTTCAAGG TAGGCCTATC GTTGGCTTAT CTCATGCTGC CGGGTATGCA 600 

TGTGGCTATC ACTTGGTAAA ATACTTTTTA CAAAAAACAA ACATACCTAT TGAAGTTGCT 66 0 

ACAACACTTC CAGCACAAAA AA ^32 
(2) INFORMATION FOR SEQ ID NO: 441: 

( i ) S EQUENCE CHARACTER I ST I CS : 

(A) LENGTH: 2574 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 

ATCTTCTTTT AACATGACAA ATTGCAACAA AATATTAATG ACGCCAAAAG ACATTTTTTC 60 

ACGTTCAATT AATTCTTCAA CCATTGTCTT TTGCGATATA GTTGGTTCTG ATTCAGACCA 120 

AGAAGCTAAC ATATCAATTG GACTCGTTTG TTCAAGTAAC TCAAACCATT CATCACTTTG 180 

TGGCTTTGGA TTCACTTCTG AAGATTTGCC CGCCGAAGAT GATGTAGCAG GAGATTTCAC 24 0 

CTGTAATTTA GGCATTTGAT TTTCGTGTTC CATTAAGTAA TACGAGCGTG CTTGTTTACG 300 

CATTTCTTCA AAGGATAACT GTTGTCCACT TGTAATTGAA TTTAAAATAA CATGCTTCAT 360 

GCCATCTGCT GTTAAACCAT ATAAAGTCGC GAGTTGTGTA ATTAAACGCT TTGCATCTTT 420 

GGTAACAATG TCTTGACTAA TAAAATGTTT ACCTAACATT TGTCTCAACA TTTCAAAGTC 4 BO 

AAAAGATTCA TTTGATAAAT CGATACCTTG GTACGGTTCA TTAATCGGaA TATCACTTGT 540 

ATCGATATCT ATTTTTGTAG ACGGCACTTT AAAAACATCA GTAAATTGTC TT G TT A CCTG 600 

40 TTTAAATTCA CTCAAATCAA TTTGTTGATA CTCAAAGTAT TTCTTCAACT CATGAAATCG 660 

ACGATGCTCG ACTTCACTAT ATAAAAAGAT TGACAACATT GGATCATTAA AAAATAAATG 720 

TGCTGAAGGG GtTGaATTAA TTGGTAAACA AATTGTGTTT CTTGTTCATC ATGTTTGACA 780 

AACGCCTTTA ACAATCCAAT CGCTTCAAGT AAGTCCATTT GTTGTCTAAA CTCTAGTAAA 34 0 

TTAATTTTAA GTTCATTCAT AAAAATATAA TGAGAAAGAA TCAATGTTTC ATTATGACTT 90 0 

TCTTTAACGA ATTGAGTCAT AAAATGATAT AAACCCACTG CTTGCGTTCC AATTAGCGGT 96 0 

GTATACAGTC GATTCAATAC CTCTAAATGA TTCGTATTTA AATCAAAGTG TTGCATAACT 102 0 

TTGAATTGAT CCTTTGGTCT TAAGCCGAAT TCGAAGGCTT GTCGTCCCaT TTAAGCnATC 1080 
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AAATTCTTTA 


TAGACTGATG 


CAAATCTAAC 


ATATGAAACT 


TGATCAACAT 


GCATTAACAA 


X200 




GTTCATAACG 


TGTTCACCTA 


TATCTCGTGA 


AGACACTTCC 


GTATGACCTT 


CATCTCGTAA 


1260 


5 


TTGCCATTCA 


ACCTTGTTAG 


TTATGTCTTC 


AAGTTGTTGA 


TATCTAACTG 


GTCGTTTCTC 


1320 




ACAAGAACGC 


ACAAGTCCAT 


TAAGTATCTT 


TTCTCTTGAA 


AACTGCTCTC 


TTGTGCCATC 


1380 


10 


TTTTTTCACA 


ACTATAAGCT 


GACTAACTTC 


GATATGTTCA 


AATGTAGTGA 


AACGTGTTCC 


1440 


ACAATTTTCA 


CATTCTCTTC 


GTCTTCGAAT 


GGCATTTAAT 


TCATCGGCAT 


GCCTTGAATC 


1500 




TACAACTTTA 


GATTGTGTAG 


AATTACATTT 


CX3GGCATTTC 


ATTACATCAC 


CCTCTTTATT 


1560 


IS 


TTGATTATGC 


CTAATTATAC 


TATAAATCTA 


GAGATGAAAA 


AAGAATCCCT 


CAATTTAATT 


1620 




CATTTAACCA 


AATAATGAAA 


CAATAAAAAA 


CATTATATCG 


TTACTTATTA 


AGTAATTTGC 


1660 




ATGACAATAT 


TATTGTATTA 


AAAATAAAAA 


ACCTAACTCC 


GAAGTCAGAG 


TTAGGCTATA 


1740 


20 


AATTAATTGT 


ATTAACTTGC 


ACTTACAGTT 


TCTTTTGATG 


TCAAAAGTGC 


TCCAATTTGC 


1800 




TCAGCAACAT 


CTACAACTCT 


ATTTGAATAA 


CCCCATTCAT 


TATCATACCA 


AGCAATAACT 


1860 




TTTACTTTAT 


TCCCTGACAT 


GACCATTGTT 


GATTTTGCAT 


CAATAATAGC 


TGAATTTGGA 


1920 


2S 


TTAGTATTAA 


AATCAACAGA 


CACTAGTGGT 


TGATGTTCGA 


CTTCTATGAT 


ACCTTCTAAA 


1980 




CCTGCATTTT 


CAAAAGCTTG 


GTTTACTTCT 


TCTGCAGTTA 


CTTCTTTTTC 


TAAATCAACA 


2040 




ACTAAATCAA 


CGAGCGATAC 


ATTCTTTGTT 


GGTACACGTA 


ATGCCATGCC 


GTGTAATTTA 


2100 


30 


CCTTCTAATT 


CTGGTAATAC 


TTCTTTTAAA 


GCTTTCGCCG 


CACCAGTAGA 


AGTAGGAATA 


2160 




ATGCTTTCAT 


TACATGAACG 


TGCACGTCTT 


AAATCTTTAT 


GTGGATTATC 


AATATTTTTT 


2220 


35 


TGGTCATTTG 


TAATAGCGTG 


AACAGTAGTC 


ATTAAACCAT 


TAACTATTCC 


AAACTGATTA 


2280 


TTTAAAACTT 


TTGCAACTGG 


ACCAATGCAA 


TTAGTAGTAC 


ATGAAGCATT 


ACTAAAAATG 


2340 




TCAAATGCTT 


CTATATCTAA 


TTGGTTATCA 


TTTACGCCTT 


TAACTACCAT 


TTGAACATGT 


2400 


40 


CCACCTTTTG 


nAGGACCAGT 


TAACAAAAsT 


TTTtTGGCAC 


CTGCTTTAAT 


ATGTGCGATG 


2460 




GCTTTATCAC 


CATGaTTAAA 


TTTACcAGTT 


GCATCTATAG 


CAATATCGAT 


ATCTAATTCT 


2520 




TTCCATGGCA 


AGTTTTCAGG 


ATTGCGATCA 


GCAACCAATT 


TAATTTTATG 


ATCT 


2574 



45 (2) INFORMATION FOR SEQ ID NO: 442: 



(i) SEQUENCE CHARACTERISTICS: 

(A; LENGTH: 3326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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CAAGGTACCG 


GTTTAGGTTT 


GTTCATTTGT AAAATGATTA TCGAAGAGCA TGGTGGTTCC 


€0 




ATAGATGTTA AAAGCGAATT 


AGGGAAAGGC ACAACATTTA 


TTATTAAACT 


ACCAAAACCA 


120 


5 


GAATAAAATT 


GAATATAGTT 


ATTTCAGAAC GCATGTTATT 


GATTAGAGAC 


TCTAATTTAT 


180 




AGCATGCGTT 


TTTTGATTGA 


TGTGGGGAAT TTTGTATGTG 


GATTAGAACT 


TAGGGTTTTT 


240 


10 


GCGAATATCA 


ACTATTAAAT 


ATATTACTAA TTTATACAAA 


AATATAAAGT 


TTGATAAAGT 


300 


TATTTATTTG 


ATTATAAAAA 


TAGGGTAAAA TATAGATATA 


TTGTATTAAT 


TAAATTATTC 


360 




GAGGTGTCAT 


ATGAAAAAAT 


TCATTGGATC AGTTTTAGCT 


ACX3ACATTAA 


TTTTAGGGGG 


420 


IS 


ATGTTCCACG 


ATGGAAAATG 


AATCAAAAAA AGACACAAAA 


ACAGAAACAA 


AATCTGTACC 


480 




AGAAGAAATG 


GAAGCTTCAA 


AATATGTAGG CCAAGGCTTC 


CAACCGCCTG 


CAGAAAAAAA 


540 




TGCGATTGAA 


TTTGCGAAGA 


AGCATCGTAA AGAATTTGAA 


AAAGTAGGTG 


AACAATTCTT 


600 


20 


TAAAGATAAC 


TTTGGACTAA 


AAGTTAAAGC TACAAATGTT 


GTAGGTAAAG 


ATGATGGTGT 


660 




AGAAGTTTAT 


GTGCATTGTG 


AAGATCATGG CATTGTATTT 


AATGCAAGTC 


TACCTTTGTA 


720 


25 . 


CAAAGATGCC 


ATCCATCAAA 


AAGGATCAAT GCGCAGTAAT 


GACAATGGTg ATGATATGAG 


78 0 


TATGATGGTG 


GGTACAGTGC 


TGAGTGGCTT TGAATATCGA 


GCGCAAAAAG 


AAAAGTATGA 


840 




TAACTTATAT 


AAATTCTTCA 


AAGAAAATGA AAAGAAATAT 


CAATATACAG 


GCTTTACAAA 


900 




AGAGGCAATT 


AACAAGACAC 


AAAATGTCGG ATATAAAAAT 


GAATATTTTT 


ATATTACATA 


960 




CTCTTCTAGA 


AGTTTAAAAG 


AATATCGAAA GTATTATGAA 


CCACTGATTC 


GAAAAAATGA 


1020 




TAAAGAATTT 


AAAGAAGGAA 


TGGAACGAGC AAGAAAAGAA 


GTGAATTACG 


CTGCAAATAC 


1080 


35 


AGATGCTGTT 


GCTACACTTT 


TTTCTACTAA GAAAAACTTT 


ACTAAAGACA 


ATACAGTAGA 


1140 




TGATGTAATC 


GAACTAAGTG 


ATAAATTATA TAATTTAAAA AATAAACCAa ATAAATCTAC 


1200 




AATCACAATA 


CAAATAGGGA 


AACCCACTAT TAATACTAAG 


AAAGCCTTTT 


ATGATGATAA 


1260 


40 


TCGTCCAATA 


GAATATGGGG 


TGCACAGTAA AGATGAATAA AATTAATGAT AGGGATTTAA 


1320 




CAGAATTAAG 


TAGTTACTGG 


GTTTATCAAA ATATTGATAT AAAAAAAGAA 


TTTAAAGTTA 


1380 


45 


ATGGAAAAAG 


GTTTAAACAA 


GTAGACAGTT ATAATGATGA TAAGAATAGT AATTTGAATG 


1440 




GTGCTGCTGA 


TATTAAAATA 


TATGAGTTAT TAGATGATAA 


AAGTAAACCA 


ACTGGTCAAC 


1500 




AGACAATAAT 


TTATCAAGGA 


ACATCTAATG AGGCAATTAA 


TCCAAATAAT 


CCATTAAAAT 




SO 


CATCGGGGTT 


TGGAGATGAT 


TGGCTCCAAA ATGCTAAATT 


AATGAATAAT 


GATAATGAAA 


1620 




GCACAGATTA 


TTTAAAGCAA 


ACAGATCAAT TATCAAATCA 


ATATAAAATA 


AAGTTAGAAG 


1680 




ATGCAGATAG 


ATTATCAAAT 


AGTGATTTTT TAAAAAAATA 


TAGAATGGAA 


TCAAGTAACT 


1740 
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ATCAAGGAGC GAAACATCCG AATGAAAAAG TTGTTGCTAC TGACTCAGCA ATGATTCCTT 
ATGCTGCTTG GCAGAAATTT GCTAGACCAC GCTTTGaTAA TATGATTAGT TTTAATAGTA 
CCAACGATTT ATTAACATGG TTACAAGATC CATTCaTCAA AGATATGCCA GGAAAACGCG 
TTAACATTAA TGATGGTGTG CCCAGGTTAG ATACTTTAAT AGACAGCCAT GTAGGTTATA 
AAAGGAAGTT AAATAGAAAA GATAACACAT ACGATACTGT ACCACTAATC AAAATAAAGT 
CGGTAAAAGA TACAGAAATT AAAAATGGAA AAAAAGTAAA AAAGACTATT AACATAACAT 
TAGATATGGA TGGGCGAATT CCAATAAATG TTTGGACAGG AGATTCGATT GCACGTTCTG 
GAAGAGGAAC TTTAATTAAA CTTAATTTAG AAAATCTTGA TGCGTTGAGT AAACTGATTA 
CTGGTGAAAC AAGTGGTATG TTAGCAGAAT GCGTAATCTT TTTAAATGAA AGTTTTAACA 
TCTCAGAAAA TGAAAATAAA AATTTTGCAG ATAGAAAGAa ACAATTATCA GAAGGATTTA 
AGGATAAGAT TAACTTATTT CAGTTAGAAG AAATGGAAAG AACTTTAATT AGTAAAATAA 
ACTCACTTGA AGAAGTTGCA GATGAAACAA TAGAAAGTAT TAGTGCTGTT AAACACTTAT 
TACCTGATTT TGCATTGGAT GCATTAAAAG AAAGAATTAA TGAGTTGTTT AAAGGTATAA 
AATCTTTTAT AGAAAAAGTG TATGATAGTA TAGATAATGA AATTTTAGAA ATTTTCAAAA 
ATATAGATCA CXJACTTCAGA GATGGAGTAT CTGAAGAAAT GATGAAACAT TTGAAAGTAG 
TGAAACAGAA TATAGACCAA ATAAAAAATC AAAATGATAT TTATGGTAGG CAAATTGCAG 
ATATTAGAAG TATTATGAAA CAACAAGATG CAACAATTTT AGATGGAAAT TTTCAAATTA 
ATTGTAGCGG CGAAAATATG GTACAGGGTC TAGTTATACC TTCTAATTAT TTAGGAAGAA 
AAATGAAAAT ATTAAAAGAC CATATCGATG ATGGTATTAA AAAAATAGCA GACTATGTTC 
AAGGTATATA TGATGAATAT GCATCGAAAA TTGTCGATGT AATAAAATAT TTGATTAATA 
CAATTCCCAA AATACGTAAG AATTTAAGAC ATGCAATTGA AATGTTAAAT GTAAAAAAGA 
AAGAATTTTT GTCCCTGATT CCTAATGTAA CTTGTAATTA TATTAAAACT AAATTAGAAG 
AATTAGATAA TACTTTAGGC AAATGGGAGC CTTTTCTTAA TGATTTAAAA GCAGTGTCAC 

CAATTTTAGA TAACCATTTA GATGATATTG TTAAGAACAT GAAGCCTTTG ATTGTACAAA 

TGAyATwTGA ACCATCACAT TATGaCGATA TGTTTAATTC aAGAAAAGCT TTAACGcCAG 

TGTTCTCAAG CGTTTTATAA AGGTTG 

(2) INFORMATION FOR SEQ ID NO: 443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 



5 


ACCTATAGCT 


GGCCCGTCAA 


GTCCTGAATC 


ATATGTCGCT 


ATAAACCCAG 


TTGCATGCCC 


€0 




AGCAATCTCG 


GTTTCAATTT 


CAAAATCATG 


CTCTTTCAAA 


CGATCTATTA 


AAGTTCGAGA 


120 




CGCAAATATT 


TCTTCATTAC 


CAAGTTCAGG 


ACGTTCATGA 


ATTCTATGAC 


TGATTTCGAT 


180 


10 


ATAACTATAT 


TTATTTGTTT 


CTATATAATC 


GAGAATTTGT 


TGTTTTTCAC 


TCATTTrrrA 


240 




CTATCTCCCT 


TTTACCCTAC 


ACTCATTTTA 


ATCATCCACA 


ATATTTTGTT 


CTTTCAAAAT 


300 


15 


GAATTATTAC 


TTATTCTATC 


GGTTTTATCT 


CATGATGTCA 


TCTAGTTTTT 


CTTTATTTAC 


360 


AAAATTTTCT 


AATAACTAAA 


aGCCTTTCAT 


AAATTTATAA 


AACAGTTTCA 


AATTGTAAAA 


420 




CATAAGCCCT 


ATTGTTACAA 


TTTAAGTATT 


GTTAGAAACA 


CAACTTACAA 


CAAACTTAAG 


480 


20 


TTTTATAATG 


CAACAAATCA 


TAAGCGTTTT 


ATAGTTTTGA 


GTAGTTAATA 


GGAGGAAATT 


540 




CAAAATGACA 


AAAATGAATG 


TTGAAAGTTT 


cAATTTAGAT 


CaTAcTAAGG 


TGGTTGCCCC 


600 




ATTTATTAGA 


TTAGCGGGAA 


CX3ATGGAAGG 


ATTAAACGGA 


GATGTCATTC 


ACAAATACGA 


660 


25 


CATTCGTTTC 


AAACAACCAA 


ACAAAGAACA 


T ATGGATATG 


cCCGGACTAC 


ATTCATTAGA 


720 




ACATTTAATG 


GCTGAAAATA 


TTAGAAATCA 


TAGTGACAAA 


GTTGTTGATT 


TAAGTCCTAT 


780 


30 


GGGTTGCCAA 


ACTGGTTTCT 


ATGTATCATT 


TATTAATCAT 


GATAATTATG 


ATGATGTATT 


840 


AAATATTGTT 


GAAGCAACTT 


TAAATGATGT 


GCTAAATGCT 


ACTGAAGTGC 


CTGCTTGTAA 


900 




TGAAGTACAA 


TGTGGCTGGG 


CAGCAAGTCA 


TTCATTAGAA 


GGTGCTAAAA 


CTATCGCTCA 


960 


35 


AGCATTTCTA 


GACAAACGAA 


ACGAATGGCA 


TGATGTTTTC 


GGTACAGGAA 


AATAAATCTT 


1020 




AGTCAATCAA 


GTTAATCAGA 


AAaGCAGTCG 


AACAATGATT 


TTACAATCGC 


CATTGTCCAA 


1080 




CTGCTTTTI'A 


TTATGCTTCA 


AAGTCAAAAA 


ATCGAACAAA 


TGAAAAAGTA 


AAATCTTTAA 


1140 


40 


CATTTGTCCG 


ATTTATTTGA 


GAACCACTAT 


AATTTCTTAA 


TTAGTCCCAT 


TAACACGAAC 


1200 




TGCATAGGTA 


ACCTTAAATA 


TAGTTGCCAT 


GTTGGCAATT 


GTTTATCTCC 


TAAAGGTAAC 


1260 


45 


TTTTTAACTG 


CCATATAGAT 


ATTAGCTGGG 


AATACAGCTA 


GCAAGAATAG 


ATTGATTGTA 


1320 


■rrrrrcAAGC 


ATTGAGATGG 


TCTTTTAATT 


AAAAGTGCAA 


GTCCAAATAA 


TATCTCAAAG 


1380 




ATTCCTGTAA 


CAAGAACCGC 


TGTTTTTCGA 


AGTGGCAAAC 


ATTTTCGGTAT 


GATATTTCTA 


1440 


SO 


AATTGTCGTT 


CTCGTGTAAA 


ATGCAATACA 


CCTATTACAC 


TAAAACCTAT 


TCCTAATAAA 


1500 


TATCTTAGTA 


TGTTCAATCA 


GCTTCAACTC 


CTATTCTGTA 


ATGATTTTAT 


GAATTAATGT 


1560 




AGGCGATACA 


ACATGaTCAG 


CAATTGTTAT 


GCTTGAATCT 


AATTTTTTAA 


CAACATCGTC 


1620 
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AATTTTTTTA TTTAAAACAA TACCAACCGC TAAATCAATA TCATCCTCTT TTGTTAAACG 174 0 

TCCCGCTCCT AACATCATCG AAGCGACACC TATATCGTTA GAGACTAATT CAGTCACATA 1800 

ACCTGATTTT TTAGCTTTAT ATTCAATTTG ATATTGAGCT TGTGGCAAAC GCTCTGGATG 18 SO 

GTCAATAACA GTTTCGTCGC CACCTTGGTT TTTAATAAAT GTTTTGAATT TTTCTAATGC 1920 

TGCACCTGAA TTAATTGCCT CAATTAGCAA CGCTCTCGCT TCTTCAAGCG TTTCAGCTTT 1980 

GTTTGCAAGT ACAACCATTT GAGAACCTAA TGTTAATACA AGTTCTGTTA AATCTTTCGG 204 0 

ACCTTGTCCT TTCAACGTAT CAATTGCTTC TTGTAACTCA AGCGCATTGC CAATCGCACG 2100 

TCCAAGTGGC TGATTCATAT CAGAAATAAT CGCCATCGTA TTACGTCCCA CATTATTACC 2160 

AATACGTACC ATTGCGTGCG CTAATGCTTC AGCATCTTCT AATGTTTTCA TAAATGCACC 2220 

GCTACCAGTT TTTACATCTA ATACAATTGC ATCTGCACCA GCAGCAATCT TTTTACTCAT 2280 

20 AATTGAAGAG GCAATTAATG GTATTGAATT GACAGTACCA GTAACATCCC TTAAGGCATA 2340 

TAATTTTTTG TCTGCAGGAG TTAAATTTCC TGATTGTCCT jPlCAACTGCCA CTTTATTTTC 24 00 

ATTAACCAAT TTCACAAATG TTGCTTCATC TATTTCAACA TGAAAACCAT CAATTGCTTC 2460 

TAATTTATCA ATCGTACCAC CTGTATGACC TAATCCACGC CCACTCATTT TTGCAACAGG 2520 

AACATCTACA GCTGCTACTA ATGGTGCTAA AACCAATGTA GTTGTATCTC CTACACCACC 2580 

TGTTGAGTGC TTATCTACTT TGACACCTTT AATATCACTC AAATCTATCA TATCACCAGA 2640 

ATTAACCATA GCCATCGTTA ATGCTGCACG CTCATCATCA TTCATATCTT GGAAATAAAT 2700 

CGCCATTGCT AAACTTGATG CTTGGTAATC AGGAATATCC CCTTTAACAT AGCCGCCAAT 2760 

AAAGAAATTA ATTTCTTCCG TTGTTAGTGT ATGACCGTCA CGCTTTTTCT CAATAATGTC 2820 

TATCATTCTC ATTTTTATCA TCCTTTTCTT AAAAAGCTTA GGACAAAGCA TCTGCGCTTT 2880 

CTCTAGTCCA TTTTTAAAAG CACAAGCGAA AATTATTATA GCAAGCTATC GATTTCATTT 2940 

40 TTAATATCAC AATTTCAATG CGATGTTATT ATTCTTAAAT AGATTGGTTA TAACGTTAAA 3000 

GTCCCTATTA AATTATCTTA GAATCATCAT GGCATTTATG ATGTCTTAAA GCTGATATCG 3060 

ACATACTTAT ATATGGTTAC GATGTCCCAT GCTTACATAT TTTTATAAAA TTAGTAATCT 3120 

GAATCTGCTT CTAAACCTTG CATAATTTGa ACGCCTGCGC TCGCACCAAT ACGTGTCGCA 318 0 

CCTGCTTCAA CCATTTTATT GAAATCTTCT AAATTACGTA CGCCACCTGA TGCTTTTACT 324 0 

TCTATATCAG CACCTACTGT ATCTTTCATT AATTTAACGT CTTCTGCAGT CGCACCGCCA 3300 

CCTGCAAAAC CTGTTGAAGT TTTAACGAAG TCCGCACCAG CCGCTTTTGT TAATTCACTC 3360 

GCTTTTACAA TTTCGTCATG GTCCAACAAT ACCGTCTCAA TAATCACTTT TACTGTGTGA 34 20 
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TTTAATGCGC CGATGTTGAT GACCATGTCA ATTTCATCTG CACCATTTTG AATCGCATCT 3 54 0 

TCTGTTTCAA ATGCTTTCGT CGCAGTTGTC GATGCACCTA ATGGGAATCC TATTACCGTA 3600 

CAAACTAACA CCTCTGAATC AGCTAGTCGC TCTGCTGCAT ATTTAACATG CGTTGGATTC 366 0 

ACACATACAG ATTTAAAATG GTATGCTTTC GCTTCATCGA TGATTTGATC GATTTGCGTA 372 0 

CGTGTTGACT CAGGCTTCAA TAAAGTGTGA TCAATCAATT TTGCACTATT CATTTTCTAT 3780 

CTCCTCCTTT ATGGTTGATT ATAAAAATAC GGTTGTAAAT TAGTTGATTG AGCGTCAGGT 3 84 0 

TCATTTAAAT ATCAGGTTAG ATGTTCGCTT TTTATGTAAC CGCATACATA TACTATTACA 3 90 0 

TTAATTCATT TCCCATAAAC AAACAATACA ATTGAACGTG ATATCTTCAT TATGAACGAT 3 960 

GACTTGACAA CAAGCTAATC AGGATTATAT TTTTATAATT CTTTAATTCT ATAGTACAAA 4 020 

AATTCGCAAA AAAGGGAAAC AAATGTTATC TTAAAATTAT TAATGAATAT TAAGGAGAAG 4080 

ATAACAAATG ACAAAAGGTA CACCACATAT TCAACCAAAT GGAGTAAAAA TTGCTAAAAC 414 0 

AGTATTAATG CCTGGCGATC CGCTACGTGC AAAATATATT GCTGATAATT TTTTAGAAAA 4 200 

TGTTGAACAA TTTAACGATG TACGTAACAT GTTTGGTTAC ACTGGTACAT ATAAAGGTAA 4260 

^5 AGAAGTTTCT GTAATGGGTT CTGGTATGGG TATTCCAAGT ATTGGTATTT ACTCATATGA 4 320 

GTTATACAAC TTCTTTGATG TAGATACAAT CATTCGTATC GGTTCTTGTG GCGCATTACA 4 3 80 

AGAAAATGTT AACTTATACG ATGTTATTAT TGCACAAGCT GCATCAACTA ATTCAAATTA 4440 

TGTAGATCAA TACAATATTC CAGGTCATTT CGCGCCTATC GCTGACTTCG AGTTAGTAAC 4 500 

TAAAGCTAAA T^TGTCGCTG ACCAAATCGG TGCTACTACA CACGTAGGTA ACGTATTATC 4560 

TTCTGATACA TTTTACAATG CCGATCCAAC ATTCAATGAT GCTTGGAAAA AAATGGGTAT 4620 

TTTAGGTATC GAAATGGAAT CAGCTGGTTT ATATTTAAAT GCGATTCATG CTGGTAAAAA 4680 

AGCACTTGGT ATTTTCACAG TAAGTGATCA TATTTTACXST GACGAAGCTA CTACACCTGA 4740 

40 AGAACGTCAA AATTCATTTA CACAAATGAT GGAAATCGCT TTAGAAATCG CAGAGTAACT 4800 

TATTTAAATT GACTTTAATT GCTCTTTAAC AATGCGATTA AACTCAAAAA GCCAACACAT 4 860 

TCTGGGCGTA TCCCCATTTA TGTGTTGGCT TTTATTTATA TTATTACTTA TCTGTAGATT 4 920 
''^ AGCTTAAGTA AGATTTAAAC ATCCAATTAT GTTTATCTAC TGATGTTTGC ATACCTATAA 4 980 

ACATATCTTC TGATACATCA TCGCCAGCAT TACCAGCAAT TTCGATTGCG TTTTCTAATT 504 0 

GTTTTGAGAT ATTTGTGAAG TCTTGTGATA ATTCTTCAAC CATTTGTTCT GCAGAGTAAC 5100 

SO 

CTTTCGCAGC TTCTTTAACA ATTGATTGCT CTAAGCATTC AGTTAATGTA CCTACAGGGT 5160 
TTCCTCCTAC CGCTAAAATT CTTTCAGCTA ATTCGTCTAC ATATTGGCTT GCTTCATTAT 5220 
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AATTGTGTAG CTTTGTGTAA G 
(2) INFORMATION FOR SEQ ID NO: 444: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 



IS 


GAAGATGAAA 


GCTATATTGA 


AGATGATGTT 


ACTAAAAAGG 


AAGCTATTTT 


AAGTATGCAA 


60 




ATTCCTAAAG 


GTTTCTCTCA 


AAAATTAAAA 


GAGAACCGTT 


TAAAAGAAAC 


GATACAGTTA 


120 




TATGGTAGAG 


ATGACTTTAT 


AGGTGGTATT 


GCTGTAGAAA 


TTGTTAGTAG 


TTCATTATAT 


180 


20 


GAGCAGCAAA 


TTCCTAACAT 


TATTTATGAA 


CACCTTGAGG 


ATATGAAACA 


GCATCAATCC 


240 




ATCGATGCTA 


TCAACAAGTC 


CTATCATAAA 


CATACACCTG 


AATCTAAAAT 


CAAATTTGTG 


300 




TCGCTTACTA 


AACAAGCACA 


ACACTCTATT 


TCAATTAGCT 


TAATCTTTGC 


GGTGATTTTA 


360 


25 


TTTGTTAGCG 


CTGTTCAAGT 


AGTACTTCAT 


TATCGTTTAA 


ACCAACAAGC 


AGCATTGCAA 


420 




CGATTATCAC 


AATATCATTT 


AAGCCGTTTC 


AAACTATATA 


GTACTTATGT 


AATGACACAT 


480 


30 


ACGATTTTGT 


TATTGTTGGT 


ACTATTGGCA 


GTTAGTCTAT 


ATTTGTCTCA 


ACCACTCAGC 


540 


TTAATATTTT 


ACTTAAAATC 


ACTGTTACTT 


ATATTGATTT 


ATGAGATAGG 


TATCGTCTTT 


600 




ATCTTATTCC 


ATATTCAAAC 


AATAAGTCAT 


CGATTATTCA 


TGACATTTAT 


ATATGCACTT 


660 


35 


GCTATGGGTA 


TCGTATACTT 


GATTATTTTC 


ATGTAAAGGA 


GCGTAACTGA 


TGATAGAAAT 


720 




TAATAACCTT 


TCAAAGCGTT 


ACCGTAACAA 


ACAGATTTTC 


AATCATTTAA 


CTATGTCCTT 


780 




TGATAGTAAT 


CGTTTAACCG 


TATTACTTGG 


TGATAATGGT 


GCTGGAAAAT 


CAACATTACT 


840 


40 


TCGTATGATT 


GCTGGTATTG 


AAAAAGCTAA 


TGATGGAACT 


ATCAACTATT 


TCGGCGAAAA 


900 




ATGGAATCAA 


AGACaAATAC 


aAAATCACAT 


CGGTTATGTG 


CCACAAGACA 


TTGCGTTATT 


960 




TGAACACATG 


ACAGTGGCTG 


AAAACATTAA 


ArrrriTAAA 


TCACTTTGTA 


AAAATCCAAT 


1020 


45 


TAACGATACA 


ACTATCAACG 


AATATTTACA 


GCAATTAAAC 


TTTGATGATA 


CGTCTGCCAA 


1080 




AGTATCTACA 


TTGTCCGGTG 


GGAATAAACG 


TAAAATTAAT 


ATATTAGTAG 


GTTTACTAGG 


1140 


SO 


TCAACCTCGA 


ATTCTCATTT 


TAGATGAACC 


GACAGTTGGT 


ATTGATTTAA 


AATCTAGACA 


1200 


TGACATCCAC 


CAACTACTTA 


ACATCATGAA 


ATCTAAATGT 


TTAATTATAT 


TAACTACCCA 


1260 




TCATTTAGAT 


GAAGTTGAAG 


CACTTGCAGA 


TGATATCAAG 


TTAATTGGCC 


AAGATCCTTT 


1320 
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CCCAAGCTGC GTATGATATC GCAACTTGGG ATTTTCTGTA TTATCTACTT TGCAAGTATG I440 

ACGTTGGGTC TACTGCATAT TGATTACCGA TGCCACCAGA CATACGTTGG AAGTGTACGT 1500 

GAGGCGCTGT TGAATTACCC GTACTACCTG AATATGCAAT TTGGTCACCA GCTTTGACTT 1560 

TATCACCAGC TGAAACAGTT AAACGATTAT TATGCATATA CCATTGGTAG TTATTACTGT 1620 

TCGCTTCTTT AATCGTTACT TGATTGCCGC CACCATAGTT ACTCCAACCT GCTTGTACTA 1680 

CTGTACCATC AGTTAATGAG TAAACTGGTG AATTTTCAGG CATTGCATAG TCGACACCGT 1740 

AATGCGCACC ACCACCGTGA TATTGTCCAT ATGGTTGTAG TTGTTTACGA CTTGTTAACC 1800 

,5 AGCTTGCGTC TTTCGCATGA CCACTAGcTG TCGCTTTACT TGCTGATCCA CCATTTTGAT 1860 

TAGATGTGCC ATTAGGATAA TTGACCTTTC CATTACCATC ATGGCTGTTA TACGCTTGGT 1920 

TGTTGTTACT ATGTGAATAA TAGCTCGCGT CTGGACCTAC ATTTGATTGA TAACCATATT 1980 

20 GATTAATATG CTGTTGGCTT TGACTCGCTG TGTAGTCATT GTTATCTCCT GCTGTTGCTG 204 0 

GATTCACATA TGTTTGGCCG CTTCCATTGG CATTTGCATT TTTTGGATAA CAGTTATAAA 2100 

AATAATGCGT ATGTCCTTGA GCATCTACGA ATGTATAGCT ATATTCTTTA TTATCAAACA 2160 

TTGCTTGATT CCAGTTACCA TCAGGTGTGT GATGATAATC CCCATTAGAA TCAATTGTAT 2220 

AATAAGTACC ATAAGATACG TCTTGTGATT GTGTTGACAT TTGTGTATGT GCTTGTTGGG 2280 

TGTTTGTCGT TTCTGCTGCA TCTGCTTGAT GCGCCATTGT AAATGTAGCG AAsCCATCGT 2340 

TGCAATCGCT GCTGCTGTTA ATTTTTTCAT GTATAAAACA TCCTCCATTA AAGTTAAAGT 2400 

TAGTTTTCAA TTAAACTGTA CTGCACATAC TAAAAGAATT AGACAACTGA GTAAAGGATT 24 60 

55 TAATTCTCAT TTTCCAACTA TTTAATATTC CCGAAATGTT TTACTAAACT CATTACATTG 2520 

TCATTACAAA ATAGCCATAC ATTGATATTA AAATGACATC TCTCACTGCA TTCGTTTAAC 2 580 

CTTTTATAAA TTTTCAAAAT TAACAACTAA TCGTTCGTCC ATGTTCGCGA TTCAACGCTA 2640 

ATGCATGATA GTAATCATCC ATCAAATCAT ATCAACCAAA TTCCATTATC AATCGCTATT 2700 

GATTGTCATT CAACTTTCTA ATAGTGATAT GCTTCTCAGG CTTAAAAATC GTCATATCTT 2760 

TTCTATTAAT TAAATCATCT GTGAGCTTTA ATGCTACTAA TTCATTGCTG CCATAATACT 2820 

TAATATATAA CX3TTCTTGTA GTTAAATTTA TTACGGTCTG ATACATCGTA TAGTGATTGT 2880 

CAGCATCATG CGGACGTACA ATTCCAATCG GTATATTTAC CGCATCTAAT AAATAAAATG 2 94 0 

CATTCATTAA ATCCATTTCT TTATCATTGT TTTGAGCAAT GTTTGCTTTC ATAAATGCCA 3000 

TTCTCACAAA GCGCTCAGTT GAAGTAAATC CACCTGGCAA TCCAAATGTA CCTGCTTCAT 3060 

TGCCTAAAGG TTCAATCGTT ACACCTTCCA ATAAATTTGC TGTTGCTGGA TAAGGAGAAA 3120 
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CACCAATAGG ATTATCTTTT ATAACCACTT CACCCTCTTT AAATGAAACT TCGACTGTAT 3 24 0 

GTCCAGTTGC ATCGGAAACA TGATAATGCA ATGGCGGAAC TTCACCGATG TCATTTAAAT 33 00 

ATACAGCTAC AACATGTATT TGGGATGCTT GTTGTTTCAT ATCTTCAATG CTTGTTGTAT 3 360 

ATCCCAAAAT CCATGTCACA ATTTCATTTT GCGTAATATT CATCGCGTCC GCTTTGTGTG 3420 

TTGATCCATA TGAACTATAA CCTCGGAAAT ATTGTGTTGA AATGGCAACG CcATGTtCAT 34 80 

TAACACCATC ACCATAAATA AAACCTTCCA TATCTGTTCC TGTGCCAATA AAGCCATATT 3 54 0 

GCGTTTGGCC TGTCGTGCCA GTGCAAGATT TCCAACGATA ATTTCTAGGC GTCACTGCTG 3600 

GCGAACCATC TAATGGATAA TCATAATCCA TCGTGCGTCC AAGAAGTACT TGATTATTTA 36 60 

AAGTTTGTAT TGTGAATCCT GTGCACATTG TTCTCACTCC TCTGTACCTT CATTTACTTT 3720 

AATCACTTTC AAATAAAGCT GTTTCACTTA AACATACTAT AAAAAATCAA TTATACAAGC 3 780 

20 AATTAATTGA TATTCATTCT CAATAACTGT GGTATGATAT GTAAGGAAAT CATGACTTAT 3 840 

GTGTGAGTGA ACGATCATCT ATACATCCGT TCACTTCATC TCATGACTTT CTATATTTAA 3 900 

TTTTTACAAG GAGTGACATC TGTGAATAAC ACACAATCTT CACCACGCAG TAATATTATT 3 960 

ATTGCGATTA TGTTGTCTGC ATTAACATAT TGGTTGTTTG CACAATCATT TATTAATATA 4020 

GGACCTCTCG TTGGTCAAAC ATATCAAACC TCTCCTGCCG TGTTAAATTT ATCTATTAGT 4 080 

TTAACTTCCT TCGCCACAGG TATCTTCATG GTGGCTGCAG GTGATATTGC TGATAAAATA 414 0 

GGACAACTGA GAATGACATA CATGGGTCTC ATAATCAGTA TGTTTGCATC TCTTCTATTA • 42 00 

ATTATATCGG ACATCACTGC ACTGCTCATC ATCGGTAGAA TTTTACAAGG TCTATCAGCA 4 260 

GCTATCTTGT TACCTTCAAC AGTTGGCGTG TTAAATAATC AATTTAAAGG AGAACATTTA 4320 

AGACGAGCGA TTAGTTATCT AATX3ATTAGT ACTGTTGGTG GCATCGGCCT AGCTGGTGTT 4380 

ATCGGCGGTT TAATTGCCTC AAATTTCGGA TGGCAAACGA ATTTCATCAT TAGTATAGTC 444 0 

40 ATTGCTTTCA TTGCCATATT GCTTCTAAAA GGCACACCTG AAAAAGTAAG TCAACATAGC 4500 

CACCGTCATC CATTCGATTA CAAAGGTATG TCGATTTTCG CTGTTATGAT TGGTAGCTTT . 4 560 

ACATTATTGT TAACACAAGG ATTCGAACAA GGTTGGTTTA GTACATTTTC AATCATTTGT 4 620 

CTGAGCATTT TTATCaTCAc TACGTTGATA TTCATCATCA TCGAACGTCG ACATGAAGTA 4 6 80 

CCTTTTATTG ATTTCTCAGT ATTACGCAAC CGTCCGTTCA TTGGTGCATT TTTAAATAAC 4740 

TTTGTTTTAA ATAGCGGTCT AGGCGTAACA GTGGTCTTTT TCATATATGC TCAAACACAC 4 800 

CTTGGTTTAt CAgcTGCGCA ATCTGGACTT GTTACATTGC CATATGCCAT TGTGGCAGTT 4 860 

GCGATGATTC GTTTAGGTGA AAAAGCAACA TTACGTTTCG GTGGCAAATT GATGCTCATC 4 920 
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TCACAATATG 


X V>X\ X X w wx\Vj X 


TATt-AX xGGT 


TT CGTCATAT 


GTGCGATAGG 


TAATGGTTTA 


5040 




GTCG CAACAC 


CTGGACrrTAf 

* X X w 


yAKk X\>(«AAX X 


X X UAG 1 ATGC 


CTAATGAAAA 


AGTTGGTTTA 


5X00 


s 


GCTACAGGAT 


TATATAAAAT 

X ^^X #\ X t^J'\t\X^ X 


vxAVj XA>Vi7 X A\JA 


X XAGGTGGCT 


CCTTTGGTAT 


AGCACTAAGT 


5160 






X X X O X X 


Ar*A Ar'^A A Ar^ 
Ak—AAV. X AAAv» 


TAnr^/^A/^/^ik Ik. 
X A X Gv-AQTuAA 


GTGTAGCTGC 


AACCGTAACA 


5220 


10 


T'T'T* A A T* f** 




GA X\7AX C_ u X X 


GGCTCATTGT 


CTGCATACAT 


GATCATTCCA 


5280 


AAAAPArSTT'A 
nrvrv\V_.Av9 x x /\ 


AAX V. X X AAA X 


ATAATAGAAG 


AATTATGTTT 


CGAAATATCT 


TTATCACTTT 


5340 




A A A ATT" ATHT 


A^ A A/^ A A A*TT^ 


CAAGAAAAAT 


AAGCGAACTG 


AATAAATAAA 


GATTCAATTA 


5400 


15 




AX X AGvaA X X C 


ACTCTAAAAC 


GATTAATAGT 


TTTATAAGAA 


GGTGTTTGAT 


5460 






L-t-ACATCAX X 


CGAATACTGT 


CATGAAGTAA 


TTTCTCTATT 


CTACGACCAG 


5520 




AAA ATA/^ A/^A 


X XQjAGTATAT 


GCATATAAGA 


TGATTTTTAA 


CATCATCTTT 


GGATGATAGG 


5580 


20 




A t- vaATGATGT 


CTGAATT CAT 


CGAATTTGCT 


ATCAGGTATC 


GTTTCAACAA 


5640 




ill 1 1 >\/\c 


ATGTCGCGAA 


ATATCATTTT 


GAGGAATTCT 


AACAGAAGTT 


TTTATTGGTA 


5700 






GvjCAAAGTGT 


CTTAi JL I l i I 


TAAAGTATTT 


CAAAGTAAAA 


TTACATGTTA 


5760 


25 


A T A r^^^T A 1^ nr* a 


X X AATGGCGA 


GACTCCTGAG 


GGAG CAGTGC 


CAGTCGAAGA 


CCGAGGCTGA 


5820 




VjAU. Vovj L. V. 


X AGGAAAGGG 


AAGCCATTCA 


ATACGAAGTA 


TTGTATAAAT 


AGAGAACAGC 


5880 


30 


A /^T* A A A T* A T* 


TTTCTAATTG 


AAAATTATCT 


TACTGCTGTT 


TTTTAGGGAT 


TTATGTCCCA 


5940 




v^XAAX XAXAX 


X GACTATCAA 


TTAGACAAAA 


TGGCCATTTT 


CAAATATCAC 


6000 




GCGTTGTTTC 


TGACCTTGAA 


TATATTTATT 


ATAATTCTCT 


TTTTGAAAAT 


CAGTTAACAT 


6060 


35 


TAATTTAGAT 


GTACCGTATT 


TTAACACTTT 


TTGCATTGTT 


TCTATTCTCA 


TTTTTCTAAA 


6120 




TAACCATCCA 


TCri-lTAACA 


W\ATAGGAl X 


AACAGCATCA 


TATGATAATT 


CTACTGTTTC 


6180 




TTTAATTTCA AATGTCTTGA 


ATV^ A A AT* A AT» 

A Xa^AAATAAT 


CGTGCACATT 


AAAAACGTAT 


CACCAAAGTA 


6240 


•w 


ATAAACATCT 


AAATCATCAC 


GTTTATGTTG 


TCCAACTAAC 


AAACGACCAT 


ATTCGAACTC 


6300 




TTTTTCTGCa\ 


TATTTCAATT 


r» Jk 9k Ik 9k ]k 7k / M|» 

GXAAAAAACT 


AATAATCTCT 


TCTTCTTTTA 


ATTTGAATTG 


6360 




CATTTAAAAA 


CATCCTCTCT 


TAAGTTTTAA 


CAAGCCTTAA 


nwT^ik Ik Ik Ik Ik nwvwff* 

TTAAAAA x X X 


TTTCAATCAC 


6420 


45 


ATAGTTCAAT 


ATACATCATT 


TCGTTATGTT 


TTTTAATACT 


TTGTTCAAAA 


ACAAATATTT 


6480 




TATTCTTTAA 


AATAATGACT 


TTTGTATTTT 


TAATCACAAT 


AAACATTTTA 


AAATTCTTGT 


6540 


SO 


TATCATAATC 


ATTAAAAGGT 


ATTAACCTTA 


ATAATATATT 


CTCTCGTCTC 


AACCTTAATC 


6600 




GTATACTTCA 


GACGTCTGTT 


TGTAGACAAT 


AAAAGTCATT 


CACGTCTTCA 


TATGTCATCA 


6660 




AATGTTTATC 


ATGATATGAT 


GAATATAATA 


ATCGGGTATA 


TAACTGTATG 


ATTAATTACA 


6720 
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TGGTGTATCA GTTACAACTG TGTCACATAT TTTAAATCAT AATGATAGTC GTTTTTCCGC 6840 

AACAACGATA AAAAACGTAC ATGCTGTTTC AGAACGTTTA GGCTATGCCC CTAATAAACA 6900 

TGCAAAACAA TTGCGCGGCA GTAAAATTCA AACTATTGGC GTCATTTTGC CTAGCTTAAC 6960 

AAATCCGTTT TTCTCAGCAC TGATGCAAAG TATTCATGAC CATAAACCAT CTGATGTTGA 7020 

TTTATGCTTT TTAACATCTA CAGCAACTGA TTTGTATGAC AATATTAAAC ATTTAATTGA 7080 

TCGAGGTATT GACGGATTAA TTATCGCACA ATACATATCA TCCCCGGACG CCCTAAATAA 7140 

CTATCTAAAG AAACATCATG TACCTTATGT CGTACTGGAT CAAAATGACC ATCAAGGCTA 7200 

TACAGATTTT GTTCGGACAA ATX3AATATCA AGGTGGACAA CTTGCAGCAC AACATTTAAT 7260 

AGAACTCGGT CACAACCATA TGATAATTGT TGCACCATAT GACATGATGG CGAATATGTC 7320 

GACTCGTGTC GCTGGATTTG TCGATACTTT GCGCGCGAAT CAATTGCCAG AACCACAAAT 7380 

20 CGTCCATACT GAATTATCTA AGCGCGGTGG GCTAACCATT GTTGATGACA TCATGGTTCA 7440 

ATCTGCCACT GCAATCTTCG CTATTAACGA TGAACTCGCT ATTGGCATTT TACGAGGACT 7500 

AATTGAACAT GGCATCAGTA TCCCGAAAGA TATCTCATTA ATAGGTTATG ACGACATTGA 756 0 

TTATGCAGCG TACGTCTCGC CACCTTTAAC TACTGTGGCA CAACCTATAA CTGATATTGG 7620 

CAAAACATCT TTAACCTTAT TACTTCAACG ATTACAGCAC TTAGATAAAT CCATTGATAT 7680 

GATTGAATTA CCAACGACTT TAAAAATTCG TGCAACAACT GGCTATCATC TTTCAAACTA 774 0 

ACTACGTATC TTCCGAAATA TACTCATCAT TGTTAGGCCC TTAGCGTTGC TTTAATGCTG 7800 

AGGGTTTTTA ATCATAATTA TTTTACTAAG AAATTAAAAT AATAATGTAT GAA TITXT AA 7860 

ATATGATTTA AACGTTTTCA GTTTTTATGA AAACGCATGC ATTTTACAAA TAAAAATGGT 7920 

ACGATGGCAC TGGTAAAACG TTTTACTAAA AACAAATCAT GAGGTGTATA ACATGAGCAT 79 80 

TGTTGCATTA CTTATCGGGT TAGGCCCCTT AATTGGCTGG GGCTTCTTCC CAACAGTCGC 8040 

40 TTCAAAGTTT GGTGGTAAAC CTGTACATCA AATTATCGGT GCTACTGTAG GTACGTTAAT 8100 

CTTCGCTATT TTATTAGCCG TAGTCACATC AAGTGGCTTC CCTACTGGAA. CCAATTTGCT 8160 

ATTCGCCTTA TTATCAGGTG CAGGATGGGG ATTCGGACAA ATCATTACAT TTAAAGCGTT 8220 

CGAATTAGTC GGCTCATCTC GTGCCATGCC AGTCACAACA GCATTCCAAT TATTAGGCGC 8280 

ATCTTTATGG GGTGTCTTTG CATTAGGAAA TTGGCCAGGC ATTGGTCATA AAATCATTGG 834 0 

ATTTACAGCT TTAGTCGTTA TTCTAATTGG AGCGCGTATG ACAGTTTGGA GTGAACGCAA 8400 

AGAAGCAAGT AACGCCAAAA ATTTACGTCG TGCAGTGGTA CTTCTGTTAA TTGGTGAATT 8460 

TGGATACTGG TTATATTCAG CTGCACCGCA AGCAACTTCT ATTGATGGCC TAACTGCCTT 8520 
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AGCAGAGAAT CCATTCCGTA ATAAAATTAC GTGGTTACAA ATTATTTCAG GTTTCTTCTT 864 0 

TGCATTTGGT GCTTTAACAT ATCTTATTTC AGCACAACCT AATATGAATG GTTTAGCAAC 8700 

TGGATTTATT CTTTCTCAAA CATCCGTTGT GCTTGCTACA TTAACTGGTA TTTATTTCTT 8760 

AAAACAACAT AAAACGTCAA AAGAAATGGT TATTACAATC ATCGGCTTAG TACTCATTTT 8820 

AGTAGCCGCT TCTGTTACAG TATTTATAAA ATAAGGAGTG TAGATGTCAT GAAAAAATCA 8880 

GCTGTTTTAA ATGAACATAT TTCAAAAGCA ATCGCGACAA TTGGTCATTT TGATTTATTA 894 0 

ACGATTAATG ACGCTGGCAT GCCAATTCCA AATGATCATC GTCGTATCGA CCTAGCTGTA 9000 

ACTAAAAACT TACCACGCTT TATTGATGTC TTAGCTACAG TGTTAGAAGA AATGGAAATC 9060 

CAAAAAATAT ACTTAGCAGA AGAAATAAAA GAACATAACC CTACACAATT GCAACAAATT 9120 

AAACAATTGA TTTCATCGGA AATGGAAATC ATTTTCATTC CTCACGAAGA AATGAAAAGT 9180 

AACTTAGCTC ACCCATTAAA TAAAGGTAAT ATTCGTACTG GTGAAaCAAC GCCCTACTCT 9240 

AATATTGCAT TAGAATCGAA TGTTACTTTT TAAAAGTTAT AACTTGAAAG GAGCGTACAC 93 00 

ATGACCAACA AAGTTGTTAT TTTAGGTTCA ACGAATGTCG ATCAATTTTT AACAGTTGAA 9360 

AGATATGCAC AACCAGGCGA AACATTACAT GTTGAAGAAG CACAAAAAGC ATTCGGCGGA 9420 

GGTAAAGGTG CCAACCAGGC TATTGCCACT GCACGCATGC AAGCAGACAC AACATTTATT 94 80 

ACTAAAATTG GCACTGATGG CGTTGCTGAT TTCATCTTAG AAGATTTTAA AGTAGCTCAT 9540 

ATTGATACAT CATATATTAT CAAAACAGCT GAAGCAAAAA CGGGCCAAGC CTTTATCACT 9600 

GTGAATGCAG AAGGACAAAA CACCATCTAT GTTTATGGTG GTGCGAATAT GACGATGACA 96 60 

CCTGAAGATG TTATTAACGC AAAAGACGCT ATAATCAATG CAGACTTTGT CGtTGCACAA 9720 

TTAGAAGTAC CCATCCCGGC TATTATATCT GCATTTGAAA TTGCCAAGGC ACATGGTGTG 9780 

ACGACAGTAT TAAATCCTGC ACCAGCGAAA GCATTACCTA ATGAATTATT ATCATTAATC 984 0 

GATATTATTG TGCCAAACGA AACAGAAGCC GAATTGTTAT CTGGGATTAA AGTAACTAAT 9900 

GAACAATCTA TGAAAGACAA TGCCAATTAC TTTTTATCTA TAGGCATTAA GACTGTTTTG 9960 

ATTACGCTAG GTAAGCAAGG TACAT A TTTT GCTACTAAAA ATCAAAGCCA ACACATCGAA 10020 

GCTTATAAAG TAAATGCGAT TGATACAACT GCTGCAGGCG ACACATTTAT TGGTGCATTT 10080 

GTCAGTCGCT TAAACAAGTC GCAAGATAAC TTAGCAGATG CTATTGATTT TGGTAATAAA 1014 0 

GCGAGCTCAC TCACTGTACA AAAACACGGC GCGCAAGCAT CTATTCCTCT ACTAGAAGAA 1020 0 

GTAAATCAAG TTTAAATGAA TCAAACACAG CTATGATATG AAGGTTTAGC ATATAACATG 10260 

CAACATTCGT ATATCATGGC TGTGCTTTTT TATCTTTATA AAACATCATC TATTAGAAAT 1032 0 
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mvrAATCT 


TTTTAACTTC 


CAAATTaTCG 


CATATAAATA 


TGCTATATTA 


ATGATAATAA 


TTATCAATTA 


AAAGGAGGTT 


ATGCTATGTC 


TAAAGAAGCT 


GCTCATACAT 


TTTTAGCTAA 


ATTAGGAAAA 


ACTCGTCTAC 


GCCCCGGTGG 


TAAAGAAGCA 


ACAGATTGGT 


TAATACAACA 


AGGGGCATTT 


TCACAAGATA 


AACAAGTGTT 


AGAAGTGGCA 


TGTAATATGT 


GCACAACATC 


TATTTATCTA 


GCTCATACAT 


ATGGCTGTCA 


CATTCAAGGC 


GTTGATATAA 


ATAAGAAAGC 


ATTAGAAAAA 


GCACAGGAAA 


ACATTTCAGC 


AGCAGGTCTT 


GAATCATATA 


TTCAAGTTCA 


ACAAGCGAAT 


GCTGTTAAAT 


TGCCCTTTGA 


TGACAATCAA 


TTCGATATCG 


TTTTAAATGA 


AGCAATGTTA 


ACAATGTTAC 


CCATCGCCAT 


AAAGGAA^VAA 


GCATTACGCG 


AGTACTACCG 


AGTCTTAAAG 


CCTGGGGGTA 


TCTTGTTAAC 


ACATGATATT 


GTCATCGTTA 


ATGAATCACA 


TGCCACACAT 


GTTGTTAAAT 


CATTATCTGC 


AGCAATTAAT 


GTCAATGTCT 


CACCGCAGAC 


GAAACTTGGC 


TGGTTAGATT 


TATATAATCA 


AGCTGGTTTT 


AATCATGTGC 


ATTATCATAC 


TGGTCCAATG 


AGTTTAATGA 


CACCAAAAGG 


TTTAATTTAT 


GACGAAGGTA 


TTGTTGGAAC 


TATAAAGATT 


ATCAACAATG 


CTTTGAAAAA 


AGAAAATCGA 


CCAATGTTTT 


GTAAAATGTT 


TAAAACGATG 


ACTAAATTGC 


GTAAAGATAT 


GAATTATATT 


ACTTTTGTCG 


CTAAAAAAGA 


GCACTAAATA 


TAATGCCACT 


AACTGTACTT 


TGTATCTATG 


TTTGACTATC 


ACTTTAATTT 


CTTTGTGACA 


CTAATCATCT 


ACTTAACAAT 


ATCGTTATCG 


TTGATTAGTA 


AGTCATCAAT 


TTTGGTTAAA 


GACTTTCATA 


AACACTCAAA 


CATTAACACT 


ATACATAGTT 


AGTGGCATTA 


n-i-i-i-iyCTn 


AAAATTTTAA 


CmTCmCGGGr 


TtGGGAmCrG 


AAaTGrtAwT 


TcGCrtnAACT 



TAwTcT 

(2) INFORMATION FOR SEQ ID NO: 445: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 
TTACATAGTT AACACTAGTT AATCTATTAG TTAACATTAG TTAATAATTA GTTAATTTCC 
ATTTGTATTC TCATGTGATA AATTCTAAAA GCATACaATA AATTTAATAT GTAAAAAGAA 
AGGGAATACA CATGAAAAAT AAATATATCT CGAAGTTGCT AGTTGGGGCA GCAACAATTA 
CGTTAGCTAC AATGATTTCA AATGGGGAAG CAAAAGCGAG TGAAAACACG CAACAAACTT 
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AAGTATTACA TCTAAAAGGT ATCACAGAAG AACAACGTAA CCAATACATC AAAACATTAC 360 

GCGAACACCC AGAACGTGCA CAAGAAGTAT TCTCTGAATC ACTTAAAGAC AGCAAGAnCC 420 

CAGACCGACG TGTTGCACAA CAAAACGCTT TTTACAATGT TCTTAAAAAT GATAACTTAA 480 

CTGAACAAGA AAAAAATAAT TACATTGCAC AAATTAAAGA AAACCCTGAT AGaAGCCAAC 540 

AAGTTTGGGT AGAATCAGTA CAATCTTCTA AAGCTAAAGA ACGTCAAAAT ATTGAAAATG 600 

CGGATAAAGC AATTAAAGAT TTCCAAGATA ACAAAGCACC ACACGATAAA TCAGCAGCAT 660 

ATGAAGCTAA CTCAAAATTA CnTAAAGATT TACGTGATAA AAACAACCGC TTTGTAGAAA 720 

AAGTTTCAAT TGAAAArGCA ATCGTTCGTC ATGATGAGCX3 TGTGAAATCA GCAAATGATG 780 

CAATCTCAAA ATTAAATGAA AAAGATTCAA TTGAAAACAG ACGTTTAGCA CAACX3TGAAG 840 

TTAACAAAGC ACCTATGGAT GTAAAAGAGC ATTTACAGAA ACAATTAGAC GCATTAGTTG 900 

20 CTCAAAAAGA TGCTGAAAAG AAAGTGGCGC CAAAAGTTGA GGCTCCTCAA ATTCAATCAC 960 

CACAAATTGA AAAACCTAAA GTAGAATCAC CAAAAGTTGA AGTCCCTCAA ATTCAATCAC 1020 

CAAAAGTTGA GGTTCCTCAA TCTAAATTAT TAGGTTACTA CCAATCATTA AAAGATTCAT 10 BO 

TTAACTATGG TTACAAGTAT TTAACAGATA CTTATAAAAG CTATAAAGAa AAATATGATA 114 0 

CAGCAAAGTA CTACTATAAT ACGTACTATA AATACCAAGG TGCGATTGAT CAAACAGTAT 1200 

TAACAGTACT AGGTAGTGGT TCTAAATCTT ACATCCAACC ATTGAAAGTT GATGATAAAA 1260 

ACGGCTACTT AGCTAAATCA TATGCACAAG TAAGAAACTA TGTAACTGAG TCAATCAATA 1320 

CTGGTAAAGT ATTATATACT TTCTACCAAA ACCCAACATT AGTAAAAACA GCTATTAAAG 13 80 

CTCAAGAAAC TGCATCATCA ATCAAAAATA CATTAAGTAA TTTATTATCA TTCTGGAAAT 1440 

AATCAATCAA AAATATCTTC TCTAGTTTTA CATC A TTTTT TAAATAATTT TCGTAACAAA 1500 

CCGTGATTAA AAAGAACCGT TGATTCTCAA TCGAATCTAC GGTTCTTTTT TCAITTTCCA 1560 

40 TCAATTAAAT GCTTCTTCGC TATTTGTCAG CCCACTTTTT TACCTGCAAC TTGTTAAATA 1620 

ATCCTTACAT CGTTAACGAA TAGTTCATCA TTTAGTTGAA TCAGCTCAAC TTTATTAACT 1680 

TCATATTTTC ACAAACTATT GCGCAATCCA TTCCTTTTCC ACTACAAGCA CCATAATTAA 1740 

ACAACAATTC AATAAAATAA GACTTGCAAA GCATAGTTAT GTAGCTATAT AAACGCCTGC 1800 

GACCAATAAA TCTTTTAAAC ATAACATAAT GCAAAAACAT CATTTAACAA TGCTAAAAAT 1860 

GTCTCTTCAA TACATGTTGA TAGTAATTAA CTTTTAACGA ACAGTTAATT CGAAAACGCT 1920 

TACAAATGGA TTATTATATA TATGAACTTA AAATTAAATA GAAAGAAAGT GATTTCTATG 1980 

ATTAAAAAtA AAATATTAAC AGCAACTTTA GCAGTTGGTT TAATAGCCCC TTTAGCCAAT 2040 
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CmTyCAAArG AcACaGACAT TACTAGCCAA CGATTTAGCT ATnACTCCAA ACCTTCCATT 2160 

GGATTTGGTA AAGGnT 2176 

(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 

IS 

AAAAGCATGG CTTAAATGAA GTACGCTATA ACAAATTACA AGAACATGCT ATTGTTATGC 60 

ATCCGGCACC TGTGAATAGA GGAGTAGAAA TACAAAGCGA TTTAGTAGAA GCTTCAAAAT 120 

2^ CAAGAATTTT TAAGCAAATG GAAAATGGCG TTTACTTAAG AATGGCAGTC ATTGATGAAT 180 

TATTAAAATA GGTAAGGGGA CGAAAATGAT GAAATTAATT AAAAACGGTA AAGTATTACA 240 

AAATGGCGAA TTACAACAAG CAGATATTTT AATTGATGGT AAGGTAATTA AACAAATTGC 300 

2S ACCTGCAATT GAACCAAGCA ATGGTGTTGA CATCATAGAT GCGAAAGGTC ACTTTGTGTC 3 60 

ACCTGGATTT GTCGATGTTC ATGTTCATTT ACGTGAACCT GGTGGTGAAT ATAAAGAGAC 420 

AATTGAAACT GGTACTAAAG CTGCTGCTAG AGGCGGATTT ACAACTGTAT GTCCAATGCC 4 80 

TAACACAAGA CCGGTACCAG ATTCTGTAGA ACATTTTGAA GCTTTACAAA AATTAATCGA 54 0 

TGACAATGCT CAAGTACGTG TATTACCTTA TGCTTCAATT ACAACACGTC AATTAGGTAA 600 

AGAATTGGTT GATTTCCCAG CACTAGTAAA AGAAGGTGCC TTTGCGTTTA CAGATGACGG 660 

35 

TGTAGGAGTA CAAACTGCAA GCATGATGTA TGAAGGCATG ATTGAAGCTG CAAAAGTAAA 720 

CAAAGCCATC GTAGCACACT GTGAAGATAA TTCATTAATC TATGGTGGTG CAATGCATGA 780 

AGGGAAACGC AGTAAAGAGT TAGGTATACC AGGTATTCCA AACATTTGTG AATCTGTTCA 840 

40 

AATCGCAAGA GATGTACTAT TaGCTGAAGC AGCAGGTTGT CATTATCATG TATGTCATGT 900 

TTCTACTAAA GAAAGTGTTA GAGTCATTCG TGACGCTAAA CGOGCAGGCA TTCATGTTAC 960 

^5 AGCTGAAGTT ACACCACACC ATTTATTGTT AACAGAAGAT GATATTCCTG GTAATAATGC 1020 

CATTTATAAA ATGAATCCAC CATTGAGAAG TACTGAAGAT AGAGAGGCTT TGTTAGAAGG 1080 

GTTACTAGAC GGTACAATTG ACTGTATCGC AACAGAcCAT GCACCACATG CACGTGATGA 1140 

so AAAAGCACAA CCAATGGAAA AAOCaCCATT CGGAATTGTT GGTAGTGAAA CAGCATTCCC 1200 

ATT ATT AT AT ACGCATTTTG TAAAAAATGG TGATTGGACA TTACAACAAT TAGTAGATTA 1260 
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TTATGCAGAT 


TTAACAATCA 


TTGATTTAGA 


TAGTGAACAA 


GAAATTAAAG 


GAGAAGATTT 


1380 




CTTATCAAAA 


GCAGATAATA 


CACCATTTAT 


CGGCTATAAA 


GTTTATGGAA ATCCGATCTT 


1440 


5 


AACAATGGTT 


GAAGGCGAAG 


TTAAATTTGa 


GGGGGATAAa 


TArTATGCAA gCAAACGTTA 


1500 




TCTAGTGTTA 


GAAGACGGTC 


TTTTACGAGG 


CTACCGTTAG 


GTCTGATAAC 


TTACTGA 


1557 




(2) INFORMATION FOR SEQ ID NO: 447: 








10 
IS 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 






20 


GCTAGAAATtti 


TTGmATGaCA 


ATACAACTCT 


GTTAAAaTGA 


TGGACGTAGA 


CAAATATGCG 


60 




TATTGACGCT 


ttattttaaa 


AATTaACATG 


CTTATAACAT 


GTTTATAGAA 


GGAGATTAAC 


120 




CTATGAACTA 


TCaAGTTCTT 


TTATATTATA 


AATATATGAC 


GATTGATGAC 


CtGAACAQTT 


180 


2S 


TGCTCAGGAT 


cacttagcct 


TTTGTAAAGC 


ACACCATTTA 


AAAGGTJJGAA TTCTTGTTTC 


240 




TACAGAAGGT 


ATTAACGGCA 


CATTATCTGG 


TACAAAAGAA 


GAAACCGAAC 


AATATATGGC 


300 




ACATATGCAT 


GCCGATGAAC 


GATTCAAAGA 




AAAATTGATG 


AAGCTGAAGG 


360 


30 


ACATGCTTTT 


AAGAAAATGC 


ATGTACGTCC 


TCGAAAAGAA 


ATCGTTGCTT 


TAGATTTAGA 


420 




AGATGACGTC 


GATCCAAGAC 


ACACAACTGG 


CCAATATTTA 


TCACCTGTAG 


AATTTAGAAA 


480 




AGCTCTTGAA 


GATGATGACA 


CAGTCATTAT 


TGATGCACGT 


AATGATTATG 


AATTTGATTT 


540 


35 


AGGTCATTTC 


CGAGGTGCAA 


TTCGTCCAAA 


TATCACACGT 


TTTAGAGATT 


TGCCTGACTG 


600 




GATTAAAGAG 


AATAAAGCGT 


TATTTGCAGA 


TAAAAAAGTG 


GTTAOGTACT 


GTACTGGTGG 


660 


40 


CATTCGATGC 


GAAAAATTTT 


GTGGATGGCT 


TTTAAAAGAA 


GGTTTCGAAG 


ATGTAGCTCA 


720 


ACTTCATGGC 


GGTATTGCTA 


CATATGGTAA 


AGATCCTGAA 


ACAAAAGGTG 


AATATTGGGA 


780 




CGGTAAAATG 


TACGTATTTG 


ATGACCGTAT 


CAGTGTTGAT 


ATCAACCAAG 


TTGAAAAnAC 


840 


45 


AATTATTGGT 


AAGGATTGGT 


TTGATGGCAA 


ACCATGTGAA 


CGTTATATTA 


ATTGCGCTAA 


900 




CCCAGAATGT 


AATAAACAAA 


TATTAGTTTC 


TGAAGAAAAC 


GAAACTAAAT 


ATTTAGGTGC 


960 




ATGCTCTTAT 


GAATGTGCTA 


AACATGAGCG 


TAATCGTTAT 


GTTCAAGCAA 


ATAATATTAG 


1020 


SO 


TGATAATGAG 


TGGCAACAAC 


GTTTAACAAA 


CTTTGATGAT 


TTACATCAAC 


ATGCTTAGTT 


1080 




TTAATTAAAT 


ACCTTTCAAA 


ACACGCTTTG 


AAAATCCGAT 


TTATAAAGGT 


TTTTCAAGGC 


1140 
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TAAATTTTAA TACTGCGGGG TGTCTTAAAA TGCACATTTT AGTAACAGGG TTTGCGCCTT 1260 

TTGACAATCA AAATATCAAT CCCTCATGGG AAGCTGTGAC TCAACTAGAA GATATTATTG 1320 

^ GCACACATAC AATCGATAAA TTAAAACTAC CAACCTCTTT TAAGAAAGTA GATAATATTA 1380 

TAAATAAAAC GTTGGCATCT AATCATTATG ATGTTGTACT AGCTATAGGA CAAGCTGGTG 144 0 

GTAGAAATGC CATTACCCCA GAACGTGTCG CCATTAATAT TGATGATGCA CGTATTCCAG 1500 

10 

ATAATGATGA TTTTCAACCT ATTGATCAAG CCATTCACTT AGACGGTGCG CCAGCTTATT 1560 

TTTCAAATTT ACCaGTTAAA GCAATGACTC AAAGTATTAT TAATCAAGGA CTTCCTGGAG 162 0 

CACTTTCAAA TAGCGCAGGT ACATTTGTTT GTAATCACAC ACTTTATcAC TTAGGTTATT 1680 

15 

TACAAGATAA GCATTACCCT CACGTACGAT TCGGATTTAT TCaTGTGCCA TACATACCAG 1740 
AGCAGGTcAT TGGTAAACCC GATACACCAT CTCATGnCCA TTGAGGAAAA GATnAGTTG 17 99 

(2) INFORMATION FOR SEQ ID NO: 448: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
2$ (D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 



30 


ACTTGGTTTT 


TTATTGTTTA 


TAAATAAAAC 


TCACTTAATA 


ATGTTTTCAT 


AATCTTCTTC 


60 




GACTACTTAA 


TTCTTTAAGA 


TATTCGTGAA 


AAGAGACATT 


ACACTAGTTA 


ATTTTCAAAC 


120 




AATACAAAAA 


GCGTCTACCT 


CCTACATATA 


ATTGTAGCGG 


AGATAGACGC 


TTAATATTTA 


180 


35 


TTTAAAAATT 


ATTTTAAACC 


ACCGAATGTC 


ATAACATCAC 


GGGCAATCAT 


AcrrrciTCA 


240 




TCTGTTGGAA 


TAACGACAAC 


TTTAACTGGT 


GAATGAGGAT 


AGTTAATAAA 


TCCTTCTTTA 


300 


40 


CCACGTAGTA 


AGTTTTCATT 


TTTCTTAGGA 


TCCCAGTAAA 


CACCCATAAA 


TTCTAAGCCT 


360 


TCAAGAACTT 


TCGCACGAAT 


TTCTACTGAG 


TTTTCACCGA 


TACCTGCTGT 


AAATACGATA 


420 




ACATCAACAC 


CATGCATTCT 


CGCAGCATAT 


GATCCAATAT 


ATTTGTGAAT 


TTTAGAAGCA 


480 


45 


AATACATCTA 


AAGCCATTTG 


TGAACGTGCT 


TTACCTGATT 


CAGCXTCTTC 


TGATAAGTCA 


540 




CGTAAATCAC 


TAGATGTACC 


TGATAATCCT 


AATAAACCTG 


ATTCTTTGTT 


TAAGATTTCC 


600 




AATACTTGTT 


CAGCAGTTTT 


ACCTGTTTTT 


TCCATAATAA 


ATGGAATTAA 


A6CAGGGTCA 


660 


50 


ATATTACCAG 


AACGAGTACC 


CATTGTTACA 


CCAGCAAGTG 


GTGTGAAGCC 


aTTGATGTAT 


720 




CAATAGATTT 


ACCGCCATCG 


ATAGCTGCAA 


TTGATGCTCC 


ATTACCAATG 


TGACATGAAA 


780 
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TATGGCTTGT 


ACCATGGAAA 


CCATACTTAC 


GAATGCCATA 


ATCTTTATAA 


TAATGATATG 


900 




GCAAGCTATA 


TAGATATGCT 


TTTTCAGGCA 


TTGTTTGATG 


GAATGCTGTA 


TCAAAAATTG 


960 


5 


CCACATGAGG 


GATATTTGGT 


AATAATTTAC 


GGAAAGCACG 


AATACCCAtC 


AAGTTaGCTG 


1020 




GGTTGTGolAG 


CGGTGCTAAT 


TcGCTTAATT 


CTTCAATTTC 


CTTTTCAACC 


TCATCAGTAA 


1080 


10 


TAGCTACTGA 


TTCAGGGAAT 


TTTTCACCAC 


CATGTACAAC 


ACGGTGACCT 


GTTCCATCGA 


1140 


TATCGTTAAT 


ATCATTAATA 


ATATTGTGCG 


CTTTAAAAGC 


ATCCAACATG 


ATATCAACTG 


1200 




CCTCAACGTG 


ATCCTTGATA 


TCTTGTACTG 


TTTTAACTTT 


TTCCCCGTTG 


ACTTCAATTG 


1260 


75 


TAAAAATTGA 
CAGGCATTCT 


ATCCTTCAAT 
AATTAATTGA 


CCGATTCTTT 
A 


CTACTAAACC 


TTTTGTTACT 


AATTCCTCTT 


1320 
1341 




(2) INFORMATION FOR SEQ ID NO: 44 9: 








20 
25 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : double 
-(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 9: 








TTTTGAAGAT 


ACTACCGATG 


AAAATAGACA 


AAAGATTTTT 


CAATATTTAT 


CACCTGAAgA 


60 


30 


AGTTGCAAAT 


TTCTTTGATC 


AATTAGATAT 


TGATGACGAT 


GAATATGAGT 


TGCTATTTGA 


12 0 




TAAGATGAAT 


GCGACATACG 


CAAGTCACAT 


ATTAGAAGAA 


ATGTCATACG 


ACAATGCAGT 


180 


35 


AGATATTTTA 


AATGAGTTGA 


CTAAACCAAA 


AGTTGCTAGT 


CTTTTAACAT 


TGATGAATAA 


240 


AGATGACGCG 


AATGAAATCA 


AAGCATTACT 


TCACTATGAT 


GAGGATACGG 


CCGGCGGTAT 


300 




TATGACGACG 


GAgTATTTAT 


CACTTAAAGC 


GCATACGCCT 


GTTAAAGAAG 


CATTATTATT 


360 


40 


GGTCAAAGCG 


CAaGCACCAG 


ACGCAGAAAC 


AATATATGTT 


ATATTTGTCG 


TTGA*tGATGA 


420 




TGGTAAATTA 


GTAGGTGTTT 


TATCGCTAAG 


AGATTTAATT 


GTAGCTGAAA 


ATGATGCTTA 


480 




TATTGAAGAT 


ATTATGAATG 


AACGTGTCAT 


TAGTGTGAAT 


GTAGCAGACG 


ACCAAGAAGA 


540 


45 


TGTTGCTCAA 


GTTATGAGAG 


ACTATGATTT 


CATGGCTGTA 


CCTGTTATAG 


ATTACCAAGA 


600 




ACATTTGCTT 


GGTATCATCA 


CGATTGATGA 


TATTTTAGAC 


GTTATGGATG 


AAGAGGCTAG 


660 




TGAAGACTAC 


TCTCGTTTAG 


CCGGGGTATC 


AGATATCGAT 


TCGACTAATG 


ATTCAATCAT 


720 


SO 


TAAAACAGCA 


TTAAAACGTT 


TACCATGGTT 


GATTATTTTA 


ACATTTTTAG 


GAATGATTAC 


780 




TGCGACAATT 


TTAGGGAGAT 


TCGAAAAAAC 


ATTAGAAAAT 


GTAGCGCTAC 


TCGCAGCGTT 


840 
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TCX3TAACATT ACGAcAGGGG AAATTAATGA GCAAAGTAAA TTTAGAATTG CATTAAGAGA 
AGCAGGAAGT GGTGTATTAT CGGGTGTTGT ATGTTCAACA ATATTATTTA CAATTATTGT 
TGCAATATAT CATCAGCCAC TTTTAGCATT AATCGTTGCA GGAAGTTTAA CTTGTGCGAT 
GACGGTGGGG ACGTTTGTAG GTTCGATGAT TCCATTATTG ATGAATAAAT TAAATATCGA 
TCCAGCAGTG GCTAGTGGAC CATTTATTAC AACAATTAAT GATATTATTA GTATGTTGAT 
TTATTTTGGT TTAGCTACAT CATTTATGGC TTACTTAATT TAAGGAGGAG TTATGGAGTT 
TTTATCTTTA GTTATTGTTG TTTTAGCAGC GTTTTTAACT CCAATAATTG TCAATCGATT 
AAATATTAAT TTCTTGCCAG TTGTTGTTGC AGAAATTTTG ATGGGGATTG TGATTGGAAA 
TTCATTTCTA AATATAGTAG AAAGGGATTC AATTCTAAAT ATTTTATCAA CGTTAGGCTT 
TATCTTTTTA ATGTTTTTAA GTGGTTTAGA AATTGATTTT AAAGCTTTTA AAAAAGATAA 
ACGCGCACGT CAAGGACAAA ATGATGATG 
(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 450: 
TTCTGGAAAC CAAAGTATTG TCATCTTCTA CTAGTAGTAT nGGCATCCAT ATCACCCAAT 
ATCATTTAGT ATATTTCATA TTTTCTCCTG ATTTTAATCG ACTTTGAAAA TCTTTAATCC 
GGCAGTCAAC TTCAAAGCCA TGAATCATCA TTTTTTGCAT TGGTGCTTGT ATAAAGTAAT 
AAATCGGCCA AAATAATCGA GGGATATAAT CGTATAGATG TATATAAACG ACTGCCGACT 
CTTTGATTAA TCTAAATTCT AACTTCCCTT GATTAACTGT ATATTTTTTC ACTAAACTTC 
CACTCAATAA AATTAAAGTT ATTATTCCAT CAGCTGTTTC TTCTATTTTA AATACTGCGA 
GCGGTCGCAC CTTATTCTTA ATATATATCT TAAATTGATC ATGTGATTTT TCTGTTTTCA 
CAAAAGTTCC TTTAGTGTAA CCCATCCATG CAATAAAATG GTTTACAACG TTCTTTAATG 
TCCATCCCTT TGGTAAAACT ACCTTCATCG TTGATCTAAC ATCATCATAC TTTGAAACTT 
GTAATTCTAC ATTAACTAAA QAACGTTTAA AAACTAAATT TGTTGTTTCT ACAGGTGTAC 
CATATGCACC TAGGCGTTCA ATTGTTTCAT TATCATAACG ACTCCCAGGT ATGTAGATTA 
CTTTTTTTAC TTGATTGATA GCCGCTGCTC GACCAAAATT ATCTGCTGCG ATTAATGTTA 
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CAGCAATATC TATTTGATTC ATTGCTGCAA CAACCTGTTC GTAATGAAAT ATATCACACT 

GAATCCAAGT CATTTCAACA TCATCTGTTT kTTTATTGTC TGGATATTTT GATATAGCAA 

5 AAAGTTCAGC ATCATTTTCA ATCACTTCAC TTAAATACTT ACCAATATAT CCTGTTCCAC 

CTGCTAATAA AACTTTAGGT TTCATCTAAA ATACTCCTTT AAACTGTAAC CAAAAAACAT 

ATTACTCCAC CTTTTAGTTA CATATATATT ATAATAGTAG CAAATGTTTT AAAATTTCAA 

10 

AATACTGGAG GCTTTTTATG GCCCATATTA TACGTAGAGT TAGTATCAAA GATGTAGAAA 

ATTTCATTTC AATGTTAGCG AACATATACG ACGAATCTCC GTATATGTTC TACACACCAG 

GAGAATATGA TCCTAGCGTC ACATCGGCTA GTAAACAATT AGAAGAATAT ATCACTTCTC 

CGCATAAAGT CATCTTCGTT GCTGAAAGTG ATGAACAACT CGTTCGCTTT GCCTTTGTTA 

ATACGACACC ATTTCAACGC ATTAAACATG TTGCTAAAAT TGATTTAGGT GTAAAGAAAT 

TATATCAACA TCGTGGAATT GGCCAAGCAC TTCTTGATGC CATTATGGCT TGGTGTTTAA 

ACAATCAAAT ACACCGAATT GAAGCAAATG TACCACTCAA TAACCAACCT GCCCTCGAGC 

TTTTTAAAAG TGCCGATTTT CAAATCGAAG GCGTTTTAAA AGATAAGTTA TTTATCGATG 

2s GTAAATATTA TGATGACTAT ATGATGGCTA AAATTCTTAA TTAAAGCTAT TTTATCATAA 

TCTTGTATCA GAATCGTATA ACAACGAATT TAATGGTTAC CTAATACATT ACTCATACTT 

ATCAATGTTA TCTAATCTCA AATAAATACG TACACTCTTA TTCATTTATC AAATTTAAAT 

TCAAAATAnA ACACCACTAA TGTGTAATTG ATTAACTATC AACTACGATT AGTGGTGCTT 

TATATATGTG GTTAGTTTTC CTnACTA 

(2) INFORMATION FOR SEQ ID NO: 451: 



30 



35 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 
ATATAGATTA ATGTTTGTTA TTTGTACTGT GTTTGCCAGT GACATTTATT ATAAAACATA 60 
TGACGTTATT CTATGkTGTA CTCATTtTCT ATATTGTAGG TTTATTAACG ATTAGAAGTA 120 
TTATTAAAAA GTTGAAATAT CAGGAAACAT TATTACGAGA CTAAAAAACT TCCATTGGCA 
50 TGTATGTTGT AAAGGTGCAT GTAATGTTGA ACGCCAAATG ATACGGCGTT CAGATTACAT 



leo 



240 



TAGCATCTAT ACGTTAACAG CATAACCAAT GGAAGTTTTC TTCGAATCTA TTCTTTTATT 300 



BS 
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AATGCX3TCTT TTGAAAAATG GTCATTAAAG GCATCAGATT GCTTAAAGTC TTCGTATGCA 420 

TGTCGATCAG CAAATCCGAA ATAAATTTTG TATGTTGTAC CTTTAGCAGG TCTTAACAAA 4 80 

CGATAGCTTT TAAAGCCACC AAAGTTTCTG AAATTATCGT CTACACTAAT CAGTTTCTTT 54 0 

TCAAGTTGAT ATGCATGATC TTCTGTTGAT GGaATGAAgA TTGCACaATA GAAATGaTGT 600 

TCACtGAATT CACCAA 616 



10 

(2) INFORMATION FOR SEQ ID NO: 4 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 944 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 



35 



40 



45 



GCACGAGTGA 


TTAAACGGTT 


AATCAATGAA 


ACATTTGATG 


CAAATTACAT 


TGAAGTTATT 


60 


GAGGGAGGAA 


TTGAAGAAAC 


GCAAACGTTA 


ATTCACTTAC 


CTTTTGACTA 


TGTCTTCTTT 


120' 


ACAGGAAGTG 


nAAATGTAGG 


CAAAATCGTT 


TATCAAGCTG 


CCAGCGAAAA 


TTTAGTTCCT 


180 


GTGACATTAG 


AAATGGGCgG 


TAAAtCTCCa 


GTCaTCGtTG 


ATGAAACAGC 


GAATATTAAA 


240 


gTTGCTAGTG 


AGCGCATTTG 


TTTTGGGAAA 


TTCACTAATG 


CCgGCCaAAC 


ATGTGTTGCa 


300 


CCAGATTACA 


TTTTAGTACA 


CGAATCTGTA 


AAAGATGATT 


TAATCACAGC 


CCTATCAAAA 


360 


ACGTTGCGTG 


AATTTTATGG 


TCAAAATATA 


CAACAAAGTC 


CAGATTATGG 


CCGCATTGTA 


420 


AACCTTAAAC 


ATTATCATCG 


TCTGACTTCA 


TTACTTAACA 


GTGCACAAAT 


GAATATTGTA 


480 


TTTGGTGGTC 


ATAGTGATGA 


GGATGAACGT 


TATATAGAAC 


CAACATTGTT 


AGATCACGTT 


540 


ACAAGTGATT 


CAGCAATTAT 


GCAAGAAGAA 


ATTTTTGGTC 


CTATCTTACC 


GATTTTAACG 


600 


TATCAOTCAT 


TGGATGAAGC 


AATAGCCTTT 


ATTCACCAAA 


GACCAAAACC 


TTTGAGTTTA 


660 


TATTTATTTA 


GCGAAGATGA 


AAATGCTACA 


CAACGTGTAA 


TAAACGAGCT 


ATCATTTGGC 


720 


GGCGGCGCTA 


TTAATGATAC 


ATTGATGCAC 


CTAGCGAATC 


CTAAATTACC 


ATTTGGTGGT 


780 


GTTGGTGCCT 


CAGGTATGGG 


ACGCTATCAT 


GGTAAATATT 


CATTCGACAC 


TTTTACACAT 


840 


GAAAAAAGCT 


ACATTTTCAA 


ATCnACACGA 


TTAGAATCAG 


GTGTCCATTT 


ACCACCATAT 


900 


AAAGGTAAAT 


TTAAATrCAT 


CAAAGCTTtC 


tTTAAAnATT 


AAIT 




944 



(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4820 base pairs 



55 



1365 
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(C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 



5 





(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 453 : 






TCTCCAGTAG 


1 ACCTTGTGTA 


. TGAACAGTTT 


CTTTCATATG AATGAACATC GTTTTTAAAG 


* 60 


10 


TTTGTTTCAA 


. CTCAGCCTTA 


TCAGGATAAT 


ATCTAGAGAC AGTCGTCTCT GGCATCTCCA 


120 




TTGTATGATA 


TTTAACCTTA 


TGCAGCGACC 


CATGATTTTC GTTATAAACA AATGTATGAT 


180 




TCACTTCATC 


GAAATCATGA 


TCTTCTCCTG 


CAATCCAAAA AACTGGTACT ACTTGTTGCT 


240 


IS 


TATGTGTATC 


CGTTAATTCC 


TTAGATAAAG 


TAATGATTGa AAATATTTTA TGGAATGTAT 


300 




ACAATGGTCC 


CCCGAAAAGC 


CCTGCTTGTT 


GTCCACCAAT CACAACTTTT GAACCATTAG 


360 




CTAAATGTTG 


TATGTTTAAT 


TCTTGTTCAC 


TTGAAAGCTT GAAAGCTTTA AATCACTCAT 


420 


20 


ATATTCACGA 


ATAACATTCG 


CTAATGCCGC 


TTCTCTTCCA TTATTTTCTT TAGACATCCT 


480 




TTTU-rCAAAA 


CTAGTTTGTT 


GAGCTGCATC 


ATATTGAAAT AATCCTGTTA TTACAGGGTC 


54 0 


25 


ACTGCTCTTT 


ATTTTTGGTA 


TAAACTGATC 


TTTTTCATTT AAACTAACTA CTTTACAGTC 


600 


CATGTTTTTT 


CTCCTTAAGT 


ACGCGATTAC 


AAACTATAGT ATAAAGTCTA TACCGGTGAT 


660 




TGACAATTTT 


ACGGCTTGAA 


AATCAATTTA 


ATCATGGAAA ATTTATAATA TTCATTGTTT 


720 


30 


TACATTTTCA 


AATCAATGAA 


AAACACAAGT 


GGTTTAATGT ATAATAATAG TAGTAAACAA 


780 




ATAAGGGGTA 


GATAAATATG 


AGTGAAATCA 


AACGTCTTGA AATTAATTAC AAAACTGACG 


840 




AATTATTCGA 


AAACTTTAGA 


GCGTTTGGCA 


ACAAAGACTT ATACATGGTC AATGAGTTAA 


900 


35 


ACGGTCAAAT GATTGATGCA AGTTCAGATT 


CACCATTTTA TGGCATATTT GTCGaGATCA 


960 




ATTAGGAGCT AGAATGGCAT TACTAAAAAA 


AGGTQATGTC GAAGAAATCT ACTTCCCAGA 


1020 




TTTTUAAGAT 


TATATATTAT 


TATGGAAGTT 


AGAAGTATTA CCAAAATATC AAAACAGAGG 


1080 


40 


GTACGCTTCA 


GAATTGATTG 


ATTTTGCAAA 


GAGTTTCAAT ATGCCAATTA AAGCCATTGG 


1140 




CAGAAATGAT 


TCTAAGGATT 


TcrrrrrACA 


TCATGGATTT ACAGATGTGG AAGCTAAAAA 


1200 




TATAGAGGGA 


CATGATGTCT 


TATTGTGGAA 


ACCATAAGAT AATAATATTC GACACTACGA 


1260 


45 


GCATGAAAAT 


GCATCTTTTC 


GTAGTGTCTT 


TTTTACAATT ACTTTCTTAA GCTAATATAA 


1320 




GTaAATCATT 


TTCAAATTAT 


TTGTCTTAAC 


GTACAATATC ATTTAGTTGT TTCCATGrAT 


1380 


SO 


TAATTTCATA 


ATCAGGTATA 


ATTCCTGGAT 


TATGATCAAA TCCTCTAAAa TTAAACCAGC 


1440 


AAGTAGCTAT 


ACCcGCATTG 


ATTCCACCTA 


GAATGTCAGA TGTTAnAGAA TCTCCaACTA 


1500 




TAATCGAGTG 


CTGtCTTTCA 


TCCTCACCAA 


TATCATTAAA AACATAATTA AAAAATTCCG 


1560 
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ACGGCGTCTG ATTTAACCTT CTCTTTTGCG TTTCGGTTAC ACCATTAGTA ACAATATATA 1680 

AATCATGTCT TTTCGATAAT TCGACAATTG TTTCTAATGT TTGATCAAAG TATTTAACTT 1740 

^ TAGCTTCTGC TAATCCATTT CTAAATAACA CATCTGCACG ATGCCCATCA ACTTCCATTT 1800 

GATGATGTTT GAAGTAATTC ACAAATCGTT CTGATAATAC TTCAGACTTC GkTAATTTAT I860 

TTTGkTGAAA AGCTTCCCAA TGTTGGTGaT TGaTTTTTTT AAATGkTAAA AAATCATCCy 1920 

10 

TTGTTGCTTT ATGATTAAAA ACATTCGCCA TATAGTGAAn CGCCCATTCT TCTGcATCAT 1980 

AAAAATCAAC AATTGTATCA TCAAAGTCTA TCAAAATATT TTTATATCCC AATTTCCCCA 204 0 

TCrCCTATAT TGTCTATGTA TCTAAATCTT AACAGAGGCT CAAATTTCTG CAAATAAAAT 2100 

IS 

AAACTGAGTG CATAACATTA AAGTATGCTC ACCCAGTTTA TTTTAAAGAA TATTAGTTAT 2160 

TATATTAGAA TCCAAATAAT TTACCTAGTA AACCCACACC GTTAGCAACG ATGTCTACGA 2220 

TACTTGTGCC TAATTTCACA CTATCATGTT GTTGTGCAGC TTGCACAGTA TTTGCGATTG 2280 

20 

CTTCTGCTAG TCCAGTCATT TAAATCTCTC CCTCACCTTT GAAATAATAC TGATTACTTA 234 0 

CATAACATAT TGAAATTAGA ATCCGAATAA TTTACCTAAT AAACCTACGC CATTTTCAAC 24 00 

2s GATGCTCACA ATGCTTGTGC CTAATTTTGC GCCATCATTA TTAATTGCTG CAGTTACGGT 24 60 

ATCTTTAATT GCGTTAAATA AACCTTCCAT TGAAAACACT CCTTAAAATT TAAATTTGAA 2520 

GATAACAAAA ACGTGCGTAg yTTTTAAATC ACCGAAATGT TATTCGCTTA ACGTTTTGTT 2580 

30 GTTGTTATTT TAAAATAAAT TTGATGCAAT TAGTTTGTTT ATCCGCACAA CATCTTATAA 264 0 

TGTACTTAAC TGTATTTTAA AGAGAAAAGA AATACAGTTA GGCATTCAAA ACTGTATTTA 2700 

ACACAATTAA GTTGCCTGAA TTCGTATTTA AGTCTTATTG AACCTTTTTA GATAAATAGC 2760 

TCTATAATAG TGAAAAATAT AAAC A TTTTT TATTTACAAG GTATTGCTAA TTTAAGTTCA 2820 

TTTAGATATA ATAATTCTTG TGTTGTTAAA CGTGTCCTGG TAGCTCAGCT GGATAGAGCA 2880 

ATGGCCTTCT AAGCCATCGG TCGGGGGTTC GAATCCCTCC CAGGACGTTT ATAGGTATTT 2 940 

40 

TTATACGCAT TACCAAACAA AAGAGTTCCG TGATTACGGG GCTCTTTTTG TTTTGAATTT 3000 

CAGTAATATA GTATGATGCG TCACCAAAAC GTCCCCCGCA TAAGCCCCGA AAATACAGTA 3060 

ATTAAAACAA GCATGCTTAT TCGTTATAGA ATTTTTTGAC ACACAATTGA CACGCGTCTG 3120 

45 

ACACTTGTTT ATACATTTTT AATTAAGTAA TTTTGTGCTC AAATTTCATC TATACTGCAC 3180 

CTGAACTACA CCAACACTAC ACCAAGATTT TTAACACTCA CCATTTGCAT GCGTAGAGAT 324 0 

TTTTATTATT ATATTATTCC TATAGATTTT GATACTATTC AAAATTTTAG GGACTTTTCA 3 3 00 

SO 

GGGGCCCGAA ATCCTATAAT TATAATTATA TACATCTAAA AAAAATAACC ACGTCCATCG 3360 

55 
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10 



IS 



30 



3S 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENCTII: 4353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : dovible 
SO <D) TOPOLOGY: linear 



3480 
3540 
3600 
3660 
3720 
3780 
3840 
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ATACTATTGG CAAATTTATA AAGTAGTTCA GCGTTTTTCA ATGACATATT GTCTAATGAT 
CTTTCATTTT TTCTCATTCT GTGTATTGTG CTTTGTGGAA CTCCTGTTTG TTTCGATATA 
TGTAAACTGC TCAAATCACT GTCTAATAGT TTTTGAATTT GATTTCTCAT TTTGTACACC 
CCCTTGTAAA TCTTCAATCA TCATTTAAAT TAAATAATTA CTTTTCCACA TATTCCAATA 
TTTAGGTTGC AAAGCATACC TCAAATATCA TTAATTTTGA GATTTAAATG TCAATTATGT 
GTCTATCAAT CCAATATACA TACTCTAATA ACGTAATAGT ACACACTCTT CTTATTAATT 
GGCCATAGCT ATCATGATAT AATTAGTGAA GAAAATCACA TAAGAAAGGT TGTAAATCAT 
GAGACTTCAA AAAGCACCTC TAGTAACGTC AGGACTAGTC TTAGGATTAT TAGGCCTGGG 3900 
TAATCTATTA AAAGACTTAT CTCTTACTTT AAACGCTGTT TGCGGAATCT TTGCTTTCTT 3960 
GATTTGGATT CACCTTTTAT GTACTATGAT CAAATATTTT AATAATGTGA AAGAACAATT 4020 
20 AAACAGTCCT CTAGTTTCAT CAGTGTTCAC AACATTTTTC ATGTCTGGCT TTTTAGGTAC 4 080 

TACTTATTTA AATACATTTT TTAGTAACAT AACTTTTATC AATAGCTTAA TAACGCCTAT 
TTGGATTTTA TGCCTTGTGG GAATTATGAC GCATATGATT ATTTTTTCAA TAAAATATTT 
2S AAAAGATTTT TCACTTGAAA ATGTTTATCC TTCGTGGACT GTACTTTTTA TTGGTATTGn 

TATCGCAGGA TTGACGGCAC CCGTTAGCGG ATATTTTTTC ATAGGTCAAT TAACAGTAAT 4320 
ATATGGCTTT GTAGCTACTT GTATTGTCTT ACCTATAGTT TTCAAGCGAT TAAAAGCATT 43 80 

TCCATTGCAG ACGTCAATCA AACCGAACAC ATCGACAATT TGTGCACCAT TTTCTTTAGy 444 0 

CGCTGCAGCA TATGTTATAG CTTTTCCTAA GGCGAATGCT TTTATCGTAA TTATAmTT 4500 
ACTATTAGCT CAAATATTTT ATTTTTATAT CATTATACAA TTGCCTAAAT TACTAAAAGA 4 560 

ACCTTTTTCG CCCGTATTTT CAGCTTTCAC ATTCCCTTTA GTAATCTCAG CAACTGCnT 4620 
AAAGAACAGT TTGCCTGTAC TTATGTkTCC AGACATTt GG AAAGGJcCTTT TGTTTATOGA 4680 
AGTGTTATTA GCCACTGTAA TAGTACTTAG AGTCTTTATA GGATATCTTC ACTTCTTTTT 4740 
AAAAAAGGAA AAACAAGATA AATTTCTnCG TAATGCGTCT CAGTAACACT ATTACCAAGA 4 800 

ATTAACACGT ATATTTAATA 

(2) INFORMATION FOR SEQ ID NO: 454: 



4140 
4200. 
4260 



4820 



55 
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ATTTGGATCT 


TTAATATCAC 


CAATATTTTT 


AATATCTTCC 


GGATTCAATC 


CATATACTTG 


60 




TACTGTATCT 


GAGTATTTAA 


TTGTGAAATA 


ATCACCTGAT 


TTAACTTTGT 


CATCAACTGT 


120 


5 


AATTTGTGAT 


TTTAATGATA 


AATAATCTTG 


GGCTGGTACG 


ATTTTATTGT 


TTTTATCTGC 


180 




ATCAACGACA 


GTtAATGTTG 


TATTTGATGT 


GATTAAATCA 


TTAACATTTT 


TAGCCTCTGT 


240 




TGATGATGGC 


TGTACTGCTG 


CTATACGCAT 


TCTTGTATTC 


AAACGTTTAG 


GTGCTGTACT 


300 


70 


TTTTGGCAAA 


ATGATATCTG 


CATTATTTTC 


ATTATTTGAA 


TTACTATTGT 


TATCAACAAG 


360 




AGTTTCATCA 


TTACTCTTGA 


TAGCATCACT 


TTTAACATTT 


AATGTAGTTG 


ATTCAGTTTT 


420 


IS 


GGCATCTACC 


TTTTTGTTTT 


CCTCATTAGT 


TGGTTGAACA 


TTTACCACTG 


ATTTATTCTC 


480 


TTGCAAATCA 


GGTTGTAACG 


CTTCTTGATT 


ACTTATAGTT 


TGTTTAGTGT 


TTAAATCTTC 


540 




ATTCGTAGAT 


TTTGGTGAAG 


CTTGCTCATC 


TGATTTGGCA 


GTTGAAACTT 


CAACTTTATT 


600 


20 


TCCAGTGGTA 


GATTGTACAC 


TTTCTTTTTC 


T ATT AATTT A 


TTCCCATTTG 


AAGTCGTTTC 


660 




ATTACCTTGa 


GATGATACCA 


TTTCTTTTTG 


ATTATCATTT 


TTAGTATTGT 


CTTCTTGATT 


720 




TAGTTGCTGC 


ATATCAACTT 


TATCACTCGA 


TTGATTATCA 


CTTGCTGAAG 


TTGTCGCTcG 


780 


25 


TTCAATTCTT 


TATTAGTACT 


TTCTGCAGCC 


TTTGCTTCTT 


GGTTCCCCAG 


ACCAAAAATT 


840 




AATGTTGTAC 


CTACTAAAAT 


TGATGCTGTT 


CCCACTGTGT 


ACTTTCTAAT 


CGAAAATTTA 


900 




TTTAATCGAT 


TGGATACCAT 


GCCTTTCCTT 


GTTATTGCCG 


TTTTATTTTC 


TCTGTTTAGC 


960 


'3/1 


ATTAGATTAC 


TCCTAATTCA 


TCAAATTTTT 


AAATAATACA 


ATTGTTTTAA 


ATACAAAAAT 


1020 




GTATATCAAT 


ATAGTATTAC 


ATTTTTAGAT 


AAAGCACAAT 


ACTTTAATTA 


TTTTTCTTTA 


1080 


35 


TCGTAAAACG 


TTATTTAACA 


TTTGTGTTTA 


AATAAAAGTT 


TTTATGAGTt 


TTGTAATCTT 


1140 


TATTTAATCA 


TCATAAAAAA 


TAGTATTATT 


TGCCCTTGAA 


ATTAATATCT 


TAGCTTTTCT 


1200 




AATTCATAGA 


CAATTACATT 


TCTGTAACAA 


ATTAAATTGT 


ATCTATTCCT 


TAAAGATTTT 


1260 


40 


TTGTTTTATA 


TCTGGGAATT 


TCTAAACAGA 


AAAAACCAGG 


CCACATGGAC 


CTGGTTAAGT 


1320 




TAATCATATT 


ATTTA'rrrix; 


rri-rriACGA 


CGACCGAATA 


ACAATAATGA 


TCCTAATGCC 


1380 




GCGAATAATC 


CACCGAATAA 


TGTGCCATTA 


TTTGAATTAT 


TATTTTCACT 


ACCTGTTTCT 


1440 


45 


GGTAATGCTT 


TAGCi'G'i''rrr 


ATGCTGATCT 


TTAACCGTAC 


TCATTGGTTT 


AGCCGGAGTA 


1500 




TGTTTACCTG 


CATCTGAATC 


TGAATCGCTA 


TCTGAATCTG 


AGTCGTTGTC 


TGAGTCCGAA 


1560 




TCGCTATCTG 


AATCTGAGTC 


GCTGTCTGAA 


TCTGAATCGC 


TATCCGAGTC 


TGAGTCGCTA 


1620 


50 


TCTGAGTCTG 


AGTCGCTATC 


TGAATCTGAA 


TCGCTGTCTG 


AGTCTGAATC 


GCTATCTGAG 


1680 




TCTGAATCGC 


TGTCCGAATC 


TGAGTCGCTA 


TCTGAATCTG 


AATCGCTATC 


TGAATCTGAG 


1740 
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25 



30 



35 



45 
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TCTGAATCTG AGTCGCTGTC TGAATCTGAA TCACTGTCTG AGTCTGAGTC GCTGTCTGAG 
TCTGAATCGC TGTCAGAATC TGAGTCGCTA TCTGAGTCTG AATCTGAATC ACTGTCTGAG 

TCCGAATCGC TATCTGAATC TGAATCGCTA TCTGAGTCTG AGTCGCTATC CGAATCTGAG 1980 

TCGCTATCTG AGTCTGAGTC GCTATCCGAG TCTGAATCGC TGTCTGAGTC TGAGTCGCTG 204 0 

TCTGAATCTG AATCGCTATC TGAGTCTGAG TCGCTGTCTG AATCGCTGTC TGAATCTGAG 2100 

TCGCTATCTG AATCTGAGTC GCTATCTGAG TCTGAATCGC TGTCAGAATC TGAGTCGCTA 2160 

TCTGATGTTT CTTCTTCGTA GTAGCCATTA TCAAGTGTGA AATCATCATG ATCCGTAATT 2220 

,5 GTTACATCAA CTTCGCCACC ATCGGCATCT TTATCATCTT CAGTTGTATT TGTACCTGTT 2280 

TGAGTTAAGC CAGCAGGTTT TTCAAAGATA ACTTTGTATT TACCACTATC TAAATTATCA 2340 

AAGCGGTATT TACCATTTTC ATCTGTyTCA GTTGTACCAA TTACTTCGCC TTTTTCGTTT 24 00 

20 TGCAAAGTAA CTTTAACACC TTTAATTCCT TTTTCAGTCG AATCTTGTTT ACCATCTTTA 2460 

TTACTGTCGT ACCAAACATA ATCACCTAAA CTATATTTTG GTGTTTTGTA GAATCCACTA 2 520 

TCTAATGTCA TGTTGTCAGC GTCTTTAATG ACACCTGTTg' TAGTTAGTCC ATCAGAATCT 2580 

ACAGCATCAT CTGTACCTAC ATTTGCAGTT GTCGGTGTAT AACCGGGTGG TGTTGAAAAC 264 0 

TCTACACTAT AAGTTCCATT GCTTAAACCA GTGAACTGAT ATTTACCATT TTCATCTGTT 2700 

GTCGTACGAT CTAATTCTTT ACCGTTACTA TCTTTAAGAA TGACATAAAC ACCTTTAATC 2 760 

CCTTTTTCAT TGGCATCTTG TTTACCATCT TTATTTGTAT CTTCCCATAC ATAGTCACCT 2820 

AGATTATATT TCTTTTGGTC GCCATTAGCA GTTGATGAGC CATTCACATT TGAATAACTA 2 880 

TTTGACCAAC TATATTTAGT TTTGTCAGTG TCTAAAGTAT AATCAATTTT TCCATTATCT 294 0 

GTTGAACTAT TATCTGGATA AGCAACTTGT TGAATGATGT ATTGTTTATT GCTGCTTGTT 3000 

TGGCCTTTCA TTAAATCGAC TGTAGCTGTT TTATTATCAT TACTATAAAT AACATCGAAT 3060 

40 TGATCAGTAA CATCTTTAAG TTTTGAAGTA TCAGGGGTGA AACTATCCAC AAATTCATTT 3120 

TGATCTGtCA CTtCGTAAAT TTTGAAGTTT TTTGCATTTG GATTAAATTT ATATCCAGTT 3180 

AAATTAGTAA CAAACGTTTG TTTAGTATAT GTATTTTTAG GTTGATTTAC ATATGCAGTC 324 0 

ATATTACGCG ATAAATCTTC ATTGTTAATA TAGTTTGTAC TTGAAATAAG CGGTTGTGCT 3 300 

TTTTTATTAC CATAATCGAC AATGATTTCT TCGCTATATG TATCATTACC TAAAGTTACT 3 3 60 

TCCATTTTAT AAGCTGTTTT ATCAGTTGTT GCATTTTTAC GTTTCGCAAA TGCAACTTGT 3420 

TCAAAGCTAC CTCTAACATT TGTATATTGA TCTACATAGT TCGTAAAAGT ATATGTTGTT 34 80 

GTGTTTGTTG TACTATCATA AATACCTTTT GCAATAATAT TACCTTGGGC ATTATATAAA 3 54 0 

55 



1370 

iNSOOCiD: <EP 0786519A2_I_> 



EP0 786 519 A2 



5 



10 



IS 



GTAAATGTAT 


CGCCCTCTTT 


AACAGAATCA 


TCGATTGTGT 


AATTTGCTTT 


TAATTTTAAA 


3660 


ACATCACTTG 


AAGTTGCCCA 


AAATTCAGTT 


TTACCAGTAG 


TCTGATTAAC 


ATGTCCTTTA 


3720 


TCAATCGCAA 


TGTCAATATT 


TGAAAAATGT 


ACTTTATCAT 


TAACATTTGT 


TCCTTGTTGT 


3780 


GGAGCTGCAA 


CAGTATTCAC 


TGCCATGCGA 


TTTAAAGTTC 


TTGGTTTAAT 


AGTCGTTGTT 


3840 


TTAGGTGTAG 


TTGAAACATC 


TTTTGCTTGT 


GTTAAATTAC 


TTTTATCAGT 


TTCATTACTA 


3900 


TATGTAGTTG 


ATGATTTATC 


ATTTGTTGTT 


ACATTGCTAG 


TTTTTGTAGT 


AGATTGATTA 


3960 


GCTGTAGCGT 


TTTGTGGTGA 


TTGCATGTTA 


CTACTAGTTT 


CTTTAACTGT 


TGCACTATCA 


4020 


CTCATTGTCA 


CTTTAGGCTG 


ATCTGCAGTT 


GCAGTTTGCG 


TATTGTCTTT 


TAGTTGACGA 


4080 


CTATCAACTT 


TTTTAGTTGT 


TTTATTCTCA 


CTTGGGGCTG 


TCX5TTTCATT 


TTTTGATTGA 


4140 


TTTAATTCTC 


CATTCGTATG 


TTCTGCCGCT 


TTAGCTTCAT GACCACTTAA 


CCCAAAAATC 


4200 


AATGTTGTCC 


CTACTAAAAT 


TGAAGCAGTA 


CCTACAGAAT 


ACTTTCTTAT 


CGAAAATTTG 


4260 


TTTAATCGAT 


TTGGTATCAT 


GCCTTTnCTA 


TTTGTnGCTG 


TCTTTTTATA 


ATTCATTTAA 


4320 


TAATACTCCT 


TTAAAATATC 


AAAATTTGAT 


AAATATAA 






4358 



(2) INFORMATION FOR SEQ ID NO: 455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDE0NESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 



40 



45 



SO 



TTGACTTCTT 


AATTCAGCAT 


TTTCTGCACT 


TAATGCTTTG 


TTCTTTTTAA 


TAAGTTGCTT 


60 


TCTTGCATAA 


ACTTCGGTAT 


CTATTTTACT 


ATTACTATAC 


CTTTGATTTA 


AAACTAATAT 


120 


ACCAATTAAT 


GCTACAATGA 


TAATGATAAG 


TACAACATAA 


AAAGACATTT 


TTTCACCAAT 


180 


ccrrn-ixiAC 


TTCTTTAACT 


TTGTATACAA 


TAATAATTAA 


TAAAGATTAA 


TTGTTATTCA 


240 


ATTTCCCACA 


TTT-rrATTAG 


TTGATTTTAG 


TTCATCATTG 


TTATAATCAA 


ATTATAAACT 


300 


GACAGATATT 


GATGTTCAAT 


GAATATGACG 


TGAAAGATTC 


GTGAATTCAA 


GTTTATGTCG 


360 


AATTTATGTT 


ATAACGGTCA 


TTTAAATGAC 


AGAATTAGGT 


CACTCATAGT 


ATTTTGAAGA 


420 


TTGAATTCAT 


TAATTTTAAA 


ATGTATAATG 


ATATTTGTGA 


AAGCGCTTGC 


TTAGGAGGTG 


480 


TATTTGAGAG 


TGAATGAAAT 


GAATGCTAAA 


GAACAATTAG 


TGGACAATTT 


AATGAAAACA 


540 


TCATCGCAAT 


TATTTAAATT 


TCACGGTGAA 


GTTGCCATGC 


AGCTTTTCTT 


AAATGATGAA 


600 
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AAAGTTATTC CGCAATCATA TGCGTTACTA TACATAGATA AGCAAGATCA AGCAATAGCT 
AAAGAAGATT TATCACTTTC AAAAATTGCA AAAGTTTATG TGCAATATGA TGATACAACA 
ATAATGAGTA TTTTCGTTTA TGATGTAGTA AACGATGAAT GGATTTTTAG ATTGGATCCG 
AATATACGTA TACCTAAGAG TAACATATAC TTCCATAGTT TAAATTGGGA TGTGGATATA 
TTAAACCGGA GtCGTCTAAT GTATGTCTAA TGCaCACCAT CAGaTCATCA TtATCCATTA 
TAAcGrGCAT AGTGCATAAn yACTwCAtTT TaTTaaATTG AGaGGgGCAC GATAGGTGCA 
TCAGGACATA ATATAGGAAG CATCAACGCG TGAnCAGGTC 
(2) INFORMATION FOR SEQ ID NO: 456: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double- 
<D) TOPOLOGY: linear 



720 
780 
840 
500 
960 
1020 
106O 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 56: 
25 ATAATnACTA AATACnAAAG TTTAACTGTC TTACTAATAA TGACTATGTT ATAATTTTAA 

AAGTGATATT TTGGGTAATC GCTATATTAT ATAGAGGAAA GTCCATGCTC ACACAGTCTG 
AGATGATTGT AGTGTTCGTG CTTGATGAAA CAATAAATCA AGGCATTAAT TTGACGGCAA 
TGAAATATCC TAAGTCTTTC GATATGGATA GAGTAATTTG AAAGTGCCAC AGTGACGTAG 
CTTTTATAGA AATATAAAAG GTGGAACGCG GTAAACCCCT CGAGTGAGCA ATCCAAATTT 
GGTAGGAGCA CTTGTTTAAC GGAATTCAAC GTATAAACGA GACACACTTC GCGAAATGAA 
GTGGTGTAGA CAGATGGTTA TCACCTGAGT ACCAGTGTGA CTAGTGCACG TGATGAGTAC 
GATGGAACAG AACATGGCTT ATAGAAATAT CACTACTAGT TTAGCTCTCC TAGATGATGG 
AGAGCTTTTT TCATGAAAAG AACACTTAAA ATTAACGCCy TGTCTTGaTA tAATGACaCT 540 
GCcTTGTTTT AAAATAGTAA GCGGATGCgT TAATGTATCA GCGATTAAAT TTGTTGGAAA 600 
TGTATAAAAA ACACAAGCTA AGAATAAAAT ACCTGTATAA AAGGAGAATC ATATATGTTT 660 
CAATTACTTG CAGTTTGTCC GATGGGATTA GAAGCTGTTG TTGCTAGGGA AATTCAAGAA 720 
TTAGGCTATG AAACAAATGT TGAAAATGGT CGTATATTTT TTGAaGGAGA CGCAAGTGCA 780 
ATTGTAAAGG CAAATTTATG GTTGCGCACA GCAGACCGAA TCAAAaTTGT TGTTGGACGT 
50 TTTAACGCAA CAACGTTTGA CGAATTATTC GAACAAACCA AAGCGCTCCC TTGGGAATCT 

ATAATTGATA AAGAGGGTAA CTTCCCAGTT CAAGGTAGAA GCGTTAAATC AACACTACAT 



60 
120 
180 
24 0 
300 
360 
420 
480 



840 
900 
960 



55 
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TATAACGAAA AAGGTTGGTT AAATGAATCA GGTGCCAAAT ACCCTGTTGA AGTTGCCATT 
TTAAAAGATA ATGTATTATT GACTATCGAC ACATCAGGTT CTGGTTTGAA CAGACGTCGT 
TATAGATTAG CACAAGGTGA AGCACCAATT AAAGAAACGT TGGCAGCAAG TTTAATCCGT 
CTTGCCAAAC TGGGAAAGGT GATTACACCT TTnAATTGGT CCCATTTGCG GTTCnGGTTA 
CA 

(2) INFORMATION FOR SEQ ID NO: 4 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 



CCTGGCTGCT 


TTATCAGCAT 


CTACTACTTT 


AAAACCGAAT 


ACGGATAAGA 


GTTCTGATAC 


TGTTGATTTT 


CCTGAGGCGA 


TTCCACCTGT 


TAGACCAATA 


ACTTTCGGCA 


TAATTTCACT 


CTTTCTTTAT 


TTTTGACATA 


CTGGACAATA 


ATGACTATTT 


CTTGTCGCGA 


TGATTTTTGT 


TTCAATTTGA 


CTTCCACACA 


CTTTGCATAC 


CGGCTGCTTA 


TATACATTAA 


GATGCAATTG 


CATCTCACCA 


GTTTTTCCAT 


CAGCATGACG 


ATAATCTGAA 


ATACTTGTAC 


CGCCATATTT 


AATACCTTCT 


TCTAGTACTT 


CTCTAACATA 


ATAAAAAACC 


ATTTCTTGTT 


GTTGGTGTGT 


TAAGTCTTTT 


AcrrrriTAT 


CTGGTAAAAC 


ACCTGCACGA 


AACAACGCTT 


CACATGCGTA 


AATATTTCCA 


CAACCTGCGA 


TTACTTTATG 


ATCCAAAATC 


ACTTGTTTGA 


TTGGTTTATT 


CTTATTAGAC 


TGTTGATGAA 


TTCGATTTAA 


ATAATACGTC 


AATGCTTCAT 


TTGAAAAAGG 


TTCAGGCGCT 


ATTTCTAAAA 


ATGAAGGATA 


AGATGCTACA 


GACGCAACAT 


TTCTAATTTC 


TCCAAAACGA 


CGTATATCTG 


AATAAATTAA 


CTTTTTGTCA 


TTTGACAACT 


CAAAAATAAC 


ATGCCAATGC 


TTACGATAAT 


TAGGTATCAT 


AATATCTTCA 


AGTTCATCTA 


CAATGAAAAA 


ACCGCCCGCC 


ATACCTAAAT 


GACTAATTAA 


TGTACGTTGT 


TCTCGTTTAT 


TATCTAGCTG 


AAAAACGATA 


TATTTACTTC 


TTCGTTCTAC 


ATTTGTAATG 


GTATAGCCTT 


CCGATAAAGT 


TTTAAAAGTA 


TCTAATTCAA 


TTCCTTTTAT 


AATTGTTTCC 


TTGCCTTGAG 


CTTTACCTTC 


GATTACTTTA 


TCCGAAAATA 


TAACGTGTTC 


AArmTTGA 


TTTATAACGT 


AGGGTTCAAT 


TCCTCTTTTT 


ACATGTTCTA 


CTTCTGGTAA 


TTCGGGCATA 


CCATTAACCT 


CACTTTATTT 


TGCATCATAC 


CAGGTTGCAC 


CATAACTTGA 


GTCTACTIIT 


AATGGAACAT 


CTArTTGCAA 
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TT 

(2) INFORMATION FOR SEQ ID NO; 458: 

^ (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 



75 


CCTTTAGTAA 


ACAATCCTTC 


TTTAGTTTTA 


GTACGTTGTT 




IaKI i. iValAl 


o U 


TTCATTGCCT 


CGCCCGATTG 


AGTGCCGCTA 


AAGTTATCAT 


('"■"i"l"l'r^A'm*T"i' 








GTAAACATGT 


GTATATCACT 


GTTTAAACGG 


TCTTTATAAG 








20 


TATTGCTTAT 


AAATATAACC 


ACCATCAACA 


GAGCCTTCTG 






o A rt 




GCATAAACAG 


TCGGTTCTAA 


AAACAACACG 


TTAGCTTCCT 


TTTGTTTm* 

A X A \J X JL X X v.. X 




"a A n 
J u u 




GGATCTAAAT 


TTAAATTACC 


TTTAATAAGT 


AACATAGCGT 


CATTTAAATC 




T c r*^ 

J o W 


25 


TTAGCAsymy 


CTGATTCAGC 


ATTATCATAC 


AAATCAATTA 


AAGTGATTAC 


TTmV AT A A 






Tccccrrrrc 


TTCTTTCGTT 


GTTGCTAAAT 


TCTGTAATAG 


GCATACGTTC 


GAAAGAGTGT 


*B 9 w 




GATTCAAAAC 


CGTTTTCACG 


TGGTGTGAGC 


TTCAATCCAT 


TTGfTTCTACT 


GGTAAGATAT 




30 


CTATAAACAC 


CGTGTGAAGT 


GAATAAATCA 


ACTGTAAACA 


CTTCATCTTG 


GTCAGTCTTG 


600 




TCTATTGGTT 


TAGTTCTTAA 


ATATCTAACG CCTGCGATAC 


TATTACGTTC 


AATTGTATTG 


660 


35 


TCGTATATGA CAAAAGTACT 


CATTGCATCA 


CTCTTGTATA 


AACGCGTTTC 


ATCATCTTGG 


720 


TTTCTAATCA 


TTAACTCATA 


AGCTTTGCCA 


TAAATTGACA 


AATCTAATCC 


TAAAGATCTA 


780 




TTGTGTGACT 


CAACATCATT 


TAAATCATTG AACGCCTCAA 


TAACTTCTAA 


TACATCTTTG 


84 0 


40 


TCATCATCTT 


GATATTGAAT 


TGGATTACCC 


AAGAAATAGC 


CGTTGATAAA 


ATCGCTAATA 


900 




TAAGATGCGT 


AATCATGCGC 


TACACGGTTA 


TCTGCCATGT 




GCGTCGTGTT 


960 




AACTCAACTA 


AGTTCTTAGT 


TTTACCTTCG 


TAATAATCAC 


TTAACACTTT 


CAATCTAGGT 


1020 


45 


CGTTGGTAAT 


CCATGTGATG 


TTCAATGTAT 


TTACTTACTT 


CATTAACGTT 


TTGTAATAAA 


1080 




TCGGATTCCG 


TCCCGTCATA 


TGTGTAAACA 


ACATTGGCTT 


CATCATTAAA 


TAAGTAATTT 


1140 




ATGTTTCCCC 


GTAGATCTGT 


ATCTGTTTCA 


AATTCGTTTA 


CTTTTAACAT 


TTGTTCCCTC 


1200 


SO 


CTATAATCCT 


AGAGATTTTA 


TTGTGTCAAC 


TTTCGAACTG 


AGATTTGTGC 


GTTTtCTAAC 


1260 




CGGTCTGTAG 


AATCGTTCCA 


CTGAATAACG 


CAACGAATCG 


ATACAATGAT 


TGTATGTATC 


1320 



55 
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CTCTTCAATA GTCTTGAAAC AACGTTCATC AACAATGATT TCAAATTGCA TTAAGAATTG 
TAACCCTTGT ACAACCGAGC CCTTCCCTTT TTTGGTTGGT AAAATCCTTT TAAGCCCTAG 
ATTCCTTAAT TCAGCTATAC TTTTTTGTTC TGCACTATCT GCTGTAATTT CTTCTTTAGC 
ATAACCAAGT TGCTTTATGA CATTAGCTAT TTCATCATTC AGCATACCTT GTTTAACATA 
CTCTTCAATG ATGTATAACT TCTTTTTCTT TACATCTATT TTAGAATGTA TAAAAGCACT 
AGGATCATTA ACGTAGCCAA AGTCCAATCC AAAATAAGAA GGTAAATGTC TTAACTCATC 
TTTAnTATT AAACGTTTTT CATACTTAGG GAAAACCAAT TTGTCTAGTG TAGCAAATTC 
ACCTAACGCA TAAA 

(2) INFORMATION FOR SEQ ID NO: 4 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459: 
AATTnAGATT ATTACCCTCC TTTAAAAATA TTTGTTTCAC AATTTTTTAT TACCTATTTA 
CTGGTTTCAT GTCTTATGGG GCATTTTACT TGTTGGTATT TGAAAATGTG CAAAATTTAA 
TCTTATATGT TTCTTGGCTT TTCATGACTA TGCTATTTAT GTTTATGAAT ATGCATTCAA 
TTATAGATAA AAAAGTACAT ATATTCTTAA AGTCTAATAA ATAGTTACAA ATTTAGTTAG 
TTTTCAATTG TTAATTAGGG GTGGTAAACA GTGCTTTGTG AATCTAGACA AATTTATAAA 
AATCCTAAAT ATCGAGTTAT TAGATATAAT AATGAATATT TCATGGTCGA TTTAGTAAGT 
ACTTGGATTA CTTATTTTTT CCCTATGATT AATTGGTTTT TGCCCAAAAA ATACGCAAAA 
ATTAGCGAAA ATGAATTTGA AAGGTTAAAT ATAGTCGAGC CTGTTAAAAA TAATGTmT 
TGGCCGGTTG CAGGAAGTTC AGTTCTATTT GGAATTATAT TGAGAAAGTA CGGTAACTTC 
TTTAATGTTC AGTTTGAAAA ACAACTAGCA ATCACTGTAT TTTTTATCAT GTTAATAGGG 
ATGTTAATTT TTTATTTTTA TCTAAATAAA AAATTAACAT TAAAAATTTT TAATACCAAC 
GTGGGTAATA AGAATAGGAG TTGTAT 
(2) INFORMATION FOR SEQ ID NO: 460: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 

^ ATCTGCAATT ATGGGCACAC CCAAGCTnAT GCAAGTAACT AAAGGAGAAG TACTTTTAGA 60 

CGGTGTAAAT ATTTTAGAaT TAGAAGTTGA TGAAAGAGCA AAAGCAGGAT TATTCTTGGC 120 

AATGCAATAT CCATCAGAAA TTACAGGTGT TACAAATGCT GATTTCATGC GTTCAGCAAT 180 

W 

CAATGCGAAA CGTGAAGAAG GACAAGAAAT CAACTTAATG CAATTTATTA AGAAATTAGA 24 0 

TAAAAACATG GATTTTCTAG ACATAGATAA AGACATGGCA CAACGTTATT TAAATGAAGG 3 00 

TTTCTCAGGT GGAGAGAAGA AACGTAACGA AATCTTACAA TTAATGATGT TAGAACCTAA 36 0 

IS 

GTTTGCaATC TTAGATGAAA TCGATTCAGG GTTAGACATC GATGCATTAA AAGTTGTATC 4 20 

TAAAGGTATT AACCAAATGC GTGGGGAAAA CTTTGGTGCA TTAATGATTA CACACTATCA 4 80 

20 ACGATTATTA AATTACATTA CTCCTGATAA AGTACATGTA ATGTATGCTG GTAAAGTCGT 540 

TAAATCTGGT GGTCCAGAAT TAGCAAAACG TCTTGAAGAA GAAGGATATG AATGGGTTAA 600 

AGAAGAGTTC GGTTGAGCTG AATAATCTTA TTAATACAGT ATCCATGAGA TGTTCATCTA 66 0 

25 TATATGATGA AAATGAACAT TTATACGAAA TAGTAAATTT CATCAAGTAG GAGGAAAAAG 72 0 

TTATGACAAC TGATATTTTG rACaTTyCTG AAGAACAACT TGTTGATTAT TCTAAAGCCC 78 0 

ACAATGAACC TTCTTGGATG ACAGAATTAC GTAAAAAAGC TTTGAAATTA ACAGAAACTT 84 0 

TAGAAATGCC AAAACCTGAT AAAACAAAAT TAAGAAAATG GGATTTTGAT TCTTTTAAAC 900 

AACACGATGT AAAAGGTGAT GTTTATCAAT CTTTATCACA ATTACCTGAG TCAGTAAGAG 960 

AAATTATTGA CGTAGATCAT TCTAAAAACT TAGTAATTCA ACATAATAAT ACGATTGCGT 1020 

35 

ACACACAAGT TGATGATAAT GCATCGAAAG ATGGCGTTAT CGTTGAAGGT TTAGGAGACG 1080 

CTCTTATGAA CCATAGTGAT TTAGTACAAA AGTACTTTAT GAAAGATGCA GTAACAGTAG 114 0 

ATGAACATCG TATCACAGCG CTACACACGG CATTAGTTAA TGGTGGCGTA TTTGTTTATG 1200 

40 

TTCCTAAAAA TGTAGTTGTA GAACATCCAG TACAATACGT TGTGTTGCAC GACGACGAAA 1260 

ATGCAAGCTT TTATAACCAT GTTATCATCG TTACTGAAGA 1300 
45 (2) INFORMATION FOR SEQ ID NO: 461: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



55 
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GACAGCAAAT TCAAGATACA TTAAATAAAG ATATTGTCAT AAAGCATATT CTTGTTCGAG 60 

ATAAATCTAA AAAGAGACCG CTAAATATTA GCCAATATCA TTTAACTGAA GATGTTAATG 120 

AAATTTTAAA TGATGATTCA TTAGATATTA TCGTTGAAGT CATGGGAGGA ATTGAACCAA IBO 

CTGTAGATTG GTTAAGAACA GCACTTAAAA ATAAAAAACA TGTTATTACC GCAAATAAAG 240 

ATTTATTAGC AGTACATCTT AAACTTTTAG AAGATTTAGC AGAAGAAAAT GGTGTAGCTT 300 

TAAAGTTTGA AGCGAGTGTA GCAGGTGGTA TTCCGATCGT AAATGCCATA AATAATGGTT 360 

TGAATGCGAA TAATATTTCA AAATTTATGG GAATTTTAAA TGGTACCTCT AATnTATTT 420 

TATCTAAAAT GACTAAAGAG CAAACGACAT TTGAGGAAGC ACTTGATGAA GCGAAAAGAC 480 

TTGGTTTTGC TGAAGCGGAT CCAACTGATG ATGTAGAAGG GGTAGATGCA GCGCGTAAAG 54 0 

TTGTCATTAC ATCATATTTA TCATTTAACC AAGTCATTAA ATTAAACGAC GTTAAACGAA 600 

GAGGAATTAG TGGCGTAACT TTAACTGATA TTAATGTAGC CGATCAACTG GGGTATAAAA 660 

TTAAATTGAT TGGTAAGGGA ATATATGAAA ATGGCAAAGT TAATGCATCG GTAGAACCAA 720 

CGTTAATTGA TAAAAAGCAT CAATTAGCAG CTGTAGAGGA TGAATATAAC GCGATTTATG 780" 

25 TTATTGGTGA TGCCGTTGGT GACACGATGT TTTATGGAAA AGGAGCAGGC AGTTTAGCAA 84 0 

CAGGTAGTGC CGTTGTCAGT GATTTATTGA ATGTAGCATT ATTCTTTGAA TCAGATTTAC 900 

ACACATTGCC ACCACATTTT GAATTAAAGA CAGATAAAAC ACGGGAAATG ATGGATTCAG 960 

ATGCAGAAAT TAATATTAAA GAAAAATCCA ATTTCTTTGT AGTAGTGAAT CATGTCAAAG 1020 

GTTCAATTGA AAATTTTGAA AATGAGTTAA AGGCAATATT ACCATTTCAC CGATCATTAA 1080 

GAGTTGCAAA TTACGATAAT CAATCATATG CCGCTGTTAT AGTTGGATTG GAATCATCAC 1140 

CGGAAGAATT AATCACTAAG CATGGATACG AATTGACAAA GTATACCCAG TAGAAGGAGT 1200 

TTAATTATAA TGAGAAGATG GCAAGGATTA GTAGAAGAGT TTAAAGCACA TTTACCAGTA 1260 

AATGAAAATA CACCAAAATT AACATTGAAC GAGGGAAATA CACCACTCAT TCATTGTGAA 1320 

AATATGTCTA AAATACTAGG CATAGATTTA TATGTGaAGT ATGAAGGTGC CAATCCGACA 1380 

GTTCATTTAA AGATCGCGGT ATGGTAATGG CTGTGACAAA AGCAAAAGAG CAAGGTAAGA 144 0 

AAATTGTAAT ATGCGCTTCG ACTGGAAATA CATCAGCGTC TGCAGCAGCA TATGCAGCGA 1500 

GAGCAGGTTT AAAAGCTATC GTCGTAATAC CAGAAGGTAA AATTGCATTA GGTAAATTGT 1560 

CGCAAGCAGT AATGTATGGT GCAGAAATCG TTTCTATTGA AGGAAACTTT GATGAAGCTT 1620 

SO TAGAAATTGT AAAAGAAATT GCAAAAAGTG GCGAAATCGA GCTTGTAAAC TCTGTCAATC 16 80 

CATTTAGAAT CGAAGGACAA AAGACAGGCT CATTTGAAAT TGTACAACAA TTAGACGGTG 174 0 



55 
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5 



10 



IS 



25 



30 



35 



AAGGCTTTAA 


AGAATATCAT 


GAAGCTAAAG 


GATCACAATT 


GCCGAAAATG 


TTTGGCTTCC 




AAGCTGAAGG 


CGCATCACCA 


ATTGTTCAAA 


ATAAAGTCAT 


TAAAAATCCT 


GAAACGATTG 


1 oon 

X 7Z U 


CAACTGCTAT 


TCGAATTGGT 


AATCCTGCTA 


GTTGGGATAA 


GGCGACTAAT 


GCTCTTAAAG 


X ^ o u 


AATCAAATGG 


ATTAATAGAT 


AGTGTTACTG 


ATGATGAAAT 


TCTAGAAGCA 


TATCAGTTAA 




TGACAACTAA 


AGAAGGTGTC 


TTTAGTGAAC 


CAGCGAGTAA 


TGCTTCTATT 


GCAGGTTTAA 




TTAAATTGCA 


TAGACAAGGT 


AAATTACCTC 


AAGGTAAAAA 


AGTAGTTGCT 


ATTTTAACTG 


^ X o \^ 


GTAATGGATT 


AAAAGATCCT 


GATACTGCTA 


TTTCACTACT 


AGATAATCCG 


ATAAAGCCAT 


^ ^ ^ V 


TGCCAAATGA 


TAAAGATAGC 


ATTATCGATT 


ATATTAAAGG 


AGCTTTATAA 


CATGT CGAAT 


22 8 0 


GTTTTGGAGT 


TAACAATTCC 


TGCATCAACA 


GCCAACCTTG 


GAGTTGGCTT 


TQATT CT AT A 


5*^4 ft 

^ ^ *t V 


GGTATGGCTT 


TAGATAAATT 


TTTGCATCTG 


TCTGTAAAGG 


AAACATCAGG 


GACAAAATGG 


!?A n ft 


GAATATATTT 


TCCATGATGA 


TGCATCTAAG 


CAATTGCCTA 


CTGACGAAAC 


AAA (TTT* AT*T 


OA a ft 


TATCATGTAG 


CACAACAAGT 


TGCTTCTAAA 


TATAGTGTTG 


ACTTGCCTAA 


TTTATGTATC 


252 ft 

^ ^ 4i V 


GAAATGAGAA 


GTGATATTCC 


ATTGGCAAGA 


GGGTTAGGTT 


CGTCAGCTTC 


TG CTTTAGTA 


2580 


GGAGCTATAT 


ATATCGCAAA 


TTATTTTGGT 


GATATCCAAC 


TGTCTAAACA 


TGAGGTATTA 


2 Ad ft 
^ O ri U 


CAATTAGCGA 


CTGAAATCGA 


AGGACATCCT 


GATAATGTTG 


CGCCGACCAT 


TTATGGTGGT 


2700 


TTAATCGCTG 


GATATTATAA 


TGATGTCTCG 


AAAGAAACGT 


CaGTtGCACA 


TATCGACATA 


2760 


CCAGACGTGG 


ATGTGATTGT 


AACGATACCA 


ACTTATGAAG 


TAAAAACAGA 


AGCATCAAGA 


2820 


CGTGCTTTAC 


CACAAAAATT 


AACACATAGT 


GAAGCGGTTA 


AAAGTAGTGC 


AATTAGTAAT 


2880 


ACAATGATTT 


tGgCATTAGC 


ACAGCACAAT 


TATGAATTAG 


CAGGTAAACT 


CATGCAACAA 


294 0 


GATGGCTTTC 


ATGAACCGTA 


TCGTCAGCAT 


TTAATTGCTG 


AATTTGATGA 


AGTGAAAACA 


ft ftft 

O U w U 


ATTGCTAGTC 


AACATAATGC 


CTATGCAACT 


GTAATTAGTG 


GTGCTGGACC 


AACTATTTTA 


3060 


ATATTTAGTC 


GTAAAGAAAA 


TAGTGGGGAA 


TTGGTTCGCT 


CTTTAAATAG 


TCAGGTAGTA 


3120 


TCATGCCATT 


CTGAA 










3135 


(2) INFORMATION FOR SEQ ID NO: 462: 









45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

so 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 
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AGnTCAATAT 


TTAGATCAAG 


CCGTTTTAAG 


TAATTACGAA 


CAAGTTTATA 


TCATTCATGG 


TAAAGGTACA 


GGTGCACTTC 


AAAAAGGTGT 


ACAACAACAT 


TTGAAAAAGC 


ATAAAAGTGT 


TAGTGACTTT 


AGAGGTGGTA 


TGCCAAGCGA 


AGGTGGATTT 


GGCGTTACCG 


TTGCAACACT 


AAAATAAATT 


ATAATTTGAT 


AAATTAAATA 


GCTGCAGTTA 


AAATAATGTA 


AAGCAACAAG 


AATACATTTC 


AAACATGTTA 


TTTGAAATAA 


GCATAAAAAT 


TGAGCAAATA 


GAAATACATG 


AAGCATGTTA 


TCTGATATAA 


TTTGAACATC 


ATAATAATAA 


TTAAGGAGGA 


TTGGCATTTA 


TGGCAATCGT 


AAAAGTAACA 


GATGCAGATT 


TTGATTCAAA 


AGTAGAATCT 


GGTGTACAAC 


TAGTAGATTT 


TTGGGCAACA 


TGGTGTGGTC 


CATGTAAAAT 


GATCGCTCCG 


GTATTAGAAG 


AATTAGCAGC 


TGACTATGAA 


GGTAAAGCTG 


ACATTTTAAA 


ATTAGATGTT 


GATGAAAATC 


CATCAACTGC 


AGCTAAATAT 


GAAGTGATGA 


GTATTCCAAC 


ATTAATCX5TC 


TTTAAAGACG 


GTCAACCAGT 


TGATAAAGTT 


GTTGGTTTCC 


AACCAAAAGA 


AAACTTAGCT 


GAAGTTTTAG 


ATAAACATTT 


ATAAGTTACA 


ACCAATGACG 


ACTGGGGCAT 


TTCTTTAATG 


AATTGCTCCA 


GTTTTTGTTT 


GTGTTTTTAA 


TATAAAAAGT 


TGAATGATAA 


GTCATCATAT 


TGTTTACGAC. 


TTGAGAATGG 


TGGGATTAAT 


AAATCTATGA 


ACGTTAAATG 


ATAATCTAGC 


ATGCTGATAG 


ATTTGTAGCA 


GTTGGTTTGA 


TAAAACCATG 


TTCAATATTA 


CATGATGTGC 


ATGAAAAGTC 


ATACTCGAAG 


ATGTTGATTA 


TTAACTAGAA 


TTAGTGGTGA 


TAAATTTGAA 


GCACTTTTGT 


AGCATCATTC 


ATTTTAAAAT 


TAGAAGGGGG 


GATATTTTTG 


GAAGACTATA 


AGCAACGAAT 


TAAAAATAAA 


TTAAATGTCG 


TACCTATGGA 


ACCAGGATGC 


TATTTAATGA 


AAGATCGTAA 


TGATCAAGTG 


ATATATGTTG 


GCAAAGCTAA 


AAAGCTAAGA 


AATCGATTGC 


GATCATATTT 


TCACGGGTG 













(2) INFORMATION FOR SEQ ID NO: 463: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 
AGTTCAACAC GACGAATTTT ACCTGAGTTT GTTTTTGGTA AGTCGTCAAC GAATTCAATC 
TCTCTCGGAT ATTTATATGG TGCAACTTCA TTTTTAACAA ATTGTTGTAG TTCTTTAACT 
AACGTATCAT CACCCGCAGT ATGGTCCTGT AAAATAACGA ATGCTTTAAC AATATTTCCT 
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GCATCTTCAA CTTCAAAAGG CCCAATCGTA TAGCCTGAAC TAATAATAAT GTCATCTCGA 
CGTCCTTCAA ACCAGAAATA ACCATCATCA TCTACATGAG CTAAGTCACC AGTGATGTAG 

TATTtACCTG TTTGCGCTTT CGCCGTACGT tCTGGCTCTT tATAATACCC TTTGAAAAGT 42 0 

GCTGGCAAAT CAAGTGGTAC TGCAATATTC CCTTTCGTAT TAGCAGGTAC GCTATTCCCC 4 80 

TCATCATCTA CTACAGTGAC CGAACTACCC GGAATGCCTT TACCCATTGA TCCAATCCTC 54 0 

TGTGGTGTAT CTTTTAAAAA GCCTATAAGC AAGGTACTTT CAGTCTGGCC ATATCCATCT 600 

CTTACAGTTA AATTAAAGTA TTTCTTGAAT TGTTCAACTA CTTCTCGATT TAGTGGCTCA 660 

CCTGCAGAAA CGGCACTATG TAAATGCGTT AAGTCATAAT CATTTAAGTT CTGTAATTTA 720 

GCCATCATAC GATATTCTGT CGGTGTACAA CATAAAACAT TAATTTGATA TTTTTGAAGC 780 

AATTCTAAGT ATGTTTCAGG ACTGAACCTT CCATTAAATA CAAAAGCAGT TGCACCTGAA 84 0 

20 CCTAATACAG ATAAGAAAGG ACTCCATACC CATTTTTGCC AACCTGGTGc TGCTGTTGCC 900 

CAAACTAAGT CATCTTCATT aATACATaAC CAATGTTTTG GTGCCATTTG TaAATGTGcA 96 0 

AATCCCCaTC CATGACAATG TGTAACGGCT TTAGGATTGC^ CAGTTGTACC AGATGTATAT 102 0 

25 GACAGAATCG CCATATCATC ACGCGTCGTA TCTGCCATTT CTAGTTTGTT ACTTGCGTTT 108 0 

TCTTTTTCAG CTTCAAGTGA AATCCATCCA TCTTTTTGAC CGGCAATAAC AAATTTAGTT 114 0 

AACGCATCAT ATTCTTTAAT TTTTTCAAAT TCAACTGTGA ATGGCTCTAG TGCAATAACT 12 00 

GCATTAATTT CACCATGTGT GATACGGTAT TGTAAATCTT TAGTTCTTAG CATTTCAGAA 126 0 

CATGGAATGA TTGCAACACC TAATTTTAAA GCAGCAATAT ATAATTCATA CGTCGCAATA 1320 

GATCGTGGCA TCATAATGAG TACTTTATCG CCTTTAGATA AACCGTGCGA TGCTAAAACA 1380 

TTACCTACTT TATTAGACTG TTCAATGAGC TGTTGGTAAG TGACTGATAT ATCTTCGCCT 144 0 

TCAGTATTAT GATATAAAAT TGCCTTTTTA TCTGGTATGT GGCTATATTT TTCGATTTCC 150 0 

GAAATAATGT TATATTTTTC AGGCGCGAAT AGAGCTGACT TTTGCATAAC TAACTrCCTT 1560 

TCATACATCC ACTTTTCCTG TGATGAACAT TGTAATTTTA TAAATGAATT ATATACATCA 1620 

TACGCCTATC TTTACAGAAT TTTCAATTAA ATAGGGTTAA ATACCAAAGT CCTCGACACT 1680 

45 ACACTTTGAC ATGACGTAGC ATTCAAGGAC TTTCAAATGA TTGAGGGTTG ATATCTCGGG 174 0 

CTAGACCATA TCAGCTAATT CAATACGAAT ATTGTATGAT AATTCACX3AT TAATTATTTT 1800 

TACATCTGCA CCTTTCGAAG TGCCACX3ATG CTTGTGTGTA TGCTTGTACT CAGCTGAATT 1860 

SO TTGCCAATGA TAAAATGCTT GCCTATTTTC CCACAGCGTA ATAATGATAT AGTGTCTACC 1920 

AGCTGTTCTA GGTCTTAAAA ACCTTAATGC TTTAAATCCA TCAACGTTTT TTAAATGCTT 198 0 
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ATTTAACACA CATAATGAAT CATTTGATAA ATCATTTATC GCTTCTAGCA CATCGTAATA 
TGCAGTGTCA TTATTTTTTT GTATTGTGAG ACAATCATCC AGTrCTTCTA TTACATAACT 
TCTATATTCA TCATAAATTT TCATAATAAA TGCCTTCATT TCATTTATAT TTTTGGTCAT 
ATTACTkTAT ATCTATTACT AAtJcCATTCC CGTATTTATT AATTACAATC ATAGTTTGGC 
TyCTTTTTAA AAGATAAGAC TTTGTAAAAA GTATTAATAT TTCATGCAAA TGGGGGACAG 
GAGTCGCCCA CTATTTTTGT GTCTTCAATT TCATGATCAT TATTTAACAT TAGTCATGAA 
AATAGCCGAC 

(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 
TTTATTAATT GTAAAAAATT GAGTAAATTA TCTTTACATT CTAAATTAGT CTTAACTACA 
ACTAGTATCC TAATAATTAT AGGAGCTATT ACATTCTTTT TATTAGAACA GTTTAATACT 
ATGCAACATA TGGGACTAGT TGAAAAAATC GGAAATTCTT TTTTCCAATC AGTAACAACA 
CGAACAGCGG GTTTTAACAG TATAGATATA GCAAGCATTA rCAAATCTAC CGCATtAaTG 
TTAATGCTAC TTATGTTTAT TGGTGGTGCC CCTCTCAGTG CAGCTGGAGG AATTAAAATA 
ACTACTTTTG CAGTTGCGTT TATTTTTGTA CTAAATTATA kACGTAAAGA AAATAATGTT 
TCAGTATTCA ATAAAGAAAT ATCTGACAAA CATATAAAAC TATCTATTGT TACCATTAAT 
ATCTCATTTC TATTTATCAG CATCATTACT TTTATATTAT CGATAATTAA TCCGAACATA 
TCATTAATCA AGTTATTATT CGAAGTGGTT TCTGCATTCG GAACAGTAGG GTTAAGTATG 
AACCTTACCA CAGAATATCA TGGTATTACT AAAATAATTA TTATATTCGT 
(2) INFORMATION FOR SEQ ID NO: 465: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 65: 
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TATTCGATTT GAnTCTTTTA AAATTATGTT TAAAAACATG TCTAATGATT CCGCATCAIT 
TTTTGCACCA TCAATAAGCG TTTCAGcAAA CCCCTTAATT GAAOTAATAG GTGTTTTTAA 
TTCATGTGAA ACATTTGCTA CAAATTCACG TCTTAGATTT TCAAGTrGTT TCAGATTTGT 
TATATCATGC ATCACaACTA AAATCCCtTG CAAACTTTTT TGAGACCTAG TTAAAATCGG 
AACGCATGAA ATATCAAAGT ACTTGGCATG GACTTGGTTT ATTGCAACTT CCAATTGTTC 
ATAAATAGGT TTTTCAACTT TAAAACTTTC TAAAATTAAT TGCTCAATTT CAGTATTAAC 
ATAGCCGTGA TAGCCTACTT GTTCAATATT ATGCGAGATG TTGAACTGTT CATAATACGC 
TTTATTTGCA ACAACGATTT TTCCATTTCG ATCTATCATT AAAATAGCAC TTGGAATATT 
TTCAATCGTT GTriTTAAAC GGTTGGATTG AATTTTTTGC TCATTATTAA GCTTITGAAG 
GCGTCGTGCT AAATCATTGG TAGACAC7VAA AAGCGCTTTA GTTTCTACAA CATTACTTTC 
AGGTACACGT ATGTGATAAT AACCATTTGC CAACAATTGT GTTGCATAAG TAACTTCTTG 
AATGGGACGG ATTAATGTAC GCTTAAAACT ACGGCTTGCA AAATACAGAC AAATGAGTAC 
AACTAAACAT GTCAAAATAA GATATTTCCA CAACGTCCAA TGCATTTCTG TAATATCGTT 
2S ATTGTAACCT TTAATCCATA CATGATAACC GTTAACCTTC TTATtAAAAA TAAAAACGTC 

CCTTT 

(2) INFORMATION FOR SEQ ID NO: 466: 



75 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1016 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



3S 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 

TTTGGTTAGC CCATAAAAGA AAAAAACAAG TAGTCATTTT TAAACAACAT ATCAAGTCTA 60 

CCCAAGAAAT ACGTTTTGAC AAAGCGAAAG TGCTTGAACA CAAAGATGAA ATAGCAAATT 120 

TTATTTCTTT CGAACCACAA AGTTTTGAAT TTTATTATTT TACAGAATCG GAATTTTCAG 180 

AAGAACAATT AAATGAAGTT TCGCCAATTA GAATTAAATT CAATGTTATA AGACACACAA 240 

AAGATTTGaT AAAGCATATG CCGAATATAT TTTTGGcTAG ACTTATTTCA GAAGATAATG 3 00 

ATAAAAAGAC ATATATGTTC TATAAACGCA AAGTATTAAC CGATAACTTT TTAGATAAAT 360 

so ATATGCAGAA ATTTTCACCG GCAACATACA CAATAATATT TGTAAATXSTC TTAATATGGT 420 
TATGTATGAT TTTATATTTA AATAATTTTT CGGATGTAAA ATTATTAGAT GTTX3GCGGGT 



45 



480 



55 



BNSOOQD: <EP 0786519A2J_> 



1382 



10 



IS 



20 



30 



35 



40 



45 



EP0 786 519 A2 

ATTTTAGTTT TGAACATATA CTTATGAATA TGCTTTCATT ATTTATTTTT GGTAAAATAG 600 

TCGAAGCAAT TATTGGTTCA TGGCGGATGT TAACTGTATA CTTTATTGCA GGGTTGTTTG 660 

GAAACTTTGT ATCACTATCA TTTAATACGA CTACAAnTC AGTTGGGGCT AGTGGTGCTA 720 

TATTTGGTCT GATTGGATCA ATTTTTGCGA TGATGTATGT TTCAAAAACA TTTAACAAAA 780 

AAATGTTAGG ACAGTTATTA ATTGCATTAG TGATATTAGT TGGTGTTTCT CTGTTTATGT 84 0 

CAAATATAAA TATTGTGGCG CATATTGGAG GATTCATTGG TGGTTTATTA ATAACTTTAA 900 

TTGGCTATTA CTATAAAGTG AATCGtAATA TTTTTTGGaT TTaCtAATTG GTATGCtTGt 960 

tATATTTAwT GCACyTCmAA TTAgAtTTTT ACmATTAAAG AAGATAATAw TTATAA 1016 
(2) INFORMATION FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 06 base pairs 

(B) TYPE: nucleic acid 

(C) STRA^^:>EDNESS : double 

(D) TOPOLOGY: linear 



25 - (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 

AACTTTAAAT TTAGACATCT TTAAAACCTC TCTTAAACCA TGCCTATATC TCAAGATGAT 60 

ATTTCAAATG AACAATACTA TTGCTTGAGA CCATTAATGA ATGATCATAA ATATTTCTTT 120 

CTATAAAATT AGCTTTCCAA TAACTGTGTT GTTGCATAAT ATCATTCACA AGTACACCAT 180 

TTTCGGAAGT ATGATTATCT TTATCTATAC TTAAAACAAT TTGTTTAGTT TTAGCATGGC 24 0 

TAAATTGTTG AAGACCCTTA CACGATAAAC GTATAGCGTC TGAATTCTCA TTTAACAATG 300 

CGGCTGGgCA AACAAATGAC ACATTGTACT TCATGTTTGA ACtTCGTTAC AATCaTCGTG 360 

kCatTTTGat AAATAACAAT CCCTCGTAAT kGATTAAGTA TATTAT 4 06 
(2) INFORMATION FOR SEQ ID NO: 468: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO : 468: 
SO AGATAATACA CTTGAAGTTG GAATGGTTTG TGACGGTTAT TTAATGCGAA TTGAAAACTT 60 

AACACCATCA AATTTCTTCA ACTCAGCAAG TGAAGATACG ATTACTAAAA TTAAATTAAA 120 
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AGGTACAGCG 


TTAAAACTAA 


GAGAAGCCAT 


CAATTATGAT 


GAAATGGTTA 


X X WXAO^iX Avj 


240 




TATGACGTAG 


TTCCTAATTA 


TGCkAAAAGG 


GATTGATGAA 


AAACTGAAGG 


X X X X X ^ 


3 00 


5 


AATCCCTTTT 


ATTTTAGGGG 


AATTGAATAG 


ATAGTTTTAA 


ACTATACGAA 


TTATT A AT AT 

X x^%x XAnXnX 


o ^ 




TTGAGATTTA 


ATTGAAATAA 


GTTTTAAAAA 


TTGGAGGAGA 


TAGATTAAGC 


GAAGTCATTT 

NVW^W X XXX 


420 


10 


AAAGGTGAAG 


TTAAGTGTAT 


TCACAAAAAn 


TAGCCACACT 


CATATGACAT 


CGGATYlAftTfS 

w^TVJX^X VJAv X Vi9 


>l O A 

4 8 0 


TGGCTTAAGG 


ATCTATGGGG 


GGAGGAAnCC 


ATAGATGTTT 


ACTTTGATAC5 




54 0 




ATATCAAAGT 


ATGCGATTAT 


TTATAGCTTG 


ATGCAAAAGT 


GGTATGCCTA 


TTAAAAriTTA 
X xrUvvVVjl Xx\ 


^ A A 

bOO 


IS 


CTGCACATAG 


CTTITAATAT 


TCCGTTCAAA 


GGAAAGGGGC 


ATACAATTGA 


A^-ATlX V* XVjXa 


t ^ A 
DO 0 




ATAGTACTTT 


TAACCAGCTA 


TGCTAAAAGT 


CTAGTAGGGA 


GAACAGTTC'jT 




720 




AAGAACCTCT 


AACTTCGTTA 


GTACGATTAA 


GAAAAGCTTT 


TTAGTTACTA 


X W X iv\ X>iV_-rVn 


780 


20 


TTTATTGACG 


CGCGTGAATC 


TCi"X"i"l'ATAA 


GAGTGTGTAG 


GGAATGGC^T 


Tf^T AT A A fiTT 


840 




GTATTAGAAG 


AACTTCTAAC 


GCATCTCTGT 


GGTTAAAAGA 


GATG AAf^n^ A 


A rvi A f a r* TTT 


900 




aATTAAAACT 


GCATAAGAAC 


TTCTAGCTTT 


TCTCTCTCGT 


TPAA Af3Af^a A 


Vat-AvjC i\7 TTC 


960 


25 


GCAGTTTAAT 


CAAAACCACA 


TAAAGCTTTT 


AACTTTACTC 


A X X VJX% XXX xi^^ 


ArS A f^TV^ A T & & 


1020 




ATGTTTACAG 


TTTAATTAAA 


ACTGCATAAG 


AACTTCTAGC 


X XX xwx^xxx 


V-v»X XL.>\A^ALt 


1 A O A 




AAGCAGCTGT 


TCGCAgTTTA ATCAAAACCA CATAAAGCTT 


TTAACTTTAO 

X X XXX 


X ^X X XuAx 1 1 


1 1 J» A 

X 140 


30 


AAAGAGTGAC 


AAATGTTTAC 


AGTTTAATTA 


AAACTGCATA 


AG AA C^TTC^H. 


wV- XXll^lt*! 


1200 




TTCGTTCAAA 


GAGAAGTTCT 


AATACCACCA 


TATCGTGCGA 


TCGGGAACGG 


TATATATATT ' 


1260 


35 


AATAGGAGGG 


TAATATATAT 


TTAACGCACG 


ATATGGGACT 


ATTAGCCTTC 


GACTTTGTTA 


1320 


TGTTGATGTG 


TGGCCTAAAA 


TATTGGAGAT 


ACCAATATTT 


TAGGTTGCAT 


CAACATCA 


1378 




(2) INFORMATION FOR SEQ ID NO: 469: 









(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 4171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



45 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 
TCCCAACCAA TAATCGTGGC AAAAATACGG ATATTGGTAT GGCTTAACAA ATTGCAAATA 
so TCGTTTAATC ATACATCCCC CCTAATCTAT TGCCCTATCC TATTCaTAAG CATAAAAATG 

AATAGAGGTT GGATACATAA TTTGTAGATG TAAATTCTTC TTACAATTTA CATTTTTAAA 
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TTATATTCTA 


TTCAATTTAA 


TCTATGGATA 


CTGTGTCCCC 


ACACGACAGC 


AAAAGTTATC 


300 


ATACTTCTTT 


ACATCACTAA 


GTCAATATAA 


ATGATTTAAT 


CAGTATTTAC 


ACTTTATTTG 


360 


CTTAATACTG 


TCTAATTTrr 


TTGTAACGTT 


CTTTCCAAAC 


TTTGATAAAA 


TCTGGCGCGA 


420 


ATGGGCCCTT 


CTTCTGTTCT 


ATCCATTGTT 


GAAGAATGTC 


CACXSTTGCGT 


CTTAAAATAA 


480 


TATCAATATC 


ATGCGGATAA 


TTCATTTGAT 


TCATATGTTG 


CTCATATTCA 


TCTTCATCTA 


540 


ATAAATGATA 


CnTCCGTTT 


GGATATACTT 


TAATATCTAA 


ATCATAGTCT 


ATATATTTTA 


600 


ATGCCTCTTC 


ATCACAAACA 


AATGGTGATG 


ACAAATTGCA 


ATAGTAATAA 


ATTCCATCTT 


660 


CTCTAAACAT 


GCAGATAACA 


TTaAACCAAT 


ATTCTGAGTG 


AAAGTaAACA 


ATTGCCGGTT 


720 


CACGTGTTAT 


CGAAGTTCTT 


CCGTCACTTT 


CAGTCACTAA 


CGTATGATCA 


TTTCCACCAA 


780 


TGACAACATG 


ATCAGTACCC 


TTTAATATTG 


TTGTTTCAGA 


CCAAACGCGA 


TGAATCTTAC 


840 


CATCATGTTT 


ATAACTCTGA 


ATTTTAATGT 


TTTCCCCTTC 


TTTAGGTATG 


GATTCTCTGA 


900 


CCATACTCCA 


CACCACCTTC 


TGTTAATTTA 


ACCATTATAA 


ATTATAGCAT 


ATTTCAGAAA 


960 


TAGTATTATA 


TAAATACATA 


'rrrriACGAA 


ATAAGATTTT 


ACTACTTAAT 


AATTAAACTC 


1020 


GGTAATATTG 


CTAAGTACTA 


CAACAGAGAT 


TTACATGTCC 


CATTTAAAGT 


ATATAAAATC 


1080 


ATCACTTTTA 


TATATCAACA 


CTTTAACTTT 


TTGACATTGT 


TATTCTATGA 


GATTTAAAGA 


1140 


TATCATTTAT 


ACTTTTTAAA 


ATTAATGTCA 


CTATGTTTTC 


CGATAATATT 


ACCAATCATC 


1200 


GAATGTTACC 


CATTTATAAA 


TTGATAAATs 


TTTGACATAG 


GTACAGGGAA 


TGTATATTGA 


1260 


TCTCGATCAC 


TTAAATCAAA 


CCAAATCATG 


TCATCTGGTA 


ATGTTTCAAT 


GTTAATTGCT 


1320 


CCTGAAACGG 


CGTATACTTT 


AATCTTCCAT 


GTTAAATGAG 


TAAATTGATG 


CTTCAACTCA 


1380 


AAAATAGGTG 


TTTCTACTGG 


TTGAATGTCA 


TGACCGATTT 


TTTCAGTCAT 


TTTACGTCTA 


1440 


GCATGCTCAC 


TTTCAAACAT 


AGGAAATTGC 


CACATACCAT 


GCAATAATTT 


TTCGCTACGC 


1500 


TTTTGCAACA 


GATATTGACC 


TTGATTATTT 


CTAATTAAAA 


AGACGGATTG 


CTCAATTACT 


1560 


rrrrrACTTA 


CATTTTTAGA 


TTTAACAGGT 


AACTTTTCAA 


ATGTACCTTT 


ATCAAATGCC 


1620 


TCACAGTTTT 


CTTGAACTGG 


ACAAAATAAG 


CATAATGGAT 


TTTTTGGTGT 


ACAAATTAAC 


1680 


GCCCCTAATT 


CCATCATAGC 


TTGATTAAAC 


GTTCCAGCTT 


CTGTAGTAAC 


ATACGGTAAT 


1740 


AATTCTTGTT 


CGTACGATTT 


CCTCGTCGAT 


TGTAATTTAA 


TATCTCGATA 


GTCATCATTC 


1800 


AATCTAGACC 


ATACTCGAAA 


AACATTTCCG 


TCTACAGTTG 


CTAGTGGTAC 


ATTATATGCA 


1860 


ATGCTCATTA 


CTGCAGCTTG 


TGTGTATGGG 


CCAACACCTT 


TTAACGCTTT 


AAATTGATCA 


1920 


GGATCTTTGG 


GAACTAAGCC 


TTCATATTTA 


TCATGAACTT 


CTTTAATCGC 


C6TATGAAAA 


1980 
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GCTTGaCTCA AAACTTCCAC AGTTGGAAAT CGTTCAACAA AACGATGATA ATAGTCAATA 
ACTGTTTTAA CTTGTGTCTG TTGTAACATG ACCTCACTTA ACCAAATATA GTACGGATTG 
GTCGTTTGTC GCCATGGCAT TTCTCTTTGA TTTTCATCAA ACCAGTGTAT CAAATTTTCT 
TTAAAACTAG ACTGCTGATA CATTTATAAA ACCCITTCCT CACCAAAATT AATTGTCTTT 
ACTCATAATG TTTTTATTGT ACATTAAAAT CATGGTTAGT ATGTAAGTTA ATTTAGTTAT 

TTGCGAAATT GGATTATAAT AGTATATATA ATATTATGAA ATGAGTGAAC TGATATGGAC 24 00 

ACTGCAACAC ATATCGCAAT TGGGGTGGGC CTTACAGCAC TTGCAACTCA AGATCCAGCA 24 6 0 

ATGGCTTCTA CGTTTGGTGC AACAGCTACA ACCCTTATCG TTGGTTCATT AATTCCTGAT 2520 

GGGGATACTG TTCTTAAATT AAAGGACAAT GCAACATATA TTTCGCATCA TAGAGGTATC 2580 

ACGCATTCCA TCCCTTTCAC AATACTATGG CCAATTTTAA TTACATTTTT AATATTCACG 2640 

20 TTCTITAGTG GAACCAACCC ATTTCATGTA TGGATGTGGG CTCAGCTCGC AGTATTTTTA 27 00 

CATGTCTTTG TAGATATATT CAATTCTTAT GGTACACAAG CGCTTAGACC TATCACAAAC 2760 

AAATGGATTC AATTAAGTGT GATTAACACA TTTGACCCTA TTATTTTCAC AGTTCTTTGT 2820 

25 ATTGGTATTG TATTATGGGT TATAGGCTTG CATCCATTTG CAGTCTTCTT TCCTATAATC 2 8 80 

GCTTTACTAA TCATTTATTA CATGATTCXST TTTAAAATGA GAGCCGTAAT TAAGCAACAA 294 0 

GCTTTAAAAG CAATTCAACA AGAGCATCAC CCTGTTAAAG TATTTGTTGC GCCAACAATA 3000 

AAATTTATGG AATGGCGTGT CGCGATACAA ACTGATGCAC ATGACTATGT TGGAAAAGCA 3060 

TATGGTAGGA ATGTGGTGTT TAGTGATAAA GTGGAACGTC AAACATTATC AACAGACTCC 3120 

ATTTTATGGA AAGTCAAAGG TAATAAAGAT ATACGTACTT TTTTAAACTT TTCATCAATC 3180 

TATCGTTGGC AAACAACAAC GTTAGCAGAT GGTTCTACTG AAATTCGTTT GATTGATTTG 3240 

CGTTATTTAA AAAATGATCA TTATTCATTT GTGGCAATTG CACATGTAAC AAACGATAAT 3300 

GTCATAGACC ACTCTTATAT TGGCTGGGTA TTTACAGAAG ATAAGTTACA ACGTAAACTG 3360 

TATGCTAAAT AATTTCAAGT TATTATTCAC TAAAGTTAAT CTATAAAAAA TGAACAACCG 3420 

GGCAGAATGA AAATCAAAAC GATTTTTACT CTGTCCGGTT TTTTAATGTA AAACTATGAA 3480 

TGCTTTTACA AAATCTAAAA TTTATATTGT TGCTAACAAA CTACCTTTAA TGACTCGAAA 3 54 0 

TATCAAAATC AGTATAGGAA AACAATATCT AGATGATATT CTAATTGTTT CTGATTCTCA 36 00 

CAGATTAATT TACACAACAG GTCAGCTAAA CATCATGAAG AAGTATCCGC CTCGTCTGTA 3660 

SO CTATCATTTG AAACATCCTG TTGATTATCa GTTTGTGTCA CTTTAGATGT TTTATTaTAA 3 720 

ATTGCGTGTG TCGTATACCT TGCTAATACG AAATTTACTG CAGCTATTAA ACATAAGAAT 3780 
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TGTGCTATAC CATTAACAAT GTAATACATT GGATTTAGCA TTAGGATGTG ATTGATAAAT 3900 

ACATGATTTG GATTTGGTAT GAAAATAATT CKSTAACAATA AGAAACACAA TACACAAACC 3 960 

^ CCATAAAATA TGATATTTAT TTTTTCAGwT AACAGTCGAA TAAGACCAAA AGTAACGGAT 4 020 

ATTAATCCTA CAAAAATAGT TGCCATCACA ATAAAATAGA ATAGCGCTAT ATATGATGTT 4080 

TCGAAGTTTA CTGGTTTAAC CAATGCACTA ATCATCGTCA AAATGACTAG CATAATAAAA 414 0 

10 

CTTAAAATAG ACATAATAAC TACTGGCXJTC G 4171 
(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 
TGGTTGAAGT AGCAGTTAAT TCAAAATCTG CAACAGTTTC AGCAGAATaG GGGCTTTCAA 60 
25 AATAAtCAAA GGAGAATAAT TTATGACTAA AACTTTAAAG GTTTATAAAG GAGACGACGT 120 

CGTAGCTTCT GAACAAGGTG AAGGCAAAGT GTCAGTAACT TTATCTAATT TAGAAGCGGA 180 
TACAACTTAT CCAAAAGGTA CTTACCAAGT GGCATGGGAA GAAAATGGTA AAGAATCTAG 24 0 

TAAAGTTGAT GTACCTCAAT TCAAAACCAA TCCAATTCTA GTCTCAGGCG TATCATTTAC 300 
ACCAGAAACT AAATCAATTA TGGTAAATAC CGATGACAAT GTTGAGCCAA ACATTGCACC 360 
AAGCACAGCA ACGAATAAAA TATTGAAATA TACAAGTGAA CATCCAGAAT TT G TTACTGT 420 

35 

AGATGAAAAT ACAGGAGCAA TTCACGGTGT AGCTGAAGGT ACTTCAGTAA TCACTGCTAC 4 80 

GTCTACTGAT GGAAGCGATA AGTCAGGACA AATTTCAGTG ACAGTAACAA ACGGATAGGG 540 
ATTTAAGGCG CAGTATATCT GCGTCTTTTT TATTTGAATA AAAGGAGCTA ATACAATGAT 600 

40 

TAAATTTGAA ATTAAAGATC GTAAAACAGG AAAAACAGAG AGCTATACAA AAGAAGATGT 660 
AACAATGGGC GAACAGAAAA ATGCTATGAG TATTTAGAAT TAGTAAATCA AGAGAATAAA 720 
AAAGAAGCAC CTAACGCAAC AAAAATGAGA CAAAAAGAGC GACAGTTATT AGTAGATTTA 780 

45 

TTTAAAGATG AAGGATTGAC TGAAGAAGAT GTTCTGAACA AGATGAGTAC TAAAACTTAT 84 0 

ACAAAAGCCT TACAAGATAT ATTTCGAGAA ATCAATGGTG AAGATGAAGA AGATTCAGAA 900 
50 ACTGAACCAG AAGAGATGGG AAAGACAGAA GAACAATCTC AATAAAAGAC ATTTTATCGA 960 

ACATTAAGAA AATACAACGT TTCTGTATGG AGCAGTATGG GTGGACATTA ACTGAAGTCA 1020 
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AAGAAAAACA AAGTGAACAA 


AAAGTCATTA 


CAGGTACGGA 


1 i 1 AAG AAAA 


CrT'l''i''ltiGAA 


1140 




GCTAGAAAGG 


AGGTTAATAT 


GAATGAAAAA GTAGAAGGCA 




GCTGAAAn^A 


1200 


5 


GACCATTTAG 


GTGTCCAAGA 


AGGCATGAAA GGTTTAAAGC 




TGTTGTTAAT 


1260 




AGTGAAATGA 


AAGCTAATCT 


GTCAGCATTT 


GATAAGTCTG 




f* TV TV TV TV TV ^ IV 

GGAAAAATAT 


1320 


70 


CAGGCGAGAA 


TTAAGGGGTT 


AAATGATAGG 


CTTAAAGTTC 




GTATTCTCAA 


1380 


GTAGAAGATG 


AGCTTAAACA 


AGTTAACGCT 


AATTACCAAA 




TV m^^^n TV Ti « « 

CAGTGTAAAA 


1440 




GATGTTGAGA 


AAGCATATTT 


AAAGTTAGTA 


GAAGCCAATA 


& & & 2V » ?v 


ATTAGCTCTT 


1500 


75 


GATAAATCTA 


AAGAAGCCTT 


AAAATCATCG 


AATACAGAAC 


'PT'Ii & 2^ H Jv Ji/^^ 


TGAAAATCAA 


1560 




TATAAACGTA 


CAAATCAACG 


TAAACAAGAT 


GCGTATCAAA 


AAUilAAACA 


GTTGAGAGAT 


1620 




GCAGAACAAA AGCTTaAGAA 


TAGTAACCAA 


GCTACTACTG 


UACAACTAAA 


AAGAGCAAGT 


1680 


20 


GACGCrtACA 


GAAGCAGTCC 


GCTAAGCATA 


AAGCACTTGT 


i vaAACAAT AT 


AAACAAGAAG 


1740 




GCAATCAAGT 


TCAAAAACTA 


AAAGTGCAAA 


ATGACAATCT 


1 rCAAAATCA 


AATGATAAAA 


1800 




TTGAAAGTTC 


TTACGCTAAA 


ACTAATACTA 


AATTAAAGCA 




GAATTTAATG 


1860 


25 


ATTTAAACAA 


TACTATTAAG 


AATCATAGCG 


CTAATGTCGC 


AAAAGCTGAA 


ACAGCTGTTA 


1920 




ATAAAGAAAA AGCTGCTTTA 


AATAATTTGG 


AGCGTTCAAT 


AtoA TAAAGCT 


TCATCCGAAA 


1980 




TGAAGACTTT 


TAACAAAGAA 


CAAATGATAG 


CTCAAAGTCA 


111 CUwTAAA 


CTTGCAAGTC 


2040 


30 


AAGCGGATGT 


CATGTCAAAG 


AAATTTAGTT 


CTATTGGAGA 


wlftAATGACT 


TCCCTGGGAC 


2100 




GTACAATGAC 


GATGGGCGTA 


TCTACACCGA 


TTACTTTAGG 


1 1 X ALs^j 1 udA 


^ 1^ ^TT^niV TV TV Tk TV 

GCATTAAAAA 


2160 


35 


CGAGTGCAGA 


CTTTGAAGGG 


CAAATGTCTC 


GAGTTGGAGC 


GATTGCACAA 


TV TV TV T^ 

GCAAGCAGTA 


2220 


AAGACTTAAA 


AAGCATGTCT 


AATCAAGCGG 


TTGACTTAGG 


AGCTAAAACA 


IV / Mfvm TV ]v fV 

AGTAAAAGTG 


2280 




CTAACGaAGT 


TGCTAAAGGT 


ATGGAAGAAT 


TGGCAGCTTT 


AGGCTTTAAT 


^^^^Ik IK IV /^1V Tk TV 

GCCIAAACAAA 


2340 


40 


CAATGGAGGC 


TATGCCAGGT 


GTTATCAGCG 


CAgcAGaAGC AAGTGGTGCA 


GAAATGGCTA 


2400 




CAACTGCAAC 


TGTAATGGCT 


TCAGCGATTA 


ACTCTTTCGG 


TTTAAAAGCA 


TwTGATGCAA 


2460 




ATCATGTTGC 


TGATTTACTT 


GCGAGATCAG 


CAAATGATAG 


TGCTGCAGAT 


TV • 1 K 1 »/^ TV TV «TT TV *TV TV 

ATTCAATATA 


2520 


45 


TGGGAGATGC 


ATTAAAATAT 


GCAGGTACTC 


CAGCAAAAGC 


ATTAGGAGTT 


^P/^ n M *TV IV TV 


2580 




ACACTTCTGC 


AGCAATTGAA 


GTTTTATCTA 


ACTCAGGTTT 


AGAGGGGTCT 


CAAGCAGGTA 


2640 




CTGCATTAAG 


AGCTTCGTTT 


ATTAGGCTAG 


CTAATCCAAG 


TAAAAGTACA 


GCTAAGGAAA 


2700 


SO 


TGAAAAAATT 


AGGTATTCAT 


TTGTCTGATG 


CTAAAGGTGA 


GTTTGTTGGA 


ATGGGCX3AAT 


2760 




TGATTAGACA 


GTTCCAAGAT 


AACATGAAAG 


GCATGACGAG 


AGAACAAAAA 


TTAGCAACAG 


2B20 
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CAGATAAAAT 


TAATAGCTAT 


AGCAAATCAT 


TGAAGAACTC 


TAATGGTGAA 


AGTAAAAAAG 


2940 




CAGCTGATTT 


GATGAAAGAT 


AACCTCAAAG 


GTGCTCTGGA 


ACAATTAGGT 


GG CGCTTTTG 


3000 


5 


AATCX5TTAGC 


AATTGAAGTT 


GGTAAAGATT 


TAACGCCTAT 


GATTAGAGCA 


GGTGCGGAAG 


3060 




GATTAACAAA 


ATTAGTTGAT 


GGATTTACAC 


ATCTTCCTGG 


TTGGGTTAGA 


AAGGCTTCGG 


3120 


10 


TAGGCTTAGC 


AATTTTTGGT 


GCATCTATTG 


GTCCTGCTGT 


TCTTGCTGGT 


GGCTTATTAA 


3180 


TACGTGCAGT 


TGGGAGCGCG 


GCTAAAGGCT 


ATGCATCATT 


AAATAGACGC 


ATTGCTGAAA 


3240 




ATACAATTCT 


TTCTAATACC 


AATTCAAAAG 


CAATGAAATC 


TTTAGGTCTT 


CAAACATTAT 


3300 


IS 


TTCTTGGTTC 


TACAACAGGA 


AAAACGTCAA 


AAGGCTTTAA 


AGGATTAGCC 


GGAGCTATGT 


3360 




TGTTTAATTT 


AAAACCTATA 


AATGTTTTGA 


AAAATTCTGC 


AAAGCTAGCA 


ATTTTACCGT 


3420 




TCAAACTTTT 


GAAAAACGGT 


TTAGGATTAG 


CCGCAAAATC 


CTTATTTGCA 


GTAAGTGGAG 


3480 


20 


GCGCAAGATT 


TGCTGGTGTA 


GCCTTAAAGT 


TTTTAACAGG 


ACCTATAGGT 


GCTACAATAA 


3540 




CTGCTATTAC 


AATTGCATAT 


AAAGTTTTTA 


AAACCGCATA 


TGATCGTGTG 


GAATGGTTCA 


3600 




GAAACGGTAT 


TAACGGTTTA 


GGAGAAACTA 


TAAAGTTTTT 


TGGTGGCAAA 


ATTATTGGCG 


3660 


25 


GTGCTGTTAG 


GAAGCTAGGA 


GAGTTTAAAA 


ATTATCTTGG 


AAGTATAGCfc 


AAAAGCTTGA 


3720 




AAGAAAAGTT 


TTCAAAGGAT 


ATGAAAGATG 


GTTATAAATC 


TTTGAGTGAC 


GATGACCTTC 


3780 




TGAAAGTAGG 


AGTCAACAAG 


TTTAAAGGAT 


TTATGCAAAC 


CATGGGCACA 


GCTTCTAAAA 


3840 


30 


AAGCATCTGA 


TACTGTAAAA 


GTGTTGGGGA 


AAQGTGTTTC 


AAAAGAAACA 


GAAAAAGCTT 


3900 




TAGAAAAATA 


CGTACACTAT 


TCTGAAGAGA 


ACAACAGAAT 


CATGGAAAAA 


GTACGTTTAA 


3960 




ACTCGGGTCA 


AATAACAGAA 


GACAAAGCAA 


AAAAACTTTT 


GAAAATTGAA 


GCGGATTTAT 


4020 


CTAATAACCT 


TATAGCTGAA 


ATAGAAAAAA 


GAAATAAAAA 


GGAACTCGAA 


AAAACTCAAG 


4080 




AACTTATTGA 


TAAGTATAGT 


GCATTCGATG 


AACAAGAAAA 


GCAAAACATT 


TTAACTAGAA 


4140 


40 


CTAAAGAAAA 


AAATGACTTG 


CGAATTAAAA 


AAGAGCAAGA 


ACTCAATCAG 


AAAATCAAAG 


4200 




AATTGAAAGA 


AAAAGCTTTA 


AGTGATGGTC 


AGATTTCAGA 


AAATGAAAGA 


AAAGAAATTG' 


4260 




AAAAGCTTGA 


AAATCAAAGA 


CGTGACATCA 


CIXJITAAAGA 


ATTGAGTAAG 


ACTGAAAAAG 


4320 


45 


AGCAAGAGCG 


TATTTTAGTA 


AGAATGCAAA 


GAAAGAGAAA 


TGCTTATTCA 


ATAGACGAAG 


4380 




CGAGCAAAGC 


AAT7AAAGAA 


GCAGAAAAAG 


CAAGAAAAGC 


AAGAAAAAAA 


GAAGTGGATA 


4440 




AGCAGTATGA 


AGATGATGTC 


ATTGCTATAA 


AAAATAACGT 


CAACCTTTCT 


AAGTCTGAAA 


4500 


SO 


AAGATAAATT 


GTTAGCTATT 


GCTGATCAAA 


GACATAAGGA 


TGAAGTAAGA 


AAGGCAAAAT 


4560 




CTAAAAAAGA 


TGCTGTAGTA 


GACGTTGTTA 


AAAfVGCAAAA 


TAAAGATATT 


GATAAAGAAA 


4620 
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GTTGGTGGTC TAACTTTAGA 


GAAGACCAAA AGAAGAAAAG 


' TGATAAATAC 


GCTAAAGAAC 


4740 




AAGAAGAAAC 


' AGCTCGTAGA 


AACAGAGAAA ATATAAAGAA ATGGTTTGGA AATGCTTGGG 


4800 


5 


ACGGCGTAAA 


AACTAAAACT 


GGTGAAGCCT TTAGTAAAAT GGGCAGAAAT 


GCTAATCATT 


4860 




TTGGCGGCGA 


AATGAAAAAA 


ATGTGGAGTG GAATCAAAGG 


AATTCCAAGC 


AAATTAAGTT 


4920 


10 


CAAGTTGGAG 


CTCAGCCAAA 


AGTTCTGTAG GATATCACAC 


TAAGGCTATA 


G CTAATAf^TA 


4980 


CTGGTAAATG 


GTTTGGAAAA 


GCTTGGCAAT CTGTTAAATC 


GACTACAGGA 


AGTATTTAPA 


5040 




ATCAAACTAA 


GCAAAAGTAT 


TCAGATGCCT CAGATAAAGC 


TTGGGCGCAT 


TCAAAATCT*A 


5100 


IS 


TTTGGAGAGG 


CACATCAAAA 


TGGTTTAGCA ACGCATATAA 


AAGTGCAAAG 




5160 




TAGATATGGC 


TAATAAATCG 


CGCTCGAAAT GGGATAATAT 


TTCTAGTACA 


GCATGGTCGA 


5220 




ATGCAAAATC 


CGTTTGGAAA 


GGAACATCGA AATGGTTTAG 


TAACTCATAC 


AAATCTTTAA 


5280 


20 


AAGG'ITGGAC 


TGGGGATATG 


TATTCAAGAG CCCACGATCG 


TTTTGATGCA 


ATTTCAAGTT 


5340 




CGGCATGGTC 


TAACGCTAAA 


TCAGTATTTA ATGGTTTTAG 


AAAATGGCTA 


TCAAAAACAT 


5400 




ATGATTGGAT 


TAGAGATATT 


GGTAAAGACA TGGGAAGAGC 


IXSCGGCTGAT 


TTAGGTAAAA 


5460. 


2S 


ATGTTGCTAA 


TAAAGCTATT 


GGC^TTTGA ATAGCATGAT 


TGGCGGTAJT AATAAAATAT 


5520 




CTAAAGCCAT 


TACTGATAAA 


AATCTCATCA AGCCAATACC 


TACATTGTCT 


ACTGGTACTT 


5580 




TAGCAGGAAA 


GGGTGTAGCT 


ACCGATAATT CAGGAGCATT 


AACGCAACCG 


ACATTTGCTG 


5640 


30 


TATTAAATGA 


TAGAGGTTCT 


GGAAACGCCC CAGGTGGTGG 


AGTTCAAGAA 


ATAATTCACA 


5700 




GGGCTGACGG 


AACATTCCAT 


GCACCCCAAG GACGAGATGT 


GGTTGTTCCA 


CTAGGAGTTG 


5760 


35 


GAGATAGTGT 


AATAAATGCC 


AATGACACTC TGAAGTTACA 


GCGGATGGGT 


GTTTTGCCAA 


5820 


AATTCCATGG 


TGGTACGAAA AAGAAAAAAT GGATGGAACA 


AGTTACTGAA 


AATCTTGGTA 


5880 




AAAAAGCAGG 


GGACTTCGGT 


TCTAAAGCTA AAAACACAGC 


TCATAATATC 


AAAAAAGGTG 


5940 


40 


CAGAAGAAAT 


GGTTGAAGCG 


GCAGGC?GATA AAATCAAAGA 


TGGTGCATCT 


TGGTTAGGCG 


6000 




ATAAAATCGG 


CGATGTGTGG 


GATTATGTAC AACATCCAGG GAAACTAGTA 


AATAAAGTAA 


6060 




TGTCAGGTTT 


AAATATTAAT 


TTTGGAGGCG GACTAACGCT 


ACAGTAAAAA 


TTGCTAAAGG 


6120 


45 


CGCGTACTCA 


TTGCTCAAAA 


AGAAATTAGT AGACAAAGTA 


AAATCGTGGT 


TTGAAGATTT 


6180 




TGGTGGCGGA 


GGCGATGGAA 


GCTATCTATT TGACCATCCA 


ATTTGGCAAA 


GGTTTGGGAG 


6240 




TTACACAGGT 


GGACTTAACT 


TTAATGGCGG TCGTCACTAT 


GGTATCGACT 


TTGGTATGCC 


6300 


SO 


TACAGGAACG 


AACATTTATG 


CTGTTAAAGG CGGTATAGCT 


GATAAAGTAT 


GGACTGATTA 


6360 




CGGTGGCGGT 


AATTCTATAC 


AAATTAAGAC CXK3TGCTAAC 


GAATGGAACT 


GGTATATGCA 


6420 
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ATCAGGTGCT ACAGGT7UVTT TCGTTAGAGG AGCACACTTA CATTTCCAAT TGATGCAAGG 6 54 0 

GTCGCATCCA GGGAATGATA CAGCTAAAGA TCCAGAAAAA TGGTTGAAGT CACTTAAAGG 6600 

TAGTGGCGTT CGAAGTGGTT CAGGTGTTAA TAAGGCTGCA TCTGCTTGGG CAGGCGATAT 6660 

ACGTCGTGCA GCAAAACGAA TGGGTGTTAA TGTTACTTCG GCTGACGTAG GAAATATCAT 6720 

TAGCTTGATT CAACACGAAT CAGGAGGAAA TGCAGGTATA ACTCAATCTA GTTCGCTTAG 678 0 

AGACATCAAC GTTTTACAGG GCAATCCAGC AAAAGGATTG CTTCAATATA TCCCACAAAC 684 0 

ATTTAGACAT TATGCTGTTA GAGGTCACAA CAATATATAT AGTGGTTACG ATCAGTTATT 6 900 

AGCGTTCTTT AACAACAGAT ATTGGCGCTC ACAGTTTAAC CCAAGAGGTG GTTGGTCTCC 6960 

AAGTGGTCCA AGAAGATATG CGAATGGTGG TTTGATTACA AAGCATCAAC TTGCTGAAGT 7020 

GGGTGAAGGA GATAAACAGG AGATGGTTAT CCCTTTAACT AGACGTAAAC GAGCAATTCA 70 80 

20 ATTMiCTGAPL CAGGTTATGC GCATCATCGG TATGGATGGC AAGCCAAATA ACATCACTGT 714 0 

AAATAATGAT ACTTCTACAG TTGAAAAATT GTTGAAACAA ATTGTTATGT TAAGTGATAA 72 00 

AGGAAATAAA TTAACAGATG CGTTGATTCA AACTGTTTCT TCTCAGGATA ATAACTTAGG 72 60 

TTCTAATGAT GCAATTAGAG GTTTAGAAAA AATATTGTCA AAACAAAGTG GGCATAGAGC 73 2 0 

AAATGCAAAT AATTATATGG GAGGTTTGAC TAATTAATGC AATCTTTTGT AAAAATCATA 73 80 

GATGGTTACA AGGAAGAAGT AATAACAGAT TTTAATCAGC TTATATTTTT AGATGCAAGG 744 0 

GCTGAAAGTC CAAACACCAA TGATAACAGT GTAACTATTA ACGGAGTAGA TGGTATTTTA 7500 

CCGGGCGCAA TTAGTTTTGC GCCTTTTTCA TTAGTATTAA GGTTTGGCTA TGATGGTATA 7560 

GATGTTATAG ATTTAAATTT ATTTGAGCAT TGGTTTAGAT CTGTGTTTAA TCGCAGACAT 7620 

CCTTATTATG TTATTACTTC TCAAATGCCT GGTGTTAAAT ATGCAGTGAA TACAGCTAAT 7680 

GTTACATCfA ATTTAAAAGA TGGTTCTTCA ACTGAAATTG AAGTAAGTTT AAATGTTTAT 7740 

AAAGGGTATT CTGAATCAGT TAATTGGACC GATAGCGAOT TCTTATTCGA CTCTAATTGG 7800 

ATGTTTGAAA ATGGAATTCC TCTTGATTTC ACACCTAAAT ATACTCATAC ATCAAATCAA 7860 

TTTACTATTT GGAACGGTTC TACTGATACG ATAAATCCAC GATTCAAGCA CX3ATTTGAAA 7 920 

ATATTAATTA ATTTAAATGC GAGTGGAGGA TTTGAACTGG TTAATTATAC AACAGGTGAT 7 980 

ATTTTTAAGT ACAACAAAAG TATAGATAAA AACACTGATT TTGTTTTAGA TGGTGTGTAT 8040 

GCATATCGAG ATATAAACAG AGTGGGAATT GATACAAATA GAGGCATTAT AACATTAGCG 8100 

SO 

CCAGGTAAAA ATGAATTTAA GATTaAAGGA GACGTCAGTG ATATTAAAAC TACATTTAAG 8160 

TTTCCTTTTA TTTATAGGTA GGTGATTTAA TGGATTATCA TGATCATTTA TCAGTAATGG 8220 
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GAATGAAGCT 


AGGTACATCA 


CCTTTACAGT 


TTATAGAACT ACTCATAATA 


8340 




GTTTTGTTTT 


TGATTTATTG 


ATTTGTGAAA 


ACTTCATAAT TTATCATGGT GAAAAATACA 


B400 


5 


CAATTAAGCA 


GACAGCGCCA 


AAGGTTGAAG 


GTGATAAAGT 


TTTTATTGAA 


GTTACGGCAT 


8460 




ATCACATAAT 


GTATGAATTT 


CAAAATCACT 


CAGTGGAATC 


AAATAAGCTT 


GATGACGACA 


8520 


10 


GTAGCGAAAC 


TGGTAAAACG 


CCAGAATACT 


CTTTAGATGA 


GTACTTAAGA 


TATGGATTTG 


8580 


CAAATCAAAA 


AACGTCAGTC 


AAGATGACCT 


ATAAAATAAT 


TGGAGATTTT 


AAAAGAAAAA 


6640 




TACCAATTGA 


TGAATTAGGT 


AATAAAAATG 


GCTTAGAATA 


TTGTAAAGAA 


GCAGTAGATT 


8700 


IS 


TGTTTGGTTG 


TATTATTTAT 


CCAAATGATA 


CGGAGATATG 


'l-rriTATTCT 


CCTGAAACAT 


8760 




TCTATCAAAG 


AAGCGAAAAA 


GTAATAAGGT 


ATCAATATAA 


TACTGATACT 


GTGTCTGCTA 


8820 




CTGTCAGTAC 


GTTGGAATTA 


AGAACAGCTA 


TAAAAGTTTT 


TGGGAAAAAG 


TACACAGCCG 


8880 


20 


AGGAAAAGAA 


AAATTATAAT 


CCTATTAGAA 


CAACTGACAT 


TAAATATTCA 


AATGGTTTTA 


8940 




TAAAAGAAGG 


TACTTATCGT 


ACCGCAACAA 


TTGGGTCTAA 


AGCTACTATT 


AACTTTGATT 


9000 




GCAAGTATGG 


TAATGAAACA 


GTTAGATTTA 


CAATAAAaAA 


GGGCTCTCaA 


GGTGGAATAT 


906O 


25 


ATAAGTTGAT 


TTTAGACGGC 


AAGCaAATTA 


AGCslAATTTC 


TTGTTTTGCT 


AAGTCGGTTC 


9120 




AGTCTGArnAC 


AATAGATTTA 


ATAaAAAATA 


TTGATAAAGG 


CAAGCACGTT 


TTAGAAATGA 


9180 


30 


TATTTTTrGG 


AGArGrCCCC 


AAAAATAGAA 


TTGATATATC 


TTCAAATAAA 


AAAGCTAAGC 


9240 


CTTGTATGTA 


TGTTGGAACT 


GAAAAATCAA 


CAGTCTTAAA 


TTTAATTGCT 


GACAACTCAG 


9300 




GTCGCAATCA 


ATACAAAGCA 


ATTGTTGaCT 


ACGTCGCAGA 


TAGTGCAAAG 


CAGTTTGGGA 


9360 


35 


TTCGATATGC 


TAATACGCAA 


ACAAATGAAG 


ATATCGAAAC 


ACAGGATAAG 


CTGTTAGAAT 


9420 




TTGCAAAAAA 


GCAAATAAAT 


GATACTCCTA 


AGACTGAATT 


AGATGTTAAT 


TATATAGGTT 


9480 




ATGAAAAAAT 


AGAGCCAAGA 


GATAGCGTAT 


TCTTTGTTCA 


TGAATTAATG 


GGATATAACA 


9540 


40 


CTGAATTAAA 


GGTTGTTAAA 


CTTGATAGGT 


CACATCCATT 


TGTAAAOGCA ATAGATGAAG 


9600 




TGTCTTTCAG 


CAATGAAATA 


AAGGATATGG 


TACAAATTCA 


ACAAGCGCTT 


AACAGACGAG 


9660 




TTATTGCACA 


AGATAATAGA 


TATAACTATC 


AAGCAAATCG 


TATAAATCAT 


TTATACACTA 


9720 


45 


GTACTTTGAA 


TTCTCCTTTC 


GAGACAATGG 


ATATAGGGAG 


TGTATTAATA 


TAATGGCAAC 


9780 




AGAAGAAGTT 


AAAATCAAAG 


CGCTACTTGA 


AAACGATAAA 


C 




9821 




(2) INFORMATION FOR SEQ ID NO: 471: 









SO 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 



TTATTGTTTT 
CACTTTCTAT 
GCTAATCGAT 
TATGGTATTA 
TCAAATGCAT 
GCAACAAGTA 
CAAAATAGCC 
AACAGCACAG 
CTCATCATTT 
GTATTTTAAG 
TAAATTGTTG 
TTTAAAATAA 
TGTCATTTTA 
AATGACTATT 
ACCTGCCATA 
AGGCATACCT 
ATATGCTTTG 



CCAAATGGTC 
CtATTAATTC 
AGCGTCTCAT 
CCTTTTCATC 
ATCTGCAGTG 
ATATTATAAT 
AGTTCAAAAT 
CACTTAACCT 
TCACCTCTTA 
TTAATCGATT 
CGTTTAAGTA 
TAAATATTTC 
AAACCCGCAT 
GTTATAATCG 
GCTATGACAA 
TCATTAACTT 
TCTCGTTTAA 



TTCTCGGTTA 
AaTTTCAGCT 
TATAGGACCA 
CAAAAATAAA 
CTTATATCGA 
CACACTACTA 
ATTAAATGCG 
CATCACTTTT 
AATACCATGA 
GGGCTCCCAC 
ATGTATGTTG 
GTTTCAATAG 
GTTTGACATA 
TAAATAACAA 
TAATACTAGG 
GTTGATAAGA 
TAAACGGCaA 



TTTAACCATC 
TGACCACCCG 
ACATCGATAT 
ATGCCGTTTC 
TAGACTGGCG 
ACAATGGATA 
ATTAAAAGTA 
TTAGCGTGTG 
TTTCAATTTA 
ACCTTTAGCA 
CCAAGTGTCA 
CTCGAAATTC 
AAGATATCCT 
TAATAATTGA 
TATTAAAAAG 
TAAATCTGGT 
TATCGGCACA 



GGTTTCAATC 
CTGTATAAAG 
CAATATTTTG 
GTACACGCAA 
CTATAACAAG 
AATGGTTATC 
CAAGCGCTAT 
GTGACATAAA 
TTTGCATCTT 
ACAATAAAGC 
AATCCTATAA 
TGGATAGTAA 
TTGATCACAA 
TTCCAAAAAT 
TGTCTGTGGA 
ACTAAATTCT 
CTACCTGAAT 



CTACGTGCTT 
GGTCAAAGTT 
AATACGAAAA 
ATGGTGTTTT 
CGTGAAAACA 
TAAAATCTCC 
GGGTATCCAA 
ATTATAATCC 
CACTTCTTGC 
TAAAATTATT 
TGTGATGCGC 
TTTGTTCTTC 
ATAAACCAAT 
AACAGCCTAT 
AACCTGACAA 
GGATAATTTG 
CATTGTC 



(2) INFORMATION FOR SEQ ID NO: 472: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6806 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 
TAGGATTGAT TAATCCTTTC TTCAAAATGA TGAATGTGTA ATGTTAAATA TATAATTTCA 
GACTCACTGA TATTAACATC AAATTGTTTT TGTATCATAG TTAAAATTTT ATATGCTGTG 
TTATAGCAAA TCGGATAGTG .ATTTTTAATC ATAGACACAA AATCATCTTG TGCATGTATA 
TATTCTTTTC TTCTTAATCG GCGAATTAAA AATTGTACGT GCCTTATAAA ACGTTGGTAT 
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TTAATAACAT TATTnGATCA AGGTCATCTC ATGCATAGAT AAATCTTCTG TATTaGATGC 
AATATGTAAT GCAATAAATC CTATCTCATC TTCAGGaAAA TGTaCATCTA ATGCTGCATT 
TAACTGATCA ATCACCTGTT TAGCAATATG ATATGCATCA CTATATAACT GCATAGTTTC 
CATAACAAAT GGATTGCTAA TAACTTGATT TTGTTTTAAG CGTTTATAAG CAAATATAAT 

ATGATCCGTT AATGAAACTA CAAGTTGTTT TGAATCAACA TTCATCGCAG TATTAGAAAT SOO 

AAAATTCAAC GAATCAATAA TTACTTGTAA TACATTATCA TCAGCGATTT CAACTAAACT 660 

TTTATAATGT GCTTTTTGTT GCTCACTTTC TAATTTATAA ATTTTCTCTA TTGTAATAGT 720 

TTGGTCGTTT AACGCCATTC CCTCTTTTTT GTTAAAACCA ATACCTTTAC CGATTAAAAT 780 

AACTTCTTGA TCATTATTAG TACATACTAC GACATTGTTG TTCAATGTTT TAGTAACAAT 84 0 

ATATTCTCCC ATTATCATCA CCTATTTTTC TATTATTAAG ATTATATATC GGAAATGTCT 900 

AACTTGAAAG TATAATAATT TAAATACTTG TGTTCTATTA CACGCCTATC CTATATGATA 960 

TATCTTAATT TAATTTCGAT GTCTCTCAAA GTGGAATAAC TATAATAAAA AAATCTGACT 1020 

CCCATATTTA CAAATAATTC TATTTATTTA TATATTATCT GAATTAATAC TCAATACAAA 1080 

25 CTAAAACGTA CTATTAAATT GTGCAAAGCT AAAACAAATT TATATTCATC TATCCAACAA 114 0 

TATGTCTTAT CAATGGTATA GTCTTTGCAC ACCAATGGAG GAAATAAATC TCAACCTTAC 1200 

TATATTAATA TATAATCAAA TCTTAGATTA ACTAGTGTAA TGATACAGAT GATAATTGAG 126 0 

TACAAATTTA AAACCCTGAG ATTTTCGGTT TAATTTGAAA ACCTCAGGGT TTATTTGATT 1320 

TTTATATAAT GAATCGTTAC ATTAAAAATA TTTATTTATC AGAGTTCTTA TATTTGTTAG 1380 

CGCCCCAAGC ACTAATTCCA AATAAGTTAA TTTCTAAGTT TTCAGGTTTA AAGACAGGGT 144 0 

TCTTGCCTTC TTTTTTCTGC TTTTGATAAT CTTTCATCAA TGCAAAAGCT ACATTGGACA 1500 

GTCCTATAAT GGAAATAATG TTTACAATTG CCATTAAGCC CATAAATAAG TCTGCCGTAT 1560 

TCCATACTGT TTCTGTTTTT ACAACTGCAC CGACAAAGAC AAGTACTACA ACAAGACATC 1620 

TAAAGATAAA TAATATTACA CGGTTTGTTG ATAAAAATTC AATATTAGAT TGACCGTAAT 1680 

AGTAATTACC TACAACAGAT GAAAATGCAA ACAGTGTAAC tGCTaTTGTT AAGAAAATAC 1740 

^5 CTCCAGCAGA ACCTAAATGC TCATTAAGTG CTGATTGAGT AACTGCAACA CCTTGAGGTX3 1800 

CGTTATCACC AAATTTCAGT CCTGAATATA GTAAAATCAT GATTGCAGTT GCTGTACAAA 1860 

CCAACATTGT ATCAAAGAAC ACACCTAATG ATTGGATTAA ACCTTGCTTA ACAGGGTGTG 1920 

50 GTACGGCAGC AGTTGCOGCT GCATTCGGCG CAGAACCCAT ACCAGCTTCG TTAGAGAATA 1980 

AACCACGTTT GATACCTTGA AGAACCGCAG CACCTACAGC GCCACCAGTT ACTTGTTCGA 2040 
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GCAATATTAC TAAAACCATA CCAATGTAAA TGATAGCCAT AATCGGTACA ATTAACGAAG 2160 

ATAACGTAGC AATACTACGT ACACCACCAA ATATAATAAT AGCTGTTACG ATTGCTAAAA 2220 

TAATACCTGT GATTACTGGA CTAATATTAT ATTGCGTATT TAACGACTCC GCAATTGTAT 2280 

TAGATTGCAC TGTGTTAAAT ACAAATGCAA ATGTAATTGT AATTAAAATC GCAAATACGA 234 0 

TACCTAGCCA TTTTTGATTT AAACCTTTAG TAATATAGTA AGCTGGACCA CCACX3GAATC 2400 

CACCATCTTT ATCATGTACT TTATAAACCT GAGCCAAAGT CGCTTCTATA AATGCACTCG 24 60 

CTGCACCTAT AAATGCAATA ACCCACATCC AAAATACTGC ACCTGGACCG CCTAAAACAA 252 0 

TCGCAGTCGC AACACCAGCA ATATTACCAG TACCAACTCT CGAACCAGCA CTAATCGCAA 2580 

ATGCTTGGAA TGGCGAAATA CCCTTCTTAC CATCTTCTAA AGTTTCTGGA CGTTCTACTA 264 0 

AAGCTCTAAA CATTTCAGGT AACATTCXSTA ATTGAACX3AA TTTAGAACTA ATCGTAAAGA 2700 

AGAATCCAGC TGTCAATAAT AGACCAATTA AATATTGAGA CCATATTAAA TCGGTACCAA 2760 

CATGGACAAA TTCTTTAAAC CATCCAGGTA TTAAACTATC GAAATCTTTC AAAATAAACC 2820 

CCTCGCATCC TCTACATGAA TCATGTACCT TCTATAAAAT TAGACCGAAT TGAACCTTCA 2880 

2s GTAAATATAG AGATACATCA TCATTTCTTA TACAATACAA GAGATTTATA TTAGTTTGGT 2 94 0 

CAAAGTATAT CGCTAATTTA ACGATAAGTA CTTGGTCAGC ATTTAATATA AATCCCTTGA 3 000 

ATTTAGTCAA AATTTAACAT TACTGTATTT TATCATTTAA TTTCGTGATT GCATATAGTT 3060 

30 TTTAGCTAAT ATACATGTCT ATTACTTCAC CAAAATCATC TGTATCTACA ATGAATGAGC 3120 

CATTTGTATA TTGTTCAGAT TTATGAATAT CATTAATTAA ACCATGTTCT TCATTTGATT 3180 

TTGAATATAA TGTATATTGA CTATGTTCAC CTGTCACTAC ATGTGCAGCT ACAATACGAT 324 0 

GTGGATTTTT CTTTAATTCT TTTAATAAAG TTATTCCaCG TTGTGCTCTT TTAGCAACTT 3300 

GTAAGATTTT AAAACTAATA CGTTTTAACG AGCCGCGTTG TGTGGCCATC AATATAGTAT 3360 

CATTTTCAGA AACACCTTCT GTCATAACAA CGAAATCTTC AGCTTTAAGA TTTATTGATT 3420 

TAACACCAGC TGCCCTTAAT CCGGTATCTG ATAGTTCACT TGTATTATAC GTTAATGACA 3480 

TACCTTTATT AGTAATGACG GTAATTAATT GATCTTTTTC AAAGCGCATA ACACTAATCA 354 0 

AATCATCATT TTCTTTAACT TTAGTAGCAA TTAAAGGTTT ATTAAAACGC G TT G TTTT A A 3600 

ATAGAGGCAC TGTACTTTTC TTAATCATGC CATTTTGAGT CGCAAAAACA TAAAATGCAT 3660 

CTGTATTAAA GTCCTTTTCA TTAAAGACAT TAATAACCAC TTCATCTTCT TCGATAGGAA 3720 
so CTATTTGTGA TACATGTTGT CCCAATTCTT TCCAACGAAT ATCTGCTAAT TTATGAACCG 3780 
GTATAAATAG ATAACXSACCT TTATTTGTAA ATACTAGTAC GGTATCTTGC GTATTTACTT 384 0 
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TCACTAGATT 


TCCTCCTCAT 


CAAATTGATC 


ATTTTCAAGC 


ACTTGTACTT 


CAGAATTATC 
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SO 


TAAAATACTT 


TGGTCCTCTT 


GCATACCAAA 


CTCAACATGC 


TTTTCAATCC 


ATTCACGTCT 


5580 




AGGTTGTACT 


TTGTCACCCA 


TTAATGTTGT 


TACACGTTTA 


GATGAACGCA 


CTTCATCTTC 


5640 
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AGGGTTCATT TCACCCAAAC CTTTGTAACG TTGTAACGTG AaGCCTTTAC CAAGTTCTTT 5760 

TTGCAATTTA TTAAGCTCTT CGTCTGTCCA AGCGTATTCA ACTCG C TT T G TTTTGCCTTT 5820 

ACCTTTTTCC AATTTATAAA GTGGAGGTAA AGCAATAAAT ACACGACCTG CTTGAACAAG 5880 

CGGTTTCATA TATTTGAAGA AGAATGTTAA CAATAGCACT TGAATATGCG CACCATCAGT 5940 

ATCAGCATCA GTCATAATAA TTACACGATT ATAATTACTA TCTTCAATTT TAAAGTCAGT 6000 

ACCAACGCCT GCCCCGATTG TGTGGATAAT TGTATTAATT TCTTCATTTT TAAAAATATC 6060 

TTCTAGACGT GCTTTCTCTG TATTAATTAC CTTACCACGT AATGGTAATA TCGCTTGGAA 6120 

TTTGCGGTCT CGTcCAAGTT TTGCTGAACC TCCCGCAGAA TCACCTTCGA CTAAATACAA 6180 

TTCATTTTTT TCAGTGTTTT TACTTTGTGC AGGTGTTAAT TTACCAGATA GCAAAGTGTC 6240 

TTTACGCTTG TTTTTCTTAC CTGAACGAGC ATCTTCACGA GCTTTACGTG CAGCTTCCCT 6300 

TGCTTGTTGT GCTTTAATCG CTTTTTTCAC AAGTGATTTA GACAATTGTC CTTTTTCTTC 6360 

TAAATAGAAT GGCAATTTGT CTGCAACAAC TGAATCAACA GCACTTCTAG CTTCAGAAGT 6420 

ACCCAATTTA GATTTCGTTT GTCCTTCAAA TTGCAATAAT TCTTCTGGAA TACGAACAGA 64 80* 

25 CACAACAGCT GTTAAACCTT CACGAATATC ATTACCATCT AAGTTTTTAT C T rnXriTiT 654 0 

AAGTTCATTA ATACGACGTG CATAATCATT AAATACACGT GTCATTGCTG TTTTAAAACC 6600 

AACTTCATGT GTACCACCAT CTTTAGTACG TACATTATTT ACAAAACTTA AAATACTTTC 6660 

30 TGAATATTGA TCATTATATT GGAAAGCTAC GTCTACCTCT ATACCATTTG CTTCACCTGA 6720 

AAATGTAGCC ACGTCATGCA AAACTTCTTT TCCTTCATTG ACATAACTAA CAAACTCTTT 67 80 

GATTCCTTCT TATAATGGTA TGTCTT 6 806 
(2) INFORMATION FOR SEQ ID NO: 473: 



20 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 1716 base pairs 
(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

45 

GGGGCAnAAA TTCCAATACA CTCATTACCA AATATATACA CCACTTCCTG CTACAAGTnn 60 

TTTTACTTGA TCTTGGTCTT TTCCGCAGAA AGAGCATTTC CAAATTTTCT TCATCTTCCA 120 

so TTGAATTTAA ACATTCTTTT TACACCCCTA TTCGTTAAAG ACTATACTAG ATTGGATGTT 180 

ACAATGCAAC ATATTAACAT ACAAACTTTT TGCTTAAAGA ATAGTAGCAG ATACATAAGC 24 0 
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CCtTCAACGA ACTTTGCGTT ATCTCTTAAT AAATCGATAA CTTTrTOGAT ACGAACATCA 
TTTTTAATC5A TATCAGTAIT ACCTAAAGTA TTTTTCATAT CTTCAACTGA GATATTAAAT 
TGTTTACTCA TTTTTTCTAA TTCTTTATCG ATATCTTCAT CAGTAGCTTC GAnTTTTCA 
GCTTCAGCGA TCGCAGTTAA AGTTAAGTTA GTTTTAACAC GTTGTTCTGC ATCGTCTTTC 
ATTTGCTCTC TTAATTGAGT TTCATCTTGA CCTGAGATTT GGAAGTACGT TTGTAAATCT 
AAACCTTGTT GTrGAATTCT TTGTGCAAAT TCAGACACCA TACGATCTAA TTCAGTATTA 
ACCATTGCTT CAGGAATATC GATTGTTGTA TTATCAGTAG CTTTTGTAAT CGCTTCTTCT 
TTTTCAACAT TTTCAGCATC TGTAGCTTTT TGTTCAGCTA AACGITTAOG TAAGTITrCT 
TTGTACTCGT CTACTGTATT TGCTTCTGCA TCTAATTCAT TAGCAATTTC ATCTGTTAAT 
TCTGGGACTT CTTTAAATTT AATTTCGTTA ACTTTTGTTT TGAAAGTTGC TTCTTTACCG 



900 

GCTAATTCTT CAGCATGGTA TTCTTCTGGG AATGTTACGA CAACATCTTT TTCTTCGTCA 960 

ACTTTCATAC CTTCTAATTG CTCTTCGAAA CCAGGTATGA ATGAACCTGA ACCGATTTCT 1020 

AAATCGTAAC CTTCAGCTTG TCCACCTTCG AATTCTTCtC CGTCAACTGA ACCACTAAAG 1080 

25 TCGATGTTAA CTGTGTCGCC ATTTTCAACA ACACCATCTT CTTTAACGAC CATTTCAGCT 1140 

AAATGTCCTA AGCTGTGGTC AATCGCTTCT TGTAACTCAT CATCAGATAA TTCAGTTTCT 1200 

TGTTTTTCAA TTTCAAGACC TTTATAGTCT CCTAATTTAA CTTCTGGCTC AACTGTAACT 1260 

GTTGCTTCAA AAATGAAATC TTTACCTTTT TCAATTTCAG TAACACTTAC TTCtGGTTGT 1320 

GCAACTGGTT TAATATCAGT TTCGTCAATT GCTTCACCAT AAGCATCTGG TAATAAAATG 1380 

TCGATAGCAT CTTGATATAA TGCTTCTACA CCAAAGCGTT GTTCAAAAAT TGGACGTGGC 1440 

ACTTTACCTT TACGGAATCC AGGTACGTTA ATTTGTTTAA CCACTTTTTT GAATGCTTGA 1500 

TCTAACGCTT TGTTTACTTT TTCTGCAGGA ACAGTAACAG TTAATAAACC TTCGTTACCT 1560 

TCCTTTTTTT CCCAAGTTGC TGTCATCTAT ATATACCTCC ATGATTAACT AATTTATTTT 1620 

TTCAACTTCC CTATTATATC ATACGTCTAT TCCCTATACA AACATTCAAA TCACAACGTT 1680 

TATATATTTG TAAATCAACT TTTTTCGTCA AAACTA ^716 
(2) INFORMATION FOR SEQ ID NO: 474: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
SO (D) TOPOLOGY: linear 
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TGACCAAGTA 


CGTTTCGAAG 


TTGCCATTAA 


AGCATTAAAC 


CCATCATTGA 


AAGCATTCGC 


60 


ACCTGTACGT 


GAGTGGGCAT 


GGAGTCGTGA 


AGAAGAAATC 


GATTATGCAA 


TTAAACATAA 


120 


TATCCCTGTA 


TCAATCAACC 


ATGATTCACC 


TTATTCTATC 


GATCAAAATC 


TATGGGGCAG 


180 


AGCGAATGAA 


TGTGGTATTT 


TAGAAGATCC 


TTATGCTGCG 


CCACCAGAGG 


ATGCGTTCGA 


240 


TCTAACAAAT 


GCTTTAGAAG 


AAACACCAGA 


TACTGCTGAT 


GrAATCATTT 


TAACGTTTGA 


300 


TAAAGGCATC 


CCAGTTCAAA 


TTGATGGCAA 


AACATATGAA 


TTAGACGATT 


TAATTTTAAC 


360 


GTTGAATGCA 


TTAGCTGGTA 


AGCATGGTAT 


CGGAAGAATT 


GACCATGTAG 


AAAATAGACT 


420 


TGTAGGTATC 


AAATCAAGAG 


AAATTTATGA 


GGCACCTGCT 


GCAGAAGTTA 


TTTTAAAAGC 


460 


GCATAAAGCA 


TTAGAAACGA 


TTACXTTTAAC 


GAAAGATGTC 


GCACACTTTA 


AACCAATCAT 


540 


TGAGAAGCAA 


TTTGCTGAAC 


AACTATACAA 


TGGACTTTGG 


TTCTCACCTT 


TAACTGATAG 


600 


CTTGAAATTA 


TTTATTGATA 


GTACTCAGCA 


ATACGTAAGT 


GGTGATGTCA 


GAATTAAATT 


660 


ATTCAAAGGT 


AATGCCATCG 


TGAATGGTAG 


AAAATCACCT 


TACACATTAT 


ATGATGAAAA 


720 


ATTAGCAACT 


TATACAAAAG 


AAGATGCATT 


TnATCAAGAC 


GCTGCTGTTG 


GCTTTATCGA 


7B0 


TATCTATGGT 


TTACC 










795 



(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 887 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 

CGATTGAAAG AAGACGGTTC AGTTGAAAAG TTTCCAAAGC CAGTAATTAG CCAACAACCA 60 

GAAGGATATA CGAGTCATTT TAGAGATCCT AAAGTTTTTA AATATGATGr GaAATATTAT 120 

40 

GCAATCaTTG GTGCmCaAAA TAATGATCaG CaAGGTCGAT TATTACTTTA TAATACTGAA 180 

GATATAATTA ATTGGCATTA TTTAGGTGAA ATAAATACAG AGTTGGATGA TTTTGGATAT 240 

ATGTGGGAAT GCCCAGATTA CTTTAATGTA GATAATCAAG ATGTCATACT TATTTGTCCA 300 

45 

CAAGGTATTG AACCAAAAGG CGATCAGTTC AAAAATATTT ATCAAAGTGG TTATATACTT 360 

GGAAAGTTTG ATATTGAAAA GTTAACATAT GAACATGAAA ATTTTGTCGA GCTTGATAAT 420 

so GGTTTTGATT TCTATGCACC TCAAACATTT TTAGATGAAA AAGGCCGACG AGTACTAATT 4 80 

GGATGGATGG GGTTACCGGA AATCGAATAT CCTACTGATA ATGAAGGATG GGCCCATTGC 540 

55 



1399 



EP 0 786 519 A2 

GCGTTGGAAA AATTACGTCA CAATAAAGAG ACAGCATTtA GGctACGCAA ATAAATTTAC " 660 

TCGAAAATTA CATCCGTATG AAGGTAAACA GTATGAATTA ATCATAGATA TTTTGGATAA 720 

5 TGATGCTACC GAAGTGTACT TTGAATTACG TACATCTAAG ACTTCTTCAA CATTAATTGC 780 

TTATAACAAG CGTGAAAATA AAATAACATT AGATCGCAGC GACAGTGGTT TATTGCCGAC 840 

AAATGTTGAA gGTACGACGC GTAGTACGAT ATTAGACACG CCATTAA 887 
(2) INFORMATION FOR SEQ ID NO: 476: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1183 base pairs 

(B) TYPE: nucleic acid 

IS (C) STRANDEDNESS : double 

<D) TOPOLOGY: linear 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 76: 
TTGGAAnCAA AAAACCATTG GTAAACCGTG TTnAACCGGA TTTCCGATGG ACCTTTTAAA 60 

ACnACCAAAT AGAAAGCTTT GATAAAAGGT AATTATGGTA CTGATAACAA ACAAGTTCAA 120 

AAACATCATG ATTTAGTACG TATGCTTTTG ATGGATCAAG ATGGTTTTTT AACTGAAAAT 180 

AATAAAGTTG ATCATTTCAT TGATGGAAAT GATTTATATG ATCAAGTTTT AAAAGATATT 240 

AAAAATGCAA AAGAtATATC CATTTAGAGT ACwATACTTT CGCTTwAGAT GGTTwAGGTA 3 00 
30 AAAGAATTTT ACATGCTTTA GAAGAAAAAT TGAAACAAGG TCTAGAAGTA AAAATATTAT . 3 60 

ATGATGATGT TGGATCTAAA AATGTTAAGA TGGCAAATTT TGATCATTTT AAATCGTTAG- 420 

GTGGAGAAGT TGAAGCATTT TTTGCTTCAA AATTACCGTT ATTGAATTTC AGAATGAATA 4 80 

ATAGAAATCA TAGAAAAATC ATCGTAATCG ATGGTCAACT AGGTTATGTC GGAGGATTTA 54 0 

ACATTGGTGA TGAATATCTm GGATTAGGAA AATTAGGATA TTGGAGAGAT ACGCATTTAC 600 

GTATACAAGG GGATGCGGTT GATGCACTGC AGTTGCGATT TATTTTAGAC TGGAATTCGC 660 

AAGCGCACCG TCCACAATTT GAATATGATG TTAAGTATTT CCCTAAAAAG AACGGACCAT 720 

TGGGCAATTC ACCAATTCAA ATAGCTGCAA GTGGCCCGGC TAGTGACTGG CATCAAATTG 780 

AATACGGTTA TACAAAAATG ATTATGAGTG CAAAGAAATC TGTATATTTA CAATCACCAT 840 

ATTTCATTCC GGATAATTCA TATATaJUVTG CCATTAAAAT TGCTGCTAAA TCAGGTGTAG 900 

ATGTACATTT AATGATTCCA TGTAAGCCAG ATCATCCATT AGTATATTGG GCGACATTTT 9 60 

CAAATGCCTC TGACTTATTA TCAAGTGGTG TTAAAATTTA TACGTATGAA AATGGATTTA 1020 

TACATTCTAA AATGTGCTTA ATTGATGATG AAATCGTATC AGTGGGCACA GCAAATATGG 1080 



40 



45 



SO 
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CTAAAGATTT AAGGGTGGCT TATGAACATG ATATTACAAA ATC 1183 
(2) INFORMATION FOR SEQ ID NO: 4 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 

GGTTATATGT TTGGTATTAC TCATTATATT GATTGGAGGT TGTGTCATTA TGACAAAAAC 60 

AAATGGTCGA AACGCTCAAA TTAAAGAAAA TTTCAACAAA ACATTAAGTG TATATCTAAC 120 

CAAGAATCTC GATGATTTTT ACGATAAGGA AGGTTTTCGA GATCAAGAAT TTGATAAAAG 180 

AGATAAAGGG ACTTGGATTA TTTATTCTGA AATGGTTATC GAACCAAAAG GGAAnAATAT 24 0 

GGAATCGAGA GGAATGGTGC TCTATATCAa TCGCAATACT AGAACmACGA AGGGTAATTT 300 

TATTGTCACC GAAATAACTG AAGATAGTAA AGGATATTCA CGTAGTAAAG AAAAAAAATA 360 

25 TCCTGTCAAG ATGGAAAATA ATCGAATTAT TCCAACAAAG CCTATACCGG ATGACAAGTT 42 0 

AAAAAAAGAG ATTGAAAACT TTAAGTTCTT TGTACAATAT GGAAATTTTA AAGATTTTAA 4 80 

AGATTATAAA AATGGTGATA TTTCATATAA TCCTAATGTG CCAAGTTATT CTGCAAAGTA 54 0 

30 TCAATTGAAT AATGATGACT ATAATGTTCA ACAGTTAAGA AAACGATATC ATATTCCAAC 600 

CAAACAAGCG CCCGAATTAA AATTGAAAGG ATCCGGCAAT TTAAAAGGCT CATCCGTAGG 660 

ATCTAAGGAT CTAGAATTTA CGTTTGTAGA AAATCAAGAA GAGAATATCT ATTTTTCAGA 720 

TTCGGTCGAA TTTACACCTA GCGAGGATGA TAAATCATGA GTCAAACGGA ATATCAAATA 780 

AAATCTGGCA ATATAAAAGG TAACTCTGAA GAAACAAGTA CAGTATCTAA TATAAGTTAT 840 

GAAATAGAAA ACGCAAATAA CAGTGGTTTA AAACAAAATA AAATTGATAA ACAAATTAAA 900 

AAGTTACAAG AAAAAAATAA ATTCCCTAAA AATCTTTCAT ATCTTAAAAG TTATACGGAC 960 

CCCAAAACAG GCACGACTAC AAGCGCCTTT TTAAATAAAG ACACTGGCAA AGTTACTTTA 1020 

GGTATGACAG GTACTAATGT ACACAAAGAC GCAATATTAA AACAAACATT TGGTGTTCCT 108 0 

TCTTATCAAG GATATATAGA CGTGAGTGAA ACgCTaAAAG ATATTGGGGC CQATGTCAAT 114 0 

ATTGGCCTTC ATTCCGTCAC AGATAAAGAT CCACATTATA AAAATACCCA AGACTTTATC 1200 

AAAAATATCA AAAAAGACTA TGATATTGAT ATTATTACCG GACATTCGCT GGGCGGTAGA 1260 

GATGCGATGA TTTTAGGTAT GAGTAATGAT ATTAAACATA TCGTTGTGTA TAATCCAGCT 1320 
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ATTGAAAAGT 


ACG ATGG Tr* a 


^a'i"i v*i'& » 


TTTGTGTCTG 


ATGAAGACGA 


ATTAGATGCA 


1440 




GGTGTCCGCA 


ATP ATT*r A T A 
n X ^.inx X X A xx\ 


IvaAAAL. 1 vjv. 1 


UGAGAAAAAA 


TAGTACTTAA 


^ H «L *T\^^ TV ^ TV TV 

AAATGGAGAA 


1500 


5 


GGCCATGCAA 


TGAGTGGTAT 

X KJi^lXV X X /\X 


^p^p^pa an a ^^^^ 

X 1 lAAxVaAu^ 


a/** a a ^a /^a/'V^ 


CTATAATCTT 


TV ^W/WiV * TV TV WiimTV 

AGCTGAATTA 


1560 




AACAAAGTTA 


AAGGGTAGrA 


AG Ar*G A A a aT 


a aTa a a/^/^at* 
AAlAAAGVJAl 


»T» Jv TV TS TV »^/^ 1 < 

TAAAATCCGT 


f^^T/^^lTS TV TV IV TV 

TCGTAAACAA 


1620 




A CG AGGCATA 




A GT a a #^ a r^r' 


T*T«a a/^ ao/"v^ k 
i 1 AACiAGvJGA 


ATTGGATTCA 


AACAACGGGT 


1680 


10 






a a a a a a 


TTAGAAG CTT 


m IS IV IV ^^^^ Ik J' win 

TAACAGCACT 


AACCATTGCC 


1740 




X Xnn 


A TT" a a 'T*T' a r"r 


/^a AT'r'a ar'a a 

VjAA 1 uAAo AA 


AGC CAACATT 


n^^^ IS IS TV TV IS n^^^ 

TGAAAAAATG 


^ri m n^^^ m ^^^^ ^^^^ ih 

TATCACGCGA 


1800 


IS 


TGGr'ar*aTaa 


ATTTWiar'ar' 


a a r^Trv^ a & a & 
AAU 1 Vvv^AAaA 


AAG uGCaAGA 


IS ^^«|Sil (/ TV TS 1R 

AGTTGGAAAT 


G7VAATTGGTG 


1860 


a a a a Ik *T*T* a zv 


L. 1 L. IXaAACj 


VjTTATAGATG 


rATTAAGAAA 


AGGTGGCGCG 


TATGAAAGTa 


1920 




/v\v. X x\j>\M/\v. 


lk/^ft.'T*/^/^r» A 

AvsATC wCIAAA. 


AGAAAAATTG 


7S *T>^ IS T* IS IV IV fT« 

ATGATAAGAT 


AAAGAAATTA 


AATGATGTTT 


1980 


20 


ATaa aa aT*m 


1 AA 1 uuv. 1 A 1 


ATCGCAAAAA 


»!»«»» TV » IS /^TS^ 9Sr^ 

TTAAaCAGAG 


m TV ^py"»y^ TV TV TV 

TATCGAAGCA 


ATTGTTTCTA 


2040 




aTr;a*^r'a a aT 




t_A,GATTGATG 


GGATGATGTA 


ATGTTTACTA 


CGTATAAnAA 


2100 




TATTAATGAA 


CTTGAAAATG 


CCTATGATGA 


AGAAAGAAAA 


CAATTGAATG 


ATGCATTCAA 


2160 


25 


TCAAATTGAT 


GAATTAAGAC 


ATCAAACACG 


CAAGAmATGT 


GAACAAATGT 


ATGATCATTT 


2220 




CTTATATCTC 


AAACATAAAA 


TGAATTmyms 


TGAAGACGCT 


ATGATCAGGA 


TGACACGTAT 


2280 




TATAGAATCT 


TTCGATAGAG 


AAACGAATCA 


ACGTATCCGA 


CATCACGAAA 


TG 


2332 



30 (2) INFORMATION FOR SEQ ID NO: 478: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 

TTTACTACCC AGTATCTCTT TTTATAAATT ATATAGCCAC CACATAT G GT GGAAAGTCTT 60 

TTTAATTAGA ATTTTGTTTT TTCAGTTAAG AAAGCTTCTA ACTCTGAGAT TGGCATACGA 120 

ACTTGTTCCA TTGAATCTCT GTCACGTACT GTAACTTGAT TATCTTCTAA TGAATCAAAG 180 

TCGAATGTtA CACAATAAGG TGTaCCGATT TCATCTTGAC GACGGTATCT TTTACCGATA 24 0 

GATrGTGATT CATCGAAATC GATTQAGAAT TTAGAACTTA ATTGCTCAAA AATCTTAATC 300 

GtTCGCCAGA TAATTTCTTA CTTAAAGGTA AAATCGCTGC TTTATATGGT GCTAATGCAG 3 60 

GATGGAAGTG TAAAACTGTA CGTGCATCTT TACTACCTTC AACGCCTTCT TCATCATATG 4 20 
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GAATATATTT 


TTCGTTCGTT 


TCTGGATCAT 


GGTATCTGAA 


ATCTTCACCA 


GAGTGTTCAG 


54 0 




CATGTTTACG 


TAAGTCGAAG 


TCTGTACGAC 


TTGCGATACC 


CCATAACTCA 


CCCCAACCAA 


600 


5 


ATGGGAATTT 


ATATTCAATA 


TCAGTlUTi^ 


CATTTGAGTA 


ATGAGATAAT 


TCATCTTCAT 


660 




CATGATCACG 


TAAACGCATA 


TTTTCACTGC 


TCATATTTAA 


GCTTGTTAAC 


CAGTCACTTG 


720 




CAAAAGTTTT 


CCAATAATTT 


TGCCATTCGA 


TTTCTTCTCC 


AGGTTTACAG 


AAGAATTCAA 


780 


10 


GTTCCATTTG 
GGaATGaTTT 


TTcAAATTCT 
ACCAATTTGG 


CTTGTTCTGa 
ACCGG 


AAATGaAGTT 


ACCTGGAGTG 


aTTTCaTTAC 


840 
865 


15 
20 


(2) INFORMATION FOR SEQ ID NO: 4 79: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 






25 


TAATGAGTAA 


ATAAGTACCA 


GATAAAATCA 


TGAATATCAT 


CCAACATGCG 


GTTAACTCTA 


60 




CTAAATAATT 


AATAATAGTA 


TTTTCAGTAA 


ACAAAGAACT 


ATGTATACTT 


CGCATCACAT 


120 




TAGAATACGT 


ATGTTTCGCA 


TTTTGATCTG 


CAACAAATTG 


ATTGTTATGA 


TCTAGAAAGA 


180 


30 


CGTAACGTTG 


ATTTCCTGCC 


ATATCACTCA 


GTGTAATTCG 


TTTGTTATAT 


GGTTCATCAA 


240 




GTATGCTAAC 


TTTACTTACA 


AAGAATCCTT 


CATATTGTTG 


TTCAACTTGA 


TGTACTGCAT 


300 




CATTTAATGT 


TTGATGCGTT 


TTTACATCAC 


TGTCACCAAA 


AAACTCATTC 


TTATAAATAT 


360 


35 


TATTTTCAAC 


TTCTGGAAAG 


AACAAGTAAC 


CAATGCCCGA 


AATGGTTAAA GTGATTAACA 


420 




GTGGAGCAAT 


AAATATTGCT 


GCATAGAAAT 


GTAATCTTTG 


TAATGGATTA 


AATGTATTTT 


480 


40 


TCATATTTCC 


CTCCCAATTG 


GCTATTATAC 


GGTGTCAATT 


CTGTGATGTG 


TGTGAACAAA 


540 


CTGTGACAAT 


ATTTATTTTC 


TAGAAAAATT 


TAACGATGAT 


TTGTGATTTT TAGAAAAATQ- 


600 




AACTTTTAAG 


TTGGAATGTT 


TGAAGAAAAT 


TGATTATTCG 


TATGTTTTAT 


CAAGCAGCTA 


660 


45 


TGATAAAATT 


TAAACATAAT 


ACAATGCQAG 


CCATTTAACG 


ATCTATGTTT 


AAATGGACAT 


720 




CGATATTGTA 


TGAATTCGTT 


GTAACAAGCA 


AGCATTTCTA 


TGTGAACGAA 


CCAAAGGGGA 


"7 Q n 




AAGTAACATG 


ATTAATAAAG 


AACAATTAGA 


TCTTTTATAT 


AAATTAAAAA 


AAGAAGTTGA 


840 


SO 


AAAGTCGCGA 


AATGAAGCAC 


TTTTACATAC 


AATTAACCAA 


GTAATTAAGA 


AAGTATATTT 


900 




GCAGCAATAT 


ACATGTTCGT 


TCGTTGGACA 


TTTTTCTGCA 


GGTAAATCGA 


CACTGATAAA 


960 
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TATTGTGTCA GTTTCAGACA ATCACGATAT TATTGCTAAT TTGCCGAATC AAACGTATGC ' 1080 

CAAATTATCT AATTATGATG AAGTA^GGGA AATGAATCGC CAAAATGTCG ACGTTGAATC 1140 

5 TGTAGAAATT AATTTTCAAT CAGCTAAATT TGAAAATGGG TTTACGTTGC AAGATACACC 1200 

AGGTGTTGAT TCAAATGTTG CATCACATCA GTCAATAACA GAACAATATA TGTATACAAG 1260 

TAATATGATA TTTTATACGG TTGACTATAA CCACGTTCAA TCTGAACTTA ACTTTAAGTT 1320 

TATGAAGCAT ATAAATGATG TTGGaATACC TGTTGTGTTT ATCATTAATC AAATTGACAG 13 8 0 

CATCCAAGAC GATGGAATTG TCATTCTCTA CGTnTTAAAT CTCGAGTTGG AAAAATCAAT 144 0 

TGGC 1444 



75 

(2) INFORMATION FOR SEQ ID NO: 480: 

(1.) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6309 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

2^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480: 

GCAGCAGCTT ATCGTGAGTT ATCATTATTA TTACGTAGAC CTCCAGGTCG TGAAkCATAC 60 

CCAGGTGACG TATTCTACTT ACATAGTAGA tTATTAGAAA GAGCAGCAAA ATTAAACGAT 120 

30 GACTTAGGTG GCGGTTCAAT TACTGCATTA CCAATTATCG AAACACAAGC TGGTGATATT 180 

TCAGCTTATG TACCAACAAA CGTTATTTCA ATTACAGATG GACAAATCTT CTTACAATCT 24 0 

GATTTATTCT TCTCAGGTGT AAGACCAGCG ATTAATGCCG GACAATCTGT ATCTCGTGTT 3 00 

GGTGGATCTG CACAAATTAA AGCAATGAAG AAAGTTGCTG GTACGTTACG TCTTGACTTA 360 

GCGTCATACA GAGAACTTGA ATCATTTGCA CAATTCGGTT CAGACCTTGA TGAATTTACT 420 

GCAAGTAAAT TAGAACGTGG TAAACGTACT GTTGAAGTCT TAAAACAAGA TCAAAACAAA 480 

40 

CCATTAGCAG TCGAACACCA. AGTGTTGATT ATTTATGCAT TAACAAAAGG ATATTTAGAT 54 0 

GATATTOCAG TTGTAGATAT CACACGTTTT GAAGACGAGT TAAACCACTG GGCAGAATCA 600 

AATGCTACTG AACTGTTAAA TGAAATCAGA GAAACTGGTG GCTTACCAGA TGCTGAGAAG 66 0 

45 

TTTGACACAG CAATTAACGA ATTCAAAAAA AGCTTTAGCA AATCTGAATA ATAAACAAGT 72 0 

TTAGTATAAG GTGGTGAGAT AGTGGCTTCT CTTAAAGAAA TAGATACTCG AATAAAATCA 780 

5^ ACCAAAAAAA TGAAGCAGAT TACGAAAGCG ATGAACATGG TATCAAGTTC AAAACTTCGT 84 0 

AGAGCTGAAA AAAATACAAA ACAATTCACA CCATATATGG ATAAAATGCA AGATGCAATT 900 

55 
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ACTAGAAGTG GATATTTAGT TATCACGAGT 
AACGTGCTTA AAAAATTGAT TACTGATATT 
5 AGTATTGTAG TTTTAGGGCA ACAAGGTGTT 

GAGTATTCTC AAGTAGACGT ACCTGATCAA 
AACCATGCTA TAGACTTATA CAGTGAAGAA 

10 

CATTATGTCA GCGTTCTTGA AAACAAGCCT 
GAGGATTCTA GTAAGGGGCA TGGTCATTTG 
TCTATCTTAA GTGTAATCTT GCCTCAATAT 

IS 

GACGCAAAAG CAAGTGAGCA TGCAACACGT 
GCAACTGAAC TTATTGATGA CTTATCATTA 
ACGCAACAAA TTACTGAAAT TGTTGGTGGT 

20 

GAAAATAGCA TGGGAATTGG CCGTGTAACT 
TTTGAACATA ACGAAgTTCC TAAAATTAAT 

2s GAAGGTACAA TACAACTAAC ATTAGAAGTT 

ACAATTGCGA TGGATTCAAC TGATGGTGTC 
AAAGAAATTA GTGTACCTGT TGGTGACGAA 

30 GAAACAATTG ACCTTAAAGA AGAAATTAGT 

CAAGCACCAG CATTCGATGA ACTTTCAACA 
GTAGTAGATT TACTAGCACC TTATATTAAA 

^ GGTGTAGGTA AAACAGTATT AATCCAAGAA 
GGTATTTCTG TATTCGCCGG TGTAGGTGAA 
GAAATGAGTG ACAGTGGTGT AATTAAGAAA 

40 

CCACCTGGTG CACX3TATGCG TGTTGCATTA 
GACGAACAAG GTCAAGACGT ATTATTATTC 
GGTTCTGAGG TATCTGCATT ATTAGGTCGT 

45 

CTTGCTACTG AAATGGGACA ATTACAAGAA 
ACTTCTATTC AAGCGGTATT CGTACCTGCC 
so GCGTTTGCCC ATTTAGATGC AACTACAAAC 
TATCCAGCCG TGGATCCATT AGCGTCTACA 

55 



GATAAAGGTT TAGCAGGTGC ATATAGTGCA 1020 

GAAGCGAAAC ATCAAGATAG TAGCGAATAC 1080 

GATTTCCTAA AAAATAGAGG TTATGACATT 1140 

CCTTCTTTCA AATCTGTTCA AGCACTAGCT 1200 

GAAATTGATG AATTAAATAT ATACTATAGT 12 60 

ACATCTAGAC AAGTATTACC ATTATCTCAA 13 20 

TCTTCTTATG AATTTGAGCC AGATAAAGAA 13 80 

GTTGAGAGTT TGATTTACGG AACAATATTA 1440 

ATGACTGCGA TGAAAAATGC CACTGATAAT 1500 

GAATATAACA GAGCGAGACA AGCAGAAATT 1560 

TCCGCAGCGC TTGAATAATA TTTAAAGGAG 1620 

CAAGTTATGG GTCCTGTAAT TGATGTTCGA 1680 

AACGCCTTGG TTATTGATGT GCCTAAAGAA 1740 

GCGCTGCAAT TAGGTGACGA CGTTGTTCGT 1800 

CAAAGAGGCA TGGATGTAAA AGATACAGGC 1860 

ACATTAGGTC GTGTATTTAA TGTACTAGGT 1920 

GATTCTGTTC GCCGCGATCC TATCCATCGT 1980 

GAAGTTCAAA TTTTAGAAAC AGGTATTAAA 2040 

GGTGGTAAAA TCGGATTGTT CGGTGGTGCC 2100 

TTAATTAACA ACATCGCTCA AGAGCACGGT 2160 

CGTACTCGTG AAGGTAACGA TTTATACTTC 2220 

ACAGCCATGG TATTCGGGCA AATGAATGAG 2280 

TCTGGTTTAA CAATGGCTGA ATATTTCCGT 234 0 

ATCGATAACA TTTTCAGATT TACACAAGCT 2400 

ATGCCTTCTG CAGTAGGTTA CCAACCAACA 2460 

CGTATTACGT CTACAACAAA AGGATCAGTT 2520 

GATGACTATA CTGACCCAGC GCCTGCGACA 2580 

TTAGAACGTA AATTAACTGA AATGGGTATT 264 0 

TCAAGAGCAT TGGAACCATC AATTGTAGGT 2700 
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10 



IS 



20 



25 



30 



3S 



40 



45 



SO 



CAAGATATCA 
GAACGCGCAC 
ACTGGTCAAA 
TTAGATGGTA 
GATGTTATTG 
AATGAATACA 
TGTTGAACTC 
TCCAACTGTA 
ATATATTGCT 
TCAGACTGCA 
AGCAGAGTCT 
TTTAGAGAGA 
AAGATGTGAT 
ATGAGAGGGG 
ATCATTAAAC 
AAGAAATACT 
CCAAATACAA 
ACTTTTTTAC 
TCGACATTTC 
GATATTTATG 
TAAACTTTGT 
TGATAAAATG 
TATTTGTAJT 
ATACX3CCACT 
AAGCAATTTT 
AGTCTAACTC 
TTTAAAATAG 
ATAGTAATCA 
GCAGTATTAC 



TTGCTATCTT 
GTAGAATTCA 
AAGGTTCTTA 
AATATGACCA 
CAAAAGCTAA 
TTAAACCTAG 
GTTGTTATGC 
GCTGCTTTAA 
GTAAGCGATG 
GAAACTGCAA 
CACTTGGAAA 
GCAAATAACC 
TTCATATCTT 
GGAAGGAATT 
ACAGCTAATG 
AATTTCTAAA 
CACTATAGAG 
AATCACTAAT 
ATTTTTTGAA 
TAAAAAGAAT 
ATCATTATTT 
GATTATATCG 
GCCTATTGGG 
CAATTACAAG 
ATTGTAGATT 
TATGATTTGT 
GGAAATGTAA 
AAGGTGGAAA 
CAATATTGAC 



AGGTATGGAC 
ATTCTTCTTA 
TGTACCTGTT 
TATTCCAGAA 
AGATATGGGT 
ATATTGTCAC 
AAACAACAGC 
AAACAGGCTT 
GCTTTGTTGA 
GAGAAATTGA 
ATGATGACGA 
GTTTGCGTGT 
CGACCCTTTT 
GATAAAGAAC 
TGTATTTAAA 
GAAAAAGTAT 
TCTAGACATT 
AGATTGCTAA 
ATTATCTACA 
TATATAGTAA 
AAATTTTTCA 
GACAATATGC 
CTTTACAATC 
TGTGTATGAT 
TGTTACAATA 
AATCAAAACT 
AGTAATAGGA 
TAAATTAACG 
AGCATCTTTA 



GTVATTATCTG 
TCTCAAAACT 
AAGACAACAG 
GATGCATTCC 
GTTGAAGTAT 
TCCTAATGGT 
TGGTGAGATA 
TGTAAAAGTG 
AGTTAGAAAA 
TGTTGAAAGA 
CAATACTGAT 
GGCTGAATTA 
TTGAATTATA 
CATTAAAGAT 
AATAGGaayA 
TTCTTTATGT 
GATTTATGTC 
AATCAAAATT 
TTTTTTCATA 
GTTAGCTTAA 
TGTACAATGT 
AGTTATCCAT 
AATTAGATTA 
ATTTGTTGCT 
CTCGACGCAG 
AGATATAATT 
GTTCTAAGTG 
GGTGAAGTTA 
TTAGCTTCTG 



ATGAAGATAA 
TCCACGTAGC 
TTGCAAACTT 
GTTTAGTTGG 
AACAATTAGG 
TCTGTTTACA 
GGTGTCATGA 
AAATTTCACG 
GATAAAGTTT 
GCTAAATTAG 
ATTCATAGAG 
AAATAGTAAA 
TTGATTTAAA 
TTATGATGTA 
CATgAGTAAA 
TGGGGCCCCG 
CX5ACTCCCAA 
TCCTTCACCA 
CCAAGATATT 
ACTTTACTAA 
AATACAGTAA 
TTAGTGTTAC 
GATCAATTTT 
ATTTTATTAG 
GTAAAATATT 
AAATAATGAC 
GAGGATTTAC 
AAGTAGAAGG 
ATAAACCGAG 



ACAAACAGTT 
GGAACAATTT 
TAAAGATATC 
TAGCATGGAT 
AGGAATGGAT 
ATCGTGATAA 
GTGGACATAT 
ATGGAACTGA 
CAATCATTGT 
CCAAAGCAAG 
CCGAAAGAGC 
TAAAGGGTCG 
GATACAAAAC 
GTGGTTCTTT 
ACTCATGTAT 
TCAACTACTG 
GAATAGTTTT 
CTATCTACAG 
TTATAGTTAT 
AAACGGGTAT 
TCTTATGAGG 
ATGTTGTATG 
TTAAAAAAGG 
GCACTGCAGT 
TAATAAAATA 
TTAAAATAAT 
GATGGATAAA 
TGCTAAAAAT 
CAAATTAGTT 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
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GACGTTACAT 


ACAAAAAGGA 


CGAAAATGCT 


GTTGTCGTTG 


ATGCAACAAA 


GACTCTAAAT 


4620 




GAAGAGGCAC 


CATATGAATA 


TGTTAGTAAA 


ATGCGTGCAA 


GTATTTTAGT 


TATGGGmCCT 


4680 


c 


CTTTTAGCAA 


GACTAGGACA 


TGCTATTGTT 


GCATTGCCTG 


GTGGTTGTGC 


AATTGGAAGT 


4740 




AGACCGATTG 


AGCAACACAT 


TAAAGGTTTT 


GAAGCTTTAG 


GCGCAGAAAT 


TCATCTTGAA 


4800 


70 


AATGGTAATA 


TTTATGCTAA 


TGCTAAAGAT 


GGATTAAAAG 


GTACATCAAT 


TCATTTAGAT 


4860 


TTTCCAAGTG 


TAGGAGCAAC 


ACAAAATATT 


ATTATGGCAG 


CATCATTAGC 


TAAGGGTAAG 


4920 




ACTTTAATTG 


AAAATGCAGC 


TAAAGAACCT 


GAAATTGTCG 


ATTTAGCAAA 


CTACATTAAT 


4980 


IS 


GAAATGGGTG 


GTAGAATTAC 


TGGTGCTGGT 


ACAGACACAA 


TTACAATCAA 


TGGTGTAGAA 


5040 




TCATTACATG 


GTGTAGAACA 


TGCTATCATT 


CCAGATAGAA 


1TGAAGCAGG 


CACATTACTA 


5100 




ATCGCTGGTG 


CTATAACGCG 


TGGTGATATT 


TTTGTACGTG 


GTGCAATCAA 


AGAACATATG 


5160 


20 


GCGAGTTTAG 


TCTATAAACT 


AGAAGAAATG 


GGCGTTGAAT 


TGGACTATCA 


AGAAGATGGT 


5220 




ATTCGTGTAC 


GTGCTGAAGG 


GGAATTACAA 


CCTGTAGACA 


TCAAAACTCT 


ACCACATCCT 


5280 




GGATTCCCGA 


CTGATATGCA 


ATCACAAATG 


ATGGCATTGT 


TATTAACGGC 


AAATGGTCAt 


5340 


25 


AAAGTCGTAA 


CCGAAACTGT 


TTTTGAAAAC 


CGTTTTATGC 


atgttgcaSa 


GTTCAAACGT 


5400 




ATGAATGCTA 


ATATCAATGT 


AGAAGGTCGT 


aGTGCTAAAC 


TTGAAGGTAA 


AAGTCAATTG 


5460 




CAAGGTGCAC 


AAGTTAAAGC 


GACTGATTTA 


AGaGCAGCAG 


ccgccttaat 


TTTAGCTGGA 


5520 


30 


TTAGTTGCTG 


ATGGTAAnAC 


AAGCGTTACT 


GAATTAACGC 


ACCTAGATAG 


AGGCTATGTT 


5580 




GACTTACACG 


GTAAATTGAA 


GCAATTAGGT 


GCAGACATTG 


AACGTATTAA 


CGATTAATTC 


5640 


35 


AGTAAATTAA 


TATAATGGAG 


GATTTCAACC 


ATGGAAACAA 


TTTTTGATTA 


TAACCAAATT 


5700 


AAACAAATTA 


TACCTCACAG 


ACAGCCATTT 


TTATTAATTG 


ATAAAGTAGT 


TGAATATGAA 


5760 




GAAGGTCAAC 


GTTGTGTGGC 


TATTAAACAA 


GTATCAGGAA 


ACGAACCATT 


CTTTCAAGGG 


5820 


40 


CATTTTCCTG 


AGTATGcGGT 


AATGCCAGGC 


GTATTAATTA 


CTGAAGCGTT 


AcTCAAACAG 


5880 




GTGCGGTAGC 


TATTTTAAAT 


AGTGAAGAAA 


ATAAAGGTAA 


AATCGCTTTA 


TTTGCTGGTA 


5940 




TTGATAAATG 


TCGTTTTAAA 


CGTCAAGTAG 






X XUUi/WlVjlAu 


dUUO 


45 


AAATCACTAA 


AATTAAAGGA 


CCAATAGGTA 


AAGGTAATGC 


TAAAGCTACT 


GTCGATGGTC 


6060 




AACTTGCTTG 


TAGTTGTGAA 


CTTACATTTG 


CAATTCAAGA 


TGTAAAATAA 


AACAAAAAAA 


6120 


50 


ACATTCAAAG 


ATTTAATGTG 
TTTAACTTTG 


TTGGCATAAT 


CTTTGAATGT 


TTTTTATTTT 


ACTCTTCTAA 


6180 


TTTTTCATCC 


GTTTAGACTG 


CaTCATTCGA 


TTAAATGATT 


TTTTTAATTC 


6240 




TTCACCAGAT 


AATCCATCAT 


CAATAAGTTG 


GTTCTAATAA 


ACTTTCAGCA 


TACTGTTGGA 


6300 



55 



1407 



EP0 786 519 A2 



1$ 



20 



(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 481: 

AGTTGCTACA CCAGACATGA TGGGTGAAGT TGGTAAATTA GGTCGTGTAT TAGGACCAAA 60 

AGGTTTAATG CCAAACCCTA AAACTGGAAC TGTAACAATG GATGTTAAAA AAGCTGTTGA 120 

AGAAATCAAA GCTGGTAAAG TAGAATATCG TGCTGAAAAA GCTGGTATCG TACATGCATC 180 

AATTGGTAAA GTTTCATTTA CTGATGAACA ATTAATtGAA AACyTCaATA CyTTACAAGA 240 

TGTATTAGCT AAAGCTAAAC CATCATCTGC TAAAGGTACA TACTTCAAAT CTGTTGCTGT 3 00 

AACTACAACA ATGGGTCCTG GAGTTAAAAT TGATACTGCA AGTTTCAAAT AATAAATGAT 360 

ATAAACAATT ACAGGCTGAA AGAAATATCT TTCAGTCTGT AAAAATATAT TGACAATAAG 4 20 

25 TAATTTCCAA GTTATATTAC TTATTGTGAT TATTTTACCT AAGACAGTAG GAGTTATTTA 4 80 

TAACTTAAAA TTTATCCTGC CGAGGCTAAA ATTGACTTGA ACGTGATGAT CTATGATCTT 540 

TCAAGCACTT TTTGCCGTGG GTAGAAAGTG CTTTTTTTAT TAATTTTAAA AAAAGCACCA 600 

AAAATTTAAA TGGAGGTGTC TGAATGTCTG CTATCATTGA AGCTAAAAAA CAACTAGTTG 660 

ATGAAATTGC TGAGGTACTA TCAAATTCAG TTTCAACAGT AATCGTTGAC TACCGTGGAT 720 

TAACAGTAGC TGAAGTTACT GACTTACGTT CACAATTACG TGAAGCTGGT GTTGAGTATA 780 

AAGTATACAA AAACACTATQ GTACGTCGTG CAGCTGAAAA AGCTGGTATC GAAGGCTTAG 84 0 

ATGAATTCTT AACAGGTCCT ACTGCTATTG CAACTTCAAG TGAAGATGCT GTAGCTGCAG 900 

CGAAAGTAAT TTCTGGATTT GCTAAAGATC ATGAAGCATT AGAAATTAAA TCAGGCGTTA 960 

TGGAAGGCAA TGTTATTACA GCAGAAGAAG TTAAAACTGT TGGTTCATTA CCTTCACACG 1020 

ATGGTCTTGT ATCTATGCTT TTATCAGTAT TACAAGCTCC TGTACGCAAC TTCGCTTATG 1080 

45 CGGTTAAAGC TATTGGAGAA CAAAAAGAAG AAAACGCTGA ATAATTTTTA GCGTAAAAAA 1140 

ATTAAAAATA ATGGAGGAAT TATAAAATGG CTAATCATGA ACAAATCATT GAAGCGATTA 1200 

AAGAAATGTC AGTATTAGAA TTAAACGACT TAGTAAAAGC AATTGAAGAA GAATTTGGTG 1260 

TAcTGcAGCT GCTCCAGTAG CAGTAGCAGG TGCAGCTGGT GGCGCTGACG CTGCAGCAGA 1320 

AAAAACTGAA TTTGACGTTG AGTTAACTTC AGCTGGTTCA TCTAAAATCA AAGTTGTTAA 1380 



55 
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TCCTAAAGTA ATCAAAGAAG CTTTACCTAA AGAAGAAGCT GAAAAACTTA AAGAACAATT 1500 

AGAAGAAGTT GGAGCTACTG TAGAATTAAA ATAATTCAAG TATCTTAAAC TTAATAATCA 1560 

AAGTTTTATA GCAAGTATTG CTATAATATA ATGATTCTTT GAGAAGTTAA AACCCCGTTA 1620 

TTTTGATAAC GGGgTtTTAT TCaTTTAAAG ACTGAGTGAA ATGTTATAAT TATAATGACG 1680 

AGTTACAAAG TGAAGATGAG GTGGGAATAA TGAGTCA 1717 



10 

(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1279 base pairs 

(B) TYPE: nucleic acid 

'5 (C) STRANDEDNESS : doilble 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 





GTAAATCTGT 


TACTCGAAAT 


GTAACGATTA 


AAGAAAAGGG 


CTCATCTCAA 


ACATATATTT 


60 




TGTTAGGCTA 


TCCAACAAAA 


GCACAGAAGA 


ATAGTCATAG 


CAAATATAGT 


GGAGTCTTTA 


120 


25 


TATATAAAGA 


CTTGAAATCA 


ATCGAAGATA 


CAAATAATGC 


TATTACGATT 


ATCACCATAA 


180 




TTACGGCTGT 


TATTTTCTTA 


ACAATTACAA 


CAGTCTTTGC 


GTTTTTCTTA 


TCGTCAAGAA 


240 




TTACAAAACC 


TTTAAGACGT 


TTAAGAGACC 


AAGCTACACG 


TGTATCTGAA 


GGGGATTACT 


300 


30 


CTTATAAACC 


TTCTGTCACA 


ACGAAAGATG 


AAATTGGTCA 


ATTATCGCAG 


GCATTTAATC 


360 




AGATGAGTAC 


AGAAATCGAA 


GAGCATGTCG 


ACGCATTATC 


CACATCTAAA 


AATATTAGAG 


420 


35 


ACAGCTTAAT 


TAACTCTATG 


GTAGAAGGTG 


TCCTAGGTAT 


TAATGAGAGT 


CGACaAATTA 


480 


TCTTATCTAA 


TAAGATGGCG 


AATGATATTA 


TGGACAATAT 


TGATGAAGAT 


GCTAAAGCTT 


54 0 




TCTTATTAAG 


ACAAATAGAA 


GATACTTTTA 


AATCAAAACA 


AACTGAAATG 


OGCGATTTAG 


600 


40 


AAaTGAATGC 


ACGATTCTTT 


GTTGTGACCA 


CAAGCTATAT 


CGACAAGATT 


GAACAGGGAG 


660 


GTAaAAGTGG 


TGTTGTTGTG 


ACAGTTCGTG 


ATATGACTAA 


TGAGCACAAT 


CTAGATCAAA 


720 




TGAAGAAAGA 


TTTCATTGCT 


AATGTATCAC 


ATGAATTACG 


TACACCGATA 


TCATTACTTC 


780 


45 


AAGGTTATAC 


TGAATCAATT 


GTAGATGGT A 


TTGTTACAGA 


ACCGGATGAA ATAAAAGAAT 


840 




CGCTTGCCAT 


TGTCCTTGAT 


GAATCGAAAC 


GTTTAAATCG 


TTTAGTTAAT 


GAATTGTTAA 


900 




AIGTCGCACG 


CATGGATGCT 


GAAGGGTTAT 


CCGTAAATAA 


AGAAGTTCAG 


CCTATTGCAG 


960 


50 


CGTTACTAGA 


TAAGATGAAA 


ATTAAGTATC 


GCCAACAAGC 


TGATGATTTA 


GGTCTAAATA 


1020 




TGACTTTTAA 


TTAyTGTAAG 


AAGCGTGTTT 


GGAGTTATGA 


TAwGGATCGC 


ATGGACCAAG 


1080 



55 
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TTACTTGTGA TGAAAATGAA AGCGAAGATA TTTTATACAT TAAAGATACA GGTACAGGCA 

TTGCACCAGA ACATTTACmA CAAGTATTTg ATCGTTTTTA TAAAGTTGAT GCAGCGnAllA 

^ ACCCCGnGGT AACCAnGTA 

(2) INFORMATION FOR SEQ ID NO: 483: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



IS 

{XX) SEQUENCE DESCRIPTION: SEQ ID NO: 4 83: 





GAG CTGTTG T 


TACTTTGATG 


CCTGCAGCTT 


TATTACGGCT 


GACTTGGTAA 


TGATAAGTTT 


60 


20 


7\ /^Tk A • 1 ■ 1 


CT CAAT AT AT 


GCTATATCAT 


ATTGAATGGT 


ACGAGGTGAT 


ACACCAAGTT 


120 






7i ' 1 w I O M I > 

CiislAl iLsAl^ 


GGaATAAACG 


TTTGCTCATG 


AATTAAAAGA 


TACAAAATTT 


180 




^V,ji>\ X X X w 1 1 


Al AAl. 1 lAAv. 


AACGTAATAT 


CCTCCTATTT 


GTAATTGTAA GCGATTTCTT 


240 


25 




A 1 A 1 V7\»AA i v« 


TCTTTCATAT 


TTT AATC CG A 


AAAATTGCAT 


ATCAAAATGT 


300 






/vVjiaX J. X X/%Xa 


WjAAC xi 1 i a 


AAATAAATTA 


rATATTCATG 


TTGACAATTT 


360 




AAAAATGTCG 


CAGTATATTT 


AGTTAGACAT 


CTAACGAAAT 


GGTGGTGCAA 


TAAATGGAAT 


420 


30 


TCACTTATTC 


GTATTTATTT 


AGAATGATTA 


GTCATGAGAT 


GAAACAAAAG 


GCTGATCAAA 


480 




AGTTAGAGCA 


ATTTGATATT 


ACAAATGAGC 


AAGGTCATAC 


GTTAGGTTAT 


CTTTATGCAC 


540 


35 


ATCAACAAGA 


TGGACTGACA 


CAAAATGATa 


ttgctaaagc 


ATTACAACGA ACAGGTCCAA 


600 


CTGTCAGTAA 


TTTATTAAGG 


AACCTTGAAC 


gtaaaaagct 


GATCTATCGC 


TATGTCGATG 


660 




CACAAGATAC 


GAGAAGAAAG 


AATATAGGGC 


tgactacctc 


TGGGATTAAA 


CTCGTAGAAG 


720 


40 


CATTCACTTC 


GATATTTGAT 


GAAATGGAAC 


AAACACTCGT 


ATCGCAGTTA 


TCTGAAGAAG 


780 




AAAATGAACA 


AATGAAAGCA 


AACTTAACTA 


aaatgttatc 


TAGTTTACAA 


TAAATGATAA 


840 




GTGTGACTGG 


TAGAAATCAG 


TCACTTTGTC 


TTTAATATTA 


TAGTTAGATA 


TCTAATTGTT 


900 


45 


AGTAAGCTAA 


TTATTGGAAA 


AGACAAGGAG 


tattgaacaa 


TGAAAGACGA 


ACAATTATAT 


960 




TATTTTGAGA 


AATCGCCAGT 


ATTTAAAGCG 


ATGATGCATT 


TCTCATTGCC 


AATGATGATA 


1020 




GGGACTTTAT 


TAAGCGTTAT 


TTATGGCATA 


ttaaatattt 


ACTTTATAGG 


ATTtTyAGAm 


1080 


50 


GAyAGCCACA 
GTTA 


TGATTTCTGC 


tAatCTCTCT 


AACACTGCCA 


GTATTTGCTA 


TCTTAATGGG 


1140 
1144 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 484: 

10 



15 



20 



25 



30 



35 



40 



ACAACCAATT 


TTACTAAACA 


TGGTrrTCAT 


gCATTTTCAA 


TATTTAGATG 


TCGACCGCAT 


60 


AATCGAAGAA 


TCGCCGACAA 


TAGTACTTAT 


CGATGAGTTA 


GCACATACGA 


ATATTTCTAG 


120 


AGATCGTCAT 


GAGAAACGAT 


ATATGGATAT 


TGAAGAAATT 


TTAAATCATG 


GTATCGATGT 


180 


TCATACCACT 


TTGaACATTC 


aaCATATTGA 


AAGTTTAAGT 


AGTCAAATTG 


AACTGATGAC 


240 


CGGTGTACAT 


GTTAAAGAAC 


GTGTACCCGA 


CTATTTCATA 


ATGAGCGCCG 


ATGTATTAGA 


300 


AGTCGTAGAT 


ATCTCACCTG 


AACAATTAAT 


TAAACGCTTA 


AAAGCTGGCA 


AGGTATATaA 


360 


AAAGGATAGG 


CTAGATGTAG 


CATTTAGTAA 


TTTCTTTACG 


TATGCCCACC 


TAAGCGAAtG 


420 


CGTACATTGA 


CGTTAAGAAC 


AGTTGCCGAC 


TTGATGAGTG 


ATAAAGAAAA 


AGTCCGACAC 


480 


AACCATAAAA 


CGTCACTCAA 


ACCTCATATT 


GCTGTGGCAA 


TTAGTGGGAG 


CATTTATAAT 


540 


GAAGCAGTAA 


TTAAAGAGGC 


ATTCCATATT 


GCTCAAAAAG 


AACATGCGAa 


GTTCACTGCT 


600 


ATTTATATAG 


ATGTATTCGA 


AAAAAACAGG 


CAATATAAAG 


ATAGTCAAAA 


GCAAGTGCAT 


660 


CAACATCTCA 


TGCTTGCAAA 


ATCATTAGGA 


GCAAAAGTAA 


AAGTAGTTTA 


TAGCCAAACC 


720 


GTTGCATTAG 


GATTAGACGA 


ATGGTGTAAA 


AATCAAGATG 


TAACCAAATT 


AATTATCGGA 


780 


CAACATATTA 


GAAATAAGTG 


GCGAGACTTT 


TTCAATACAC 


CTTTAATTGA 


CCATTTAATG 


840 


TCCTTTGAAC 


ATAGCTATAA 


AATCGAAATC 


GTTCCAATCA AACAAATACC 


TGTTGAATTG 


900 


AAAATGAACA 


AATCACCCTA 


TCGTCCTAAA 


GGCAAACGTT 


TCGCCATAGA 


TATGTTAAAA 


960 


ATGATTTTGA 


TTCAAATAAT 


TTGTGTAATG 


ATGGGACTGT 


GGATTTATCA 


ACTTGATAAG 


1020 


CATGAGTCTA 


GTACGATTAT 


TTTAATGATT 


TTTCTCATCG 


GCATCATTTT 


ATTATCCATT 


1080 


TGGACGCGGT 


CCTTCATCAT 


TGGCTTTTaG 


CAGCAATTAt 


TAACGTATTT 


GTgTkTAATT 


1140 


ATkTTTTtAC 


GGAACCTA 










1158 


(2) INFORMATION FOR SEQ ID NO: 485: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D> TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 

ATCATATGGT CGATTTAACA GATCCAACGT ACTGCtAAAT AATTACATQA CGTTTAAACA 60 

TAGCATTGAT TATAACTATT TCTAAGTCTT CGCATTATTT GCGATGATGT GGGAATAGTT 120 

ATTTTTATTT AAAAATATAA AAAAATAGAT GCAGCAAAAT TTTAAAGCAT TTTATTTTGA 180 

ACATATTAAA AGGGAGCGTA TCATAATGGA ATGTAATGTT TATATCGTAT GCATTACGGA 24 0 

TAAATAATAT ATAAATCATT CTTGAGGAGT GAAAGAATAA TGAGAGACTA CACAAAGCAA 3 00 

TACATTAATG GCGAATGGGT AGAAAGTAAT AGTAATGAAA CGATAGAAGT TATAAATCCA 3 60 

GCAACCGAAG AAGTAATCGG GAAAGTTGCT AAAGGTAATA AAGCTGATGT TGATAAAGCC 420 

GTCGAGGCGG CAGACGATGT TTATTTAGAG TTCCGTCATA CATCTGTGAA AGAAAGACAA 4 80 

GCGTTATTAG ATAAAATTGT AAAAGAmTAT GAAAACAGAA AAGACGATAT TGTACAAGCT 54 0 

ATTACGGATG AATTAGGTGC TCCTTTATCA TTATCTGAGC GTGTCCATTA TCAAATGGGA 600 

CTAAACCATT TTGTTGCAGC GAGAGACGCA TTAGATAACT ACGAATTTGA AGAACGCCGC 660 

GGAGATGATT TAGTTGTTAA AGAAGCAATC GGTGTATCTG GATTAATTAC ACCGTGGAAC 720 

25 TTCCCTACAA ACCAAACATC ATTAAAATTA GCAGCAGCAT TTGCGGCTGG TAGTCCAGTT 780 

GTACTTAAAC CATCTGAAGA AACACCATTT GCAGCTGTTA TTTTAGCTGA GATTTTTGAT 840 

AAAGTCGGTG TTCCTAAAGG TGTATTTAAC CTTGTTAATG GTGATGGTGC TGGTGTTGGG 900 

30 AATCCTTTAT CTGAACATCC TAAAGTACGC ATGATGTCAT TTACAGGATC AGGCCCTACT 960 

GGTTCTAAAA TTATGGAAAA AGCCGCTAAA GATTTTAAAA AGGTATCATT AGAGCTTGGT 1020 

GGCAAATCAC CATATATCGT CCTAGATGAC GTAGATATTA AAGAAGCGGc TAAAGCAACa 1080 

aCAGGCAAAG TTGTTAATAA TACTGGTCAA GTATGTACAG CTGGTACACG TGTTTTAGTG 1X40 

CCTAACAAAA TTAAAGATGC ATTCTTAGCT GAATTAAAAG AACAATTTAG CCAAGTGCGT 1200 

GTCGGTAATC CAAGAGAAGA TGGTACACAA GTAGGCCCTA TCATTAGTAA AAAACAATTT 1260 

GATCAAGTAC AAAATTATAT TAATAAAGGT ATTGAAGAAG GTGCTGAATT ATTTTATGGT 1320 

GGTCCTGGTA AACCAGAAGG ACTTGAAAAA GGATACTTTG CACGTCCGAC AATTTTTATT 1380 

AATGTAGATA ATCAAATGAC GATAGCACAA GAwGAAATTT TTGGGCCAGT AATGTCAGTT 1440 

ATCACTTATA ACGATTTAGA TGAAGCGATT CAAATTGCAA ATGATACAAA ATATGGTTTG 1500 

GCAGGATATG TTATTGGTAA GGACAAAGAA ACATTGCATA AAGTAGCTCG TTCTATTGAA 1560 

GCAGGTACAG TAGAAATAAA CGAAGCAGGT AGAAAGCCAG ATTTACCATT TGGTGGCTAT 1620 

AAACAATCTG GTTTAGGTCG TGAATGGGGC GATTATGGTA TTGAAGAGTT CTTAGAAGTG 1680 



35 



40 
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AGTGCACATG ACTAATTAAG TTTTGTGTAC TGTTTTAATT TTGCAATTTT TATAAATAGA 1800 

TTTTGTAATT AAAATAAAAA TTTGCTATAG TTATTCATGT ATTTAAAAGG TTGGGGATTA 1860 

GCATAATGGG ATTGTGCTAG CACAGTTATT TATGCATTGT CATGCCTATC TATTACTTAC 1920 

TAACTAAAAA ATAATGAAAT GGGTGTAAAC TATATGCCTG AAAGAGAACG TACATCTCCT 1980 

CAGTATGAAT CATTCCACGA ATTGTACAAG AACTATACTA CCAAGGAACT CACTCAnAAA 2040 

GCTAAAACTC TTAAGTTGAC GAACTATAGT AAATTAAATn AAAAAGAACT TGTTCTAGCT 2100 

ATTATGGAAG CACAAATGGA nAAAGATGGT AACTATTATA TGGAAGGTAT CTTAGATGAT 2160 

ATACAACCAG ATGGTTATGG TTTTTTAAGA ACAGTGAACT ATTCTAAAGG GGAAAAAGAT 2220 

ATTT 2224 

(2) INFORMATION FOR SEQ ID NO: 486: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486: 

ACATTACnTT GAATCGAAAG TTTCATAAAC GACTTGAATG CCAGTTTCTT TTTCAAATTT 6 0 

CTTAATTAAC TCTGGATCAA TATATTCGCC CCAATTGTAT ACGTAAATTT TTTGATTTGT 120 

ATGCACTTGT TCTTTAGATT TAAACCAATG ACTTAAAGTA AGACAAAGCA TACCCACAAC 18 0 

TAATGCACCT ATAATGAGTT GTAAAAATCG TTTCATTATT TTACACCTCG CTTGATTAGT 24 0 

35 TTTTTCTTAT TTATCACGTA TTGAATCAAA TAATATCCTA GTATTCCTAA TACAATAACA 300 

GCAAACAATA ATGTTGAAAT CGCATTAATT TCCATACTAA TTCCTTTTCT CGCCATAGCA 360 

TAAACTTCAA CTGATAACAC ACTAAAGCCA TTACCAGTAA CGAAGAAACT TACTGTGAAA 420 

TCGTCTAGTG AATAAGTTAA AGCCATAAAG AATCCTCCTA TAATAGAAGG TAAAATATTA 4 80 

GGAATAATAA TGTTGCTTAA TAATTGTGGT TCAGTCGCTC CTAAATCTCT TGCAGCATTT 540 

AACATATTAT TATTCATyTC ATACAGTTGT GGTAAGACGA TAATCACAAC TATAGGTATG 600 

CAAAATGCAA TATGAGATAT TAGAACTGTC CaAAAkCCTA AACCAAGACC AGTAAAATGG 660 

CCAATCGTTG TAAACATAAT TAAGAATGAT GCACCTATGA CAACGTCGGA ItSATACCATC 7 20 

AAGACATTAT TCAATGTTAG TAAAGTTACT TTAAACTTTT TATTTCTTAA ATAATAAATA 7 80 

GCAATGGCAC CAAATGTACC AATAACTGTA GAAATTGAGG CTGCTAAAAG TGCTACAGCT 84 0 



40 
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AATGTAAAAT 


GTTCAAAGTG 


AATCATATTA 


CCAGCCGAAT 


TGAATGAATA GAACATTAAA 


960 




AAGAATATTG 


GGATGTATAA 


AATCGCTAAA 

* -A ^^SJ^V^ ^^^^^^^ 


AGTATCCCGA 


TATACAGCTT TCCATACCAT 


1020 


5 


TTCATATGAT 


TCACCCTCTC 


CCATTAGATG 


ATTTTGTAAT 


GATTAAAATG AATGCCATAA 


1080 




ATACAATTAA 


GAATATAGCT 


ATAGTTGATC 


CCATACCATA ATTTTGAATT GTTAAAAATT 


1140 


10 


GTTCCTCTAT 


TGCCGTACCT 


ATATTTATGA 


CTTTATTACC 


TGCAATTAAT CTTGTAATCA 


1200 


TAAATAATGA 


AAGTGATGGA 


ATAAAGGTTA 


CTTGAATCCC 


AGTCATAACA CCTTCTTTTG 


1260 




TTAACGGCAT 


GATTACTTTT 


CTAAAAGTAT 




UWV-AUU i AAA TCACmXiAGG 


1320 


IS 


CCTGCAATAA 


ATTATTAGGA 


X X \ J\_> XXX 


TGCTATTAAA 


TATAGGTAAA ATCATAAATG 


1380 


GTATATAAAT 




A CT* APT AJVJX A 


ACGCACCAGT 


TGTAAATAAC AAATTGAATG 


1440 




ATGGTAAATT 






AATCACGCCA 


TCATGACTTA ATAAACCTAT 


1500 


20 


AAAAGCATAT 


GTCTTTAACA 


ATAAATTTAT 


CCATGTTGGA 


ATAATCATTA TCATTAATAA 


1560 




GATATTTTGA 


AATTTCGAAC 


GAGTAATATA 


ATAGGCAGnT 


GGATAACTGA TAGTCAAGGT 


1620 




AATAATTGTT 


ATTGAAGCGG 


CATATAAAAT 


TGAATATGCA 


AACATTTTCA AATATTTTGT 


1680 


25 


AGTAAAAATT 










1690 



(2) INFORMATION FOR SEQ ID NO: 487: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2112 base pairs 
^0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 87: 

ACGAAAAGAA ATATTATGAT GAACAAAAAG AAAGAATAAC GATTTATATG AAGTACAATG 60 

TGAAAGGTTA TAAAAATATA AGCTTCGCTA ATTTTAAAGA AAACCCAATG GATGGTTATT 120 

40 

CTATTAGTGG TTATATAAAT AAtGaTAAAA AGTTATCATT TACAGCTGGT ATAAGATCTG 180 

TTGATGATTT TCAATTTGAT ACCGATATTT CTTATACAGA TGAATTGGGT AGAAAATTTA 240 

ATAAAAATCC TAAGTCAGTT TCTGAAATAA AAAAAGAGCA AAATACGTCC AATAAATAAT 3 00 

45 

TGTTCATATT GTGATGAAAC AAAAATATAA GTCATTAGAT GAGTTTAACT ATGTTATAAA 3 60 

TATTTGTAGT ATCTATAAAA ATCTCGACAC TATTAAAATG ATAAAGTGCC GAGGTTTTCT 42 0 

TACTTATTTA GTTAATTCAA AGTTTATGCC AGATTCATAA GAATTTGTGA CACTTTTAAT 4 80 

AGTGTAcCAT TGATTATTAC AATTTATCAA ATGGTCCTTT AGAAGGTATA AATAACAAAA 54 0 



SS 
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5 



10 



15 



20 



25 



35 



40 



TATTATTATG 


TTCAAAACTT 


TACGCTCCAA 


AAAGTAAAAA 


GGAAGTTAAG 


CAATGTTTAG 


660 


TTGCTTAaCT 


TCGGaTATTG 


AACGCATCAG 


TCCAATTTGA 


CATAGAGCCT 


TTTTTAGTTC 


720 


TTGATGTTTC 


TCTTTAAAAC 


CTTGCATATT 


TTACAAAAAG 


AAAAATTAGC 


AGTATAATTA 


780 


AGACAACGAA 


AATAAGTATT 


TACTTATACA 


CCAATCCCCT 


CACTATTTGC 


GGTAGTGAGG 


640 


GGATTTriAT 


TGGTGCGGCT 


ATATGTCACC 


TATTTTGTAT 


TGCGTCTACT 


TAGCCAATAA 


900 


GAAAAAAACG 


CAATGGCACA 


GCCACTGATG 


ACTGGTGCTA 


TGATGTGAAC 


GaAAATAAGC 


960 


ATCACCTTAT 


ACACCTCCTC 


TCTGCGTCTA 


AATTGACGsC 


TGAGaGrTAG 


GcGACTCTAC 


1020 


TATTATATCA 


TCGGCAAATA 


TACAAGCACA 


GTCACTTGCT 


TCTGATAAGT 


TATATGATTC 


1080 


TAGCTGATAG 


ATTGAATCGT 


CTACACTTAA 


TTGGACAAAT 


TCTATGAGAA 


TAGATATTGT 


1140 


TAATTTAAGA 


AAGTAGGCGA 


TTTTATTATG 


ACAAGAGAAA 


GAAGATCATT 


TAGTTCAGAG 


1200 


TTTAAGTTAC 


AAATGGTTAG 


ATTATATAAA 


AATGGTAAGC 


CTAGGAATGA 


AATTATACGC 


1260 


GAGTATGATT 


TCACACCTTC 


GACGTTTGTA 


AATGOCGGTT 


ATAAAATGTA 


GGAAAAtGGA 


1320 


TAAAGCAACA 


TCAAAACACG 


GGTACATTCA 


ATCACCAAGA 


TAACTTATCG 


GATGAAGAAA 


1380 


AAGAGCTGAT 


TAAATTACGC 


AAAGAAGTTC 


AACATTTAAA 


AATGGAGAAC 


GATATTTTAA 


1440 


AGCAAGTAGC 


GCTGATTATG 


GGGCAAAAAT 


AGAAGTCATT 


CAAAAGAATG 


CACATCAATA 


1500 


TTCAGTATCA 


GCAATGTGTA 


AAGTCCTGAT 


AATACTAAGA 


AGTACCTATT 


ATGATTCTAT 


1560 


AAAAAGAAAA 


GATAATAAAA 


TCACTAAAGA 


TGATTCAAAC 


ATAGAACATG 


CCXrrCATAAA 


1620 


TATTTTTAAT 


TCTAATAGAA 


AAGTCTTTGG 


TACAAGACGA 


ATTAAAAATC 


ATTTAAATGA 


1680 


CAAGGGTCTC 


ACTGTATCTG 


GACAAAAGAT 


AGGTCGATCA 


TGAAAAAATC 


TAGTTTCTGT 


1740 


TTATACGAAA 


TCTAAATACA 


AAAATCATCT 


AAAAGAAACT 


AATGAAAAAC 


GAATTAAAAA 


1800 


TCTTTATTAT 


TAGCTGCTGG 


TGTATTATTT 


GTTAGTCCAA 


TTTCATTATC 


TTATAATTCA 


1860 


GATGTAGCTC 


ATGCTGAAGA 


TAAGTTAGAC 


CATTCTCAAG 


CAAAGGTAAT 


ATATTTGAGT 


1920 


AACCAAAATT 


TATTTGATGA 


ACTTGAGAAA 


AAAGGTTATA 


AACTGGAAGA 


TATATTTACA 


1980 


AAAGAAGAAA 


TAAAAAAATA 


TAAAGCTGAA 


GACCAATTGA 


GAGCGGGTAA 


AACTCAATAT 


2040 


GTAGAAACAG 


GTAAAGATAC 


TGCAACATTA 


TATCTTTCTT 


CTGCATATAC 


AAAAACAATA 


2100 



45 

GCTGCTTTAG GT 2112 

(2) INFORMATION FOR SEQ ID NO: 488: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 
(B> TYPE: nucleic acid 
(C> STRANDEDNESS : double 

55 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 

^ GTAGGCACAC ATCTGCCATA TAAACATTCT TTTATACTAG TGTTCTCATA TAGTGTAGAG 60 

TTATAGTCTC CTTCTTGAAT CTCGAATAAT TCAATCAACC TATCAACCTT AGTCTCTTCC 120 

GTTACTTCTT TTTCAATATC AACTATGAAG GGGATATCAA TTGGAATAAA ACTTGACGTC 180 

,0 GAACACTTAT TTGTATTTGG ATGAAAACGA ACGAATCCAT CACTAAATCC TGTTGAAAAA 24 0 

AATATTTTTC CTTGTGATAG ATCCGGATTT TCTCGCGCCC ATTTAATTAA TTCATCTAAT 300 

CTCATTTCTT TTTTAACTTT GATTTTCATT GTTATATCTC CTCTTGAACA GTAAATTTAT 36 0 

IS CGTTAAtTGA TACGTATCCA GTCACATTAC ATAAGATGCT ATCAACATCA AAAGTCACAC 420 

AACAGTTGCG TTCAACATCA TTTGAATAGA ATCT 454 



{2> INFORMATION FOR SEQ ID NO: 4 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1372 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: dovible 

(D) TOPOLOGY: linear 

2S 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 



30 


TTGTCAGAAT 


TAGAATGCTT 


TTGAGTTACT 


TCATAATACT 


CATCAGTTTT 


TTGTGTATCC 


60 


TTTTGACTTT 


TATTTATTTC 


TTTCCACTTA 


CCAGTATGAC 


TTTCTTTTTT 


TACAGTTATT 


120 




TTCGGTTTGT 


TTAAAAAATG 


ATAACCGATG 


TTCTTTTTGG 


TATCTGTGGA 


CTCTAAAAAG 


180 


35 


ACTGAATTGT 


TTTCCTGATT 


ATCAGAATTG 


GTTGTTTGTT 


TATCGTCTGT 


ATATAACGTA 


240 




TACCCATTCG 


CTTTGCGATT 


TTCAATCGTT 


GTAACTGGAT 


TCTTTGATGA 


ATCAGTACTT 


300 




TTAATGCCAG 


TTCCTAAGAA 


GACAATTTTA 


TCGTTTAATA 


TGAAATATGA 


Ti-rrri'GGCA 


360 


40 


GTTAAAGTTT 


TGTCCTGATT 


TTCAAAATCC 


ATTCCGATAC 


TAGCATGTTG 


GTCATCAACT 


420 




TTTGTTCCGC 


CAACAAAAGT 


TTTACTCGAC 


TTTTTATCAT 


CCGTATCTTT 


TAATATTTCA 


480 




TTGTCTAAAG 


TTGTTGTACC 


TGATAAACGT 


TTCATATCGG 


CTGTCACCCA 


GAAGTTATCA 


54 0 


45 


TGATAGTGTT 


TGACATCGCT 


GTTATATAAA 


TAAGACATTC 


CAGCACCAGT 


GTGCCAACCT 


600 




TTTAAATTCT 


CTCCGTTGAT 


ACTTTCATAG 


CGTGCTACGT 


TTTTCGACGT 


CATACTTAAA 


660 




CCAAATGCAA 


AGTCTAAGTC 


TTTGTTATGA 


TAGGTGACAC 


GATCCATGTC 


ATTATATATT 


720 


SO 


TTAAGTTGTT 


GTGTTAATCC 


GTTTTTAGAA 


ATACTGTTAT 


CTGTCATTAA 


AGACTTCATT 


780 
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ACTGAAGATT 


TGACAATCTT 


TTTATACTTA 


GCTTTTGTTG 


AATCATCCAT 


GGCATCACTT 


900 




AATCTCAACA 


ATGATTTCAT 


TACTGTTGCA 


GATGCTGAGT 


GACTGGTTTC 


ATTTTCACGA 


960 


5 


CTGATAGCTC 


TACCTCGTGA 


TAAATCCATC 


ATTTCACCTT 


TATAAATGAG 


TGGCATAAAT 


1020 




CCGTCGTCAA 


TCCATGACTT 


TAAGGTTGTA 


TCATTTTGGG 


TTTTATCATT 


AAAAGGTGTT 


1080 




TCTTTTATCA 


TCGGCATCAT 


TTGAGAAATC 


CCCTCTAAGA 


GTACAACGCC 


ATAAGCACCA 


1140 


10 


GTGTATGGAA 


CGTCTTGATG 


ATCAATGTAA 


GAGCCATCTT 


TATAAAATCC 


ATTACGTTCT 


1200 




TTACCAGTGG 


CAGAATCTTG 


AACGTAAGTG 


AAGACTTTAT 


TAAATGAATC 


TATAGACTTT 


1260 


IS 


TTCATCATAT 


CTTTATCTTC 


TTCGATAATA 


CATTCTAAAA 


GTTTCACCTT 


AGAAATGTCT 


1320 


ACTAnATTTC 


CGCCTTTAGC 


AAGTTCAGnT 


TTTCCTACAC 


AAGATAATAT 


TT 


1372 




(2) INFORMATION FOR SEQ ID NO: 490: 








20 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








25 


• 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490: 








ACAACAATAT 


AGTAAAGCGT 


CGTTAATTAA 


ACAATGGGAC 


CAATTTGTTC 


GTCTTATATA 


60 


30 


AATGTACTTT 


aCCTTTATTT 


TTACAAAAAT 


AGCATTTTCC 


TATGTCATTT 


AACTAAACAT 


120 




GTAAGTTCGT 


ATGAACGAGG 


TTTGTTAAAT 


AGATGATTCT 


AGGAAAATGC 


TTTTTTCTTT 


180 




TGACTTAGTT 


TAAAATATTT 


TGCCACTTTG 


TACTGATAGT 


AGTTGCATTG 


TACTGTTGTG 


240 


35 


CAGATTCTAT 


GCTATTAATT 


GAAAACTGTT 


GCAArrrn'G 


AGTATTATTT 


AGTAATTGGT 


300* 




CGACCTTTTC 


AACCATTTGA 


TTGATGTCAC 


CTTGAGGTAC 


TAAATAGCCA 


TTAAATCCAT 


360 




CTTGaATCAG 


TTCTGaTGGA 


CCATAATCTA 


CATCATAACT 


GATCACTGGT 


GTACCTACTG 


420 


40 


AAAgCGACTC TAAAATTGCT AAGCCAAAAC 


CTTCCATTTT 


ACTTGTCGAT 


AACATCAGTT 


480 




CTGCTTTAGC 


AATCTCTTCA 


TTAATATGCG 


TCTTAAAACC 


ATGAAATTTA 


ACATGTTCCA 


54 0 




GATATnATGA 


TAATCTTCTA 


CAAG 








564 


45 


<2) INFORMATION FOR SEQ ID NO: 491: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 
so (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491: 





TAT d ri^nrr c 




CGGAAACTTA 


TTGTGTTACA 


AGATATATAA 


GCAGAAACGA 


60 


5 




Ar*AA^AT&&& 


i. VJAAA^I iTAAA 


CGTTTTAAAA 


ATGAAACAAA 


TGAAATCATC 


120 




T UTT Af3f2TT A 
X^\X X^WJ^S X x^\ 


TfZ a. A ft. r*TV2T^ 


lAI AvjC 1 IGA 


ATAGAAGCAT 


TTArrrrrrA 


GGAGGACAAT 


180 




XJ\X XJ\.L\3^\3 X 


v-AAAC*A i J. JL A 


TGGCAAATGA 


ATCAAACATT GAGCGCAAAT 


GGTATGTTAT 


240 


10 






TAGGTCGTTT 


ATCATCAGAA 


GTAG C!ATCT A 


TCTTACGCGG 


300 






v» r AACTT ACA 


CAC CACACGT 


TGATACTGGT 


GATTATGTAA 


TCGTTATTAA 


360 


is 




ATCGAATTTA 


CTGGTAACAA 


AGAAACTGAr? 


ft A ft GTTT ^ fT* 


ACCGTCACTC 


420 


AAATCACCCA 


GGTGGTATCA 


AATCAATCAC 


TGCTGGTQAA 


TTAAGAAGAA 


CTAACCCAGA 


480 




ACGTTTAATT 


GAAAACTCAA 


TTAAAGGTAT 


GTTACClAAGr' 


AV* X X X X AVJ 


GCGAAAAACA 


54 0 


20 


AGGTAAAAAA 


TTATTTGTAT 


ATGGTGGCGC 


TGAACATCCA 


CAGGGTG C^A C 


AACAACCAGA 


600 


AAACTACGAA 


TTACGTGGTT 


AATTAGAAGG 


AGGAAATGAC 


TTTGGGAGAA 


GTTGAATATA 


660 




GAG<jCACAG<j 


C CGTCG TAAA 


AACTCwGtAG 


CACGTGTACG 


TTT aGTA^ Ca 


GGTGAAGGTA 


720 


25 


ACATCACAGT 


TAATAACCGT 


GACGTACGCG 


AAT ACTT AC C 


ATTCGAATCA 


TTAATTTTAG 


780 




fk /■■^w n jk jk m ^^ik 

AC i i AAACCA 


AC CATTTGAT 


GTAACTGAAa 


CTAAAGGTAa 


CTATGATGTT 


TTAGTTAACG 


840 




TTCATGGTGG 


TGnTTCACTG 


GACAAGCTCA 


AGCTATCCGT 


CACGGAATCG 


CTCGTGCATT 


900 


30 




GATCCTGAAT 


ACAGAGGTTC 


TTTAAAACGC 


GCTGGATTAC 


TTACTCGTGA 


960 




vCCACUTATG 


Ik Ik Ik Tk Ik ^ Ik rr^m 

AAAGAACATA 


AAAAACCAGG 


TCTTAAAGCA 


GCTCGTCGTT 


CACCTCAATT 


1020 




CTCAAAACGT 


TAATTGTCGG 


ACGATATATA 


CAAAACACCT 


CGATATTATG 


TCGAGGTGTT 


1080 


35 


TTTTTGGCGT 


TTTTGCGGCG 


AATATGGAAT 


GTGTAGAATA 


TAAATGAATT 


TTTACCTTCC 


1140 




CACCATAAAA 


GATGAAGAAC 


CATGAATGTG 


GAGAACAATA 


AATAGTTGGA 


TATTCTGTTA 


1200 




TlTTrritiGA 


AGTGGAAGTG 


GATTTGGAAT 


ACTTTACTCn 


AAACXSATTAA AAGGTTTAAA 


1260 


40 


AAAACAACAA 


AxiAGAAA 










1277 




(2) INFORMATION FOR SEQ ID NO: 492: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



so 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492: 
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5 



10 



TTATTGACAT 


•rcrmTTATc 


CAAAATTCAT 


TGTTAAGACA 


TTTTCTTTAT 


GAAATAATAA 


120 


TnATATTGAA 


GTATATi-rrr 


ATTATTATTA 


AAAATAAATA 


AGGGGATACT 


TATGAGCACA 


180 


AATCAAACAT 


TTTTAATATT 


TGTTATAGCA 


ATTATTCTAC 


TTACATCTGT 


AATAGGAATT 


240 


GTTGGACGAT 


ACATGAGTCG 


TCAACGTCTA 


TTAAAATCTA 


TGGAAACATT 


ATGGCAAACG 


300 


ATTTCTCCAT 


TAGAAGCTTT 


TATCAGACCG 


AACTCACATT 


TCGACTATGA 


GTATAAGCTC 


360 


TACAAGGAAA 


AATTTGAATC 


ACATTCATTA 


GTTGATGATA 


AAACTTGGTC 


CGACTTAAAT 


420 


ATGAATGCAA 


TCTTTCATAA 


GATGAATTAT 


AATTTAACAG 


CTATTGGTGA 


AATGAAGCTA 


480 


TATGCCTGTT 


TACGTGGAAT 


GCTTTCAATT 


ACGAACAAAT 


CATTACTTAG 


TTTATTTAAT 


54 0 


GATAATGCTG 


AATTTAGAAA 


AAACGTAACA 


TATCATTTAG 


CTTTGATTGG 


tAAAACTGTT 


600 


aTCCAACATT 


TCCAGACCAA 


ATCACACCGG 


kAAACGTCCA 


AATATATTGn 


TCTATGCCCG 


660 


GTTTACCAGT 


ATC 










673 



20 

(2) INFORMATION FOR SEQ ID NO: 493: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493: 



3S 



40 



AAAAAAAGTT 


AATGCTGACG 


GTGTATTAAC 


TTTTGATATT 


CTAGAAAATA 


AATATACTTA 


60 


TGAAGTTATT 


AACGCTATAG 


GGAAAAGATG 


GATTGTTAGT 


CATGTCGAAG 


GTGAAAACGA 


120 


CAAGAAAGAA 


TATGTAATAA 


CTGTCATTGA 


TAGGAAATCA 


GAAGGCGACA 


GACAACTGGT 


180 


TGAATGTACT 


GCTAGAGAGA 


TTCCCATAGA 


CAAGTTAATG 


ATTGATAGAA 


TTTATGTTAA 


240 


TGTAACAGGA 


TCTTTTACAG 


TAGAAAGATA 


TTTTAACATT 


GTGTTTCAAG 


GTACTGGAAT 


300 


Gc-rrrrixyuv 


GTCGAGGGCA 


AAGTTAAATC 


TTCAAAGTTT 


GAAAATGGTG 


GTGAAGGCGA 


360 


TACAAGGTTA 


GAAATGTTTA 


AAAAGGGATT 


AGAACATTTC 


GGTTTAGAAT 


ATAAAATAAC 


420 


GTATGACAAA 


AAGAAAGACA 


GATATAAGTT 


TGTATTGACG 


CCTTTTGCAA 


ATCAAAAAGC 


480 


GTCTTATTTT 


ATTTCTGACG 


AAtCAACGCC 


AACGCTATAA 


AACTCGAGGA 


AGATGCAAGT 


540 


GATTTCGCCA 


CCTTCATTAG 


AGGATATGGT 


AATTATTCAG 


GAGAaGAAAC 


ATTCGAACAC 


600 


GCTGGGCTCG 


TAATGGAAGC 


TAGAAGTGCA 


TTAGCTGAAA 


TATACGGCGA 


CATCCACGCA 


660 


GAACCATTTA 


AAGATGGTAA 


AGTGACTGAC 


CAAGAAACTA 


TGGATAAAGA 


ATTACAATCG 


720 
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TATCCAGAAG CAGACCCACA ACCCGGAGAC ATAGTACAAA TAAAATCTAC CAAACTAGGT 84 0 

TTGAATGATT TAGTCCGTAT AGTACAAGTT AAAACGATTA GGGGTATAAA CAATGTAATT 900 

GTTAAGCAAG ATGTAACGCT TGGTGAGTTT AATCGAGAAC AACGATATAT GAAAAAAGTT 960 

AATACTGCAG CTAACTATGT TTCTGGATTA AATGATGTTA ACCTTTCTAA TCCTAGTAAA 1020 

GCGGCAGAAA ACTTGAAGTC TAAAGTAGCG TCAATAGCTA AATCAACACT CGATTTGATG 108 0 

AGTAGAACTG ATTTGATTGA AGATAAACAA CAGAAGGTAA GCTCTAAAAC TGTGACTACA 114 0 

TCTGACGGCA CTATCGTTCA TGATTTTATa GATaAATCmA ACATTAAaGA TGTAAAAmCG 12 00 

aTTGGAACGa TTGGCGATtC TGTAGCTAGA GGATCACATG 124 0 



IS 

(2) INFORMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494: 



30 



35 



40 



45 



SO 



ACGGTGGATT 


TAGCCTAGAA 


GATTTAACGC 


ATCAAGGTAa 


ATTaTCAGCA 


TTTAGCTTTA 


60 


ATGATCAAaC 


AGGTCAAGCA 


ACATTGATTA 


CTAATGAAGA 


TGAAAACTTC 


GTAAAAGATG 


120 


AGCAACGTGC 


TGGCGTAGAT 


GCAAATTATT 


ACGCTAAACA 


AACATATGAT 


TATTACAAAG 


180 


ACACATTTGG 


TCGTGAATCA 


TATGaCAACC 


AAGGTAGTCC 


AATTGTTTCA 


TTAACGCATG . 


240 


TTAATAACTA 


CGGTGGTCAA 


GATAACAGAA 


ATAATGCCGC 


ATGGATCGGT 


GACAAAATGA 


300 


TCTATGGTGA 


TGGTGATGGT 


CGCACATTCA 


CAAGTTTATC 


GGGTGCAAAT 


GACGTAGTAG 


360 


CACACGAATT 


aACACACGGT 


GTGACACAAG 


AGACAGCGAA 


CTTAGAATAT 


AAGGACCAGT 


420 


CAGGCGCTCT 


AAATGAAAGC 


TTTTCAGATG 


rrrri'GGATA 


CrriXJTAGAT 


GACGAGGATT 


480 


TCTTAATGGG 


TGAAGATGTC 


TACACACCTG 


GAAAAGAGGG 


AGACGCTTTA 


CGCAGCATGT 


540 


CAAACCCAGA 


ACAATTTGGT 


CAACCAGCTC 


ATATGAAAGA 


CTATGTATTC 


ACTGAAAAAG 


600 


ATAATGGTGG 


CGTACATACG 


AATTCTGGAA 


TTCCAAATAA 


AGCAGCTTAT 


AACGTGATTC 


660 


AAGCAATAGG 


GAAATCTAAA 


TCAGAACAAA 


TTTACTACCG 


AGCATTAACG 


GAATACTTAA 


720 


CAAGTAATTC 


AAACTTCAAA 


GATTGTAAAG 


ATGCATTATA 


CCAAGCGGCT 


AAAGATTTAT 


780 


ATGACgAGCA 


AACAGCTGAA 


CAGGTGTATG 


AAGCATGGAA 


TGAAGTAGGC 


GTGGAGTAAA 


840 


AATATATAAA 


CaAGAAGAAG 


TAATGTTAAA 


CACTTATAAA 


TAATTAAATT 


TTAAATACAG 


900 



55 



BNSOOaO: <eP 076651 9A2J_> 



1420 



EP0 786 519 A2 



ATTAGATGAG AGGAGTGTGA GGGTTGTCTG CCGAAAGACT ACTCGGCAGT CTAAAATCAT 1020 

TACAAGTAGT AGATATGTGA TAATTAAATG CTGACTTAGA ATACAAAATT CATTTTAAAA 1080 

GTTGTCACAA AAAATTTACA TGTATTTTTA TTATCTTTTG CAAAACAAAG TGTTAAATTA 1140 

TAAATGAaAC ATGCATGAAT TTATTTTTTA ATACAAGAAA CGTAACTACC AAAGGA G TTT 12 00 

ACAATATGAA GAAAAGTAAA CGATTAGAAA TTGTTTCTAC AATAGTTAAA AAGCATAAGA 1260 

TTTATAAAAA AGAACAAATc ATTTCATATA TTGAAGAATA TTTTGGTGTA A 1311 



(2) INFORMATION FOR SEQ ID NO: 4 95: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1761 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDE0NE5S : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495: 

TGCACTTTCT AAAAATACTT GCTTTACTTG TTCCAATTCC TTGTCAGACA TTGATTCGGC 60 

TAATTTATTC ATTATTGCCT CAAATACGTT AATTATGTCA TCCATTTCAG TACTATAAAA 120 

ATCAAACCAT TTTGCCGTAT CTTTTTCTCT ATTAAGCTTA TGATCACTTT GAGAGCGCTT 180 

AGCTAATTCT GCATAAATAT ATGGACAAGG TGCCATTGCA GCAATTGTAT AAATAGCATT 240 

30 TTCACGACTA TGCGCTTGGA AATACATATG TTTTATGTAA TGGTCGCCAC TTGGAGGCCA 300 

AACTTTTGTT TTAATGATTT CTTCGTATGA TTCACCAACA ATTTGCGCTA AAATATCATG 360 

CGCAAGTACT TCACCTTCAA CCATAAATTC TATTTGCTCT ACTAAAAATT TTACGTCATT 420 

3S CATGCTATTC ATTTTTGGAA TTAACAAAGC ATATAAGTTT GTAAATTCTT TTAAATACGC 4 80 

AGCATCAGCT TTTAAGTAAT GGCGTAATGC GTCAGcTCCT ATATCTCCGG ATAACATCTT 540 

CTGAATAAAG TCATCCTCAT AAATATCATT AATGATTGGC TTTGCAGCTT GGTACAATTT 600 

TTGTGAAAAT TCCATTGTAA AAAATCCTCC CTAAATAAAA AAACTACTTC CAACATGAAA 660 

GTAGTTTGAT GGCAATGTTG CTATACTAGC CCCATCACTT CAATAACTAC TTTCCTACGT 720 

TGGTACTAAC CAAATCAGGT CATAAGGGTC TGAACAATTC ATCTCAGCCA TATCATTAGG 78 0 

45 

CTCCCCTAGT AGTTCCTTAG TATTCAATTG CAAATTAATC TTAGCAAACG G TTTCAACAT 84 0 

TTTCAATTAT TGTTGCTCAG TTGTATTATT ATCTTTAAAT AATAATTCTA TAATGACATA 900 

TATTTGCGAA ATAAAAAAAC CGGAACATAT CGAGAATTCC CCGATATATT CCAATCTAAA 960 

SO 

AGTTACTTAT ATAACTATTA ATTAGCTATG CATAAATGGC TTATGCAGTA ACCCAATGTC 1020 

55 
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30 



40 



45 



1140 
1200 
1260 



1380 
1440 
1500 
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TTGCTGGTGA TACACCTTTA TATTTAGCAG GTGCTACTGA ATCCCAAGTT GATTOTAAGA 
ATTGATACTT ACCAGCTGCA CCTGgATGTT GGrTTTACAG CATGAATATT GCCACCTGaT 
TCACGTTGrG CAATTTGTTT TAGATGAGCa TTCACATTTA CTGATGAACC TTCTGATGAT 
TTTGATyCAG TTGGTGTTGC AGTAACTTGT GAATTGTTTG ATGTTGATCC TTCTGGTTGT 132 0 

TGAGTTTGAG CATTTTGTGG TGCTTCAACT TCTTGTGATT GTACTTGATT AGCTTGAACA 
GCTGATGGTG CAACATTATT AGTTGCAGGT GCTTGTGCAC TCATGTCTGC TCCATTAGTA 
CCTGTTGCAT GGTAATTCCA AGCAAAGTGT GTACCATCTG ATTCAAAGTG ATAAGTAAAC 
CCTTCATAGT CAAATGTATA ATTATAAGCC CCAGCTTCAA TTGGTTTTTG ATTTAATGTT 1560 
TGATCATTTG ATTGCGCCAT TTGCCTGAAA GATGCTTTAT TTAAGTCCGC TTCACnTGCA 1620 
TGGGCTTCGT GGACCTGCAT TTCCTGGCTA CGATTCCTAA ACCTACTGGC nAAnGATGAT 1680 
GCGAGTAATG TTTTCTTCAT AATCTTAAAA TCCTCCTACA AGTGAATTTG TGTCTCTAAA 174 0 

AGTTTTACAG TGGAGGACTG T 
(2) INFORMATION FOR SEQ ID NO: 496: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 base pairs 

(B) TYPE: nucleic acid 

(C) STRAJIDEDNESS : double 

(D) TOPOLOGY: linear 



1761 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 

TCATTTATGA AAAATGTCGG AnAGAnCCaa GaAAmaCAAT TGAgCGTGAA GAAAAAGCAA 60 

35 GACTTAAAGA AGAACAAAAG GCACGTCAAA ATGAACAGCC ACMATPiPJA GATGTGAGTG 120 

ATTTTACGGA AGTGCCTCAA GAAAGAGATA TTCCAATTTA TGGGCATACT GAAAATGAAA 180 

GTAAAAGCCA GAGTCAACCA AGTCGAAAAA AACGAGTGTT TGATGCAGAG AATAGTTCGA 24 0 

ATAACATCGT AAATCATCAT CAAGCAGATC AGCAAGAACA ATTAACAGAA CAAACTCATA 300 

ACAGTGTTGA AAGTGAAAAC ACTATTGAAG AAGCTGGTGA AGTTACGAAT GTATCGTATG 360 

TTGTTCCACC GTTAACTTTA CTTAATCAAC CTGCAAAACA AAAAGCAACA TCTAAAGCTG 420 

AAGTGCAACG TAAAGGACAA GTACTAGAGA ATACATTAAA AGATTTTGGG GTAAATGCAA 4 80 

AAGTGACACA AATTAAAATT GGTCCTGCAG TAACTCAATA TGAAATTCAA CCAGCTCAAG 54 0 

GGGTTAAAGT GAGTAAAATT GTAAACTTGC ATAATGATAT TGCATTAGCT TTAGCAGCAA 600 

AAGATGTTAG AATCGAAGCG CCAATACCTG GTCGTTCTGC AGTAGGTATT GAAGTGCCAA 66 0 



SO 
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ATAAACTAGA AGTTGGATTA GGaAGAGATA TATCAGGTGA TCCAATTACT GTTCCACTAA 
ATGAAATGCC ACAC 

(2) INFORMATION FOR SEQ ID NO: 4 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 97: 

75 





AGCCAGTTTT 


GcATTTCGTC 


AAATCGCAAT 


GAATATATTG 


ATTGCTTGTC 


AAAATTTGGA 


60 




AGAAATTTAT 


TCTTCTTTTT 


CTTTTCAGCA 


GTTATCATTC 


CTAATTGTCC 


TTTCTTTTTA 


120 


20 


TCTTAGTGAT 


AAAGAATCCA 


TCTGAATTAA 


AGTCTTGCGG 


CATGATTTGT 


AACGTTTTGA 


180 


CCAACTCTCC 


AGTTATCGGA 


TGTTGAAACG 


GTTCAAATTC 


GAAGTTTTTA 


TTATTTTTCA 


240 




AAAACGTATA 


AATCACGTTT 


TCATTTTCTA 


GTTGCTCAAT 


TGTACATGTT 


GAATAGATGA 


300 


25 


TTTCTCCACC 


TATTTTTACA 


TTGTTTTTTA 


CATTTTCCAA 


TATTTCAAGC 


TGTAATTCAA 


360 




CTAGTGACTC 


AATATGTTGT 


TTGCTTTGAG 


TATACTTAAT 


CTCCGGCTTA 


TGTCTCATTA 


420 




CACCTAATCC 


GCTACATGGT 


GCATCAACAA 


GTATCTTATC 


GTATGTTTTA 


TCATAAGGTT 


480 


30 


TTGTCGCATC 


ATGTTGAAAA 


GCTTTAATAT 


TTGTTAATCG 


TAATTnTTTT 


ATATTAAAAT 


54 0 




TAATTAAGTC 


TATTTTGTGA 


TCATGTATAT 


CTGAAGCGTC 


AACTTGCCCT 


TCTGGCATTA 


600 




AAACTTCAGC 


AATGTGACAA 


GCTTTACCGC 


CAGGTGCACT 


ACATGCATCT 


AATACGTGAT 


660 


35 


CATGTCGGTC 


TACATTCATA 


ATGTGTGCAa 


CAAACATTGA 


GCrriTATCT 


TGAATTGAAA 


720 




CGAATCCATC 


TTTAAATGAA 


CGAGAATGAA 


TAATTGGTTG 


TCCTCCTATA 


TGGAGACAAT 


780 




AAGGTAAGTC 


ATGATCTTTT 


TCAACGTCAT 


AACCTTCGTC 


TTGCAACTTT 


TCAATAATAT 


840 


40 


CATCTAATGA 


TGCTCGCGTC 


AGGTTGGCAC 


GCACAGTTGT 


TGATGTCGTT 


TCTAAAAATG 


900 




ACTGTAAAAT 


TTTTTCAGTT 


TCTTCGAGAC 


CATAATGTGT 


TGCCCAATGA 


TCTATAATCC 


960 


45 


ACTTCGGCAT 


ACTATACTCG 


ATTGCCATTC 


•rrrrriTAGG 


ATCTGCAATT 


TCATTAAAAT 


1020 


CAGGTAAGTC 


ACTACGCATC 


ATTGTACGTA 


AAATACCATT 


TACGACATTA 


CCATTATGAT 


1080 




AGCCACCGCG 


TTCTTTTGCT 


ATTTCAACTG 


CTTCATTAAT 


AATGGCATGA 


TTTGGAACTT 


1140 




TATCTAnATA 


nACATATTGA 


T 








1161 
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(2) INFORMATION FOR SEQ ID NO: 4 98: 
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(A) LENGTH: 1504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498: 

AGCTCACGTC ATCTTCGGCG GCGCTAAATT AAAATAATCA ATTTCTGAGT TAAACTTTTA 60 

TTTACAACAT ACTATTACTA TACATTACAA ATTTTTAAAA TATACATTAC ACTCATTACT 120 

CAATGGmAAG CGTATGATTT CmCAGCCCCC CTAGCTTGTA GAAATCATAC TTTCCTTTTT 180 

15 TCAATATATA TACAACTATT AAATCCCATA AGATTGCAGA GCACATAAGT AAATTTTTTT 24 0 

AGAGCTTGAG GTTTGTTTAG CTTAAGCAAC CCATGAGCTC AAACACTTCC TGTTCACATA 300 

ACACTACAAA TCGCATTATG TTGCTTAATC TTATGTTTAT ATAAATTACA CACAATAAAT 360 

AGAAAGAATG TGAACATCAT GAATAAATTA TTGCTACTCG TTACATTTAT CATTCGTGTG 420 

GGTTCAGGTA TTGTTATGTT AATGCAAGGC TACGAAAAAT TAACGGGCGG ATTTACGCTG 480 

AAAGGTTTAG TACCAGTCAT CGCTAACAAT ACTGATTCAC CAGAGTGGTA TAAGTGGTTT 54 0 

TTCGCAAATA TAGTTGCACA TACGACGTCA TTATTTGATA TTGTTGTCCC ACTCGGAGAG 600 

ATTGCAATTG GATTAGGTTT AATTTTTGGA GTTTTTGCAT ATGCTGCTAG TTTCTTTGGA 66 0 

GCCnTGTTA TGATAAATTA TATCTTAGCA GATATGATAT TTACGTATCC TCTTCAATTA 720 

AdTTCTTTA TCCTTTTACT AATGAGTCAC TCATTGTTAA AACAGATTTC ACTTAAAGAA 780 

ATCATTAATT ACTTTAGAGG TCGTAAGAAC AGAGGTGAAA AAATAGATGA CCCACTTACT 84 0 

35 GATCGTGGAT GATGAACAAG ACATTGTAGA CATTTGTCAA ACCTATTTTG AATATGAAGG 900 

TTACAAAGTA ACAACGACAA CTAGCGGTAA AGAAGCAATT TCTTTACTAT CAAATGATAT 960 

TGATATCATG GTACTTGATA TCATGATGCC AGAAGTTAAT GGTTACGACA TTGTCAAAGA 102 0 

AATGAAAAGG CAAAAATTAG ATATCCCCTT TATCTATTTA ACTGCCAAAA CACAAGAACA 1080 

TGATACCATT TACGCCTTAA CTTTAGGTGC AGATGACTAT GTCAAAAAAC CATTTAGTCC 114 0 

AAGGGAACTC GTTTTACGTA TTAATAATTT ACTTACAAGA ATGAAGAAAT ACCATCATCA 120 0 

ACCAGTTGAA CAACTGTCGT TTGATGAATT AACACTTATT AACTTAAGTA AAGTtGTGaC 126 0 

tGTAAaTGGT CACGAaGTCC CTATGCGTAT TAAGGAATTT GAGTTATTGT GGTATTTAGC 132 0 

5^ TTCTaGAGAA AATGAAGTTA TTTCTAAATC AGAATTACTT GAAAAAGTTT GGGGATATGA 1380 

CTATTACGAA GATGCTAATA CCGTGAATGT CCATATACAC CGTATTAGAG AnAAATTAGA 144 0 

AAAAGAGAGC TTTACAACAT ATACCATCAC AACTGTATGG GGATTAGGAT ATAAATnTGA 1500 
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(2) INFORMATION FOR SEQ ID NO: 499: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1623 base pairs 
s (B) TYPE: nucleic acid 

(C) STRANDEDNESS: dotlble 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 99: 



IS 



20 



25 



30 



35 



40 



45 



SO 





ATAATTCGTA 


nTAATTGAGT 


TTGTTGAAAA 


ATTTAGGGTA 


ATGTAAAGAT 


60 




CATAGAyTGG AGAGATATAA 


AGATGTTGAA 


TGAGATACAA 


ATATTAaATA 


120 


a TV?/^ KT*^ e^^r* 


GATGCCTTCA 


GTTGGGTTAG 


GTGTTTATAA 


AATCTCTGAC 


GAAGATATGA 


180 


^^^P X R R ^^^^^^ 


AAATGCTGCA 


ATTGACGCAG 


GCTATAGAGC 


GTTTGATACA 


GCATACTTTT 


240 




GGCTTCACTA 


GGACGAGCAT 


TAAAGGATAA 


TGGCGTCGAT 


AGAGAAGATT 


300 




AACGAAGTTA 


TGGAATGACT 


ATCAAGGTTA 


TGAGAAAACA 


TTCGAATATT 


360 




GATTGAAAAT 


TTACAAACTG 


ATTATCTTGA 


TTTATTTCTA 


ATACATTGGC 


4 20 




AGATGGTCTA 


lUTiTAGAAA 


CATATAAAGC 


TATGGAAGAA 


CTTTACGAGC 


480 




AAAAGCAATA 


GGTGTATGTA 


ATTTTAATGT 


TCATCATCTA 


GAAAAATTAA 


540 


TGGCTCAATC 


AAGTATCAAA 


CCAATGGTGA 


A 1 LAAAX IwA 


^mTACATCCA 


TATTTTAACC 


600 


AACAAGAATT 


ACAAGAATTT 


TGTGATCGTC 


a ^'V^ a a K 


ACj XAjALTi VjCA 


TGGATGCCTT 


660 


TGATGAGAAA 








X AAAAX XoCX 


GAAAAATATC 


720 


ATAAAACACC 


AGCACAAGTT 


GTATTACGTT 


A a 

wOV^AX X Xav7V» 


a/**a*^a aTar»a 


ATTATTATTC 


780 


CAAAATCTCA 


GACACCTAAA 


CGCATTCAAG 


AAAATATAGA 


TATTTTAGAT 


TTTAATTTAG 


840 


AATTAACAGA 


AGTAGCTGAA 


ATTGATGCTT 


TAAATAGAAA 


TGCAAGACAA 


GGTAAAAATC 


900 


CAGATGATGT 


GAAAATTGGG 


GATTTAAAAT 


AACTGGATGT 


TAAATTTTAC 


GTTTATGAAT 


960 


GCCTTTTAAT 


GTGTACATTA 


AAATAAATGA 


GTTGGTTTTT 


ACTATTTGAT 


AAAACAATAC 


1020 


TCAGGTACAT 


TCAAAATCTT 


TTAAATAAAA 


AGGATGGACA 


TAGATGAAAA 


TTAGAGTCGT 


1080 


CATTCCTTGT 


TTTAATGAAG 


GGGAAGTCAT 


TACACAAACA 


CATCAACAAT 


TAACTGAAAT 


1140 


ACTTTCACAA 


GATAGTAGTG 


TGAAAGGCTA 


TGATTATAAT 


ATGCTTTTCA 


TAGATGATGG 


1200 


TAGTACGGAT 


ACCACT/iTAG 


ATGAAATGCA 


ACATCTTGCC 


ACAATAGATA 


GGCATGTCAG 


1260 


CTTTATTTCT 


TTTAGTAGAA 


ATTTTGGAAA 


AGAAGCAGCT 


ATGATTGCAG 


GTTACCAGCA 


1320 


TAGTACTGAA 


TTTGATGCAG 


TCATCATGAT 


AGATTGTGAT 


TTGCAACATC 


CACCTGAATA 


1380 
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TAGAAGTGGT GAAAATTTTA GTCGCAAAAC ATTAAGCCAT TTGTATTATA AGTTAGTTAA 1500 

TTGCTTTGTA GAAGAAGTAC AATTTGATGA TGGTGTTGGT GATTTTAGAC TTTTAAGCCA 156 0 

AAGAGCTGTT AAATCCATTG CATCACTTGA AGAATATAAT CGnTTTTCAA AAnGGnTATT 1620 

TGA 1623 
(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 

20 AAAGTnGGTG AAnCTATATA CTTAATCTAT ATTTATATAT TAACCATTAG GGTTAAAAAT 6 0 

TACTCTAGCA TTTATGAATA GATGGGAGTT TATTTTATTA TTATATAGGA GAGATGTTGA 120 

ATGACACATC GCGCACTATT AGTTGTTGAC TATTCATATG ACTTTATCGC AGACGACGGC 18 0 

TTACTAACAT GCGGTAAACC TGGACAAAAT ATTGAAGATT TTATTGTTTC TCGTATCAAT 24 0 

GACTTTAATT ATTATCAAGA CCATATATTC TTTTTrrTGG ATTTACATTA TTTACATGAC 300 

ATTCATCATC CTGAAAGTAA ATTATTCCCA CCACACAATA TCGTAGATAC AAGTGGTAGA 36 0 

GAATTATACG GTAAAGTAGG TAAATTATAC GAAACAATTA AAGCGCAACC TAATGTACAT 42 0 

TTCATTGATA AAACGCGCTA TGATTCGTTC TTTGGTACCC CGCTTGATAG TTTATTGAGa 4 80 

GAAAGAAGTA TTAATCAAGT CGAAATCGTT GGTGTATGTA CCGATATTTG CGTGTTACAT 54 0 

ACAGCAATTT CTGCATACAA CTTAGGtTAT AAAATTTCAG TACCTGCTGA GGGAGTGGCT 600 

CATTT 605 
40 (2) INFORMATION FOR SEQ ID NO: 501: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1739 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE UESCRIPTION: SEQ ID NO: 501: 
TAGGTTnAAA GCATAGnTTT nTCAAAAAGA CAAATCATTC ATATATTGGA GGATATTTTG 60 
GTGTAAGATA TAGTGCAACC ACAATTGCTA AAGACTTGAA GGAACTAAAT ATATATCGTG 120 
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TGAGAGAAAA 


GTTTAGACAC 


TATTGTGAAC 


ATGAAGTTCT 


AAGTTCAATC 


ATCAATGGTT 


240 


CATACATTAT 


CGTCAAAACC 


TCACCTGGTT 


TCGCCCAAGG 


CATAAACTAT 


TTTATCGATC 


300 


AGCTAAATAT 


AGAAGAGATA 


TTAGGTACGG 


TGAGTGGAAA 


TGACACTACA 


TTAATCTTAA 


360 


CTGCCTCAAA 


TGATATGGCA 


GAATACGTAT 


ATGCAAAATT 


ATTTAAATAG 


ACATGTATCA 


420 


AATGAATAAT 


AAAAATTTGT 


TTCGTATCAC 


GTGTACTCAA 


GTTAGTTACC 


AAATATTAAC 


430 


TTGTGTACGC 


Gri-i-rnTAT 


GGAAAGAAAG 


AATTCATAGT 


CATTCAATTG 


ACTGTATAAA 


540 


AAACTTTATA 


CAACATGTTT 


TTATGGGTAT 


TTTTGAATAA 


AAAATGTATA 


TTTTGACCCA 


€00 


AAATACCTTT 


ATTTATGTAT 


AAAAATCCAT 


TATTATGTAT 


TGTATAACAA 


AAAGATATGA 


660 


AATTTTCGAC 


TTTCTTTATG 


TGAATATAAT 


CACATGTAAG 


CGTTTGAAGA 


TTGTCTATAC 


720 


TCTAAATGAA 


TTCAAAGATA 


AAAGGAGGAA 


ATAGACATGA 


CAGATGGTCC 


AATTAAAGTA 


780 


AATAGCGAAA 


TTGGAGCrTT 


AAAAACTGTG 


TTACTTAAGC 


GTCCTGGaAA 


AGAATTAGAA 


840 


AATTTAGTAC 


CTGATTATTT 


AGATGGATTA 


CTATTTGATG 


ATATTCCATA 


TTTAGAAGTA 


900 


GCTCAAAAAG 


AGCATGACCA 


TTTTCCGCAG 


GTGCTAAGAG 


AAGAGGGTGT 


TGAAGTACTT 


960 


TACCTTGAGA 


AGTTAGCAGC 


TGAAAGTATT 


GAAAATCCTC 


AAGTAAGAAG 


TGAATTTATT 


1020 


GATGATGTAT 


TAGCAGAGTC 


TAAAAAAACA 


ATATTAGGTC 


ATGAAGAAGA 


AATTAAGGCA 


1080 


TTATTTGCGA 


CACTTTCTAA 


TCAAGAACTT 


GTAGATAAAA 


TAATGTCAGG 


GGTACGTAAG 


1140 


GAAGAAATTA 


ATCCGAAATG 


TACACATCTA 


GTAGAGTATA 


TGGATGATAA 


GTATCCATTC 


1200 


TATTTAGATC 


CAATGCCAAA 


CCTTTATTTT 


ACTAGAGATC 


CACAAGCCTC 


AATAGGACAC 


1260 


GGTATAACAA 


TCAATCGGAT 


GTTCTGGAGA 


GCACX3ACGAC 


GAGAATCAAT 


ATTTATTCAA 


1320 


TATATTGTAA 


AGCATCATCC 


TAGATTTAAA 


GATGCGAATA 


TTCCAATCTG 


GTTAGATCGA 


1380 


GATTGCCCAT 


TCAATATTGA 


AGGCGGCGAT 


GAACTTGTTT 


TATCTAAAGA 


TGTCTTGGCT 


1440 


ATAGGCGTTT 


CAGAACGTAC 


ATCTGCACAA 


GCTATTGAAA 


AGTTAGCGCX3 


ACGTATTTTT 


1500 


GAAAATCCGC 


AGGCGACGTT 


TAAAAAAGTA 


GTAGCAATTG 


AAATTCCAAC 


TAGTCGAACT 


1560 


TTTATGCACT 


TAGATACAOT 


ATTTACAATG 


ATAGATTATG 


ACAAATTTAC 


AATGCATTCA 


1620 


GCCATTTTAA 


AGGCAGAAGG 


CaATATGAAT 


ATATTTATTA 


TTGAATATGA 


TGACGTAAAT 


1680 


AAAGATATTG 


CCATCAAACA 


ATCTAGTCnT 


TTAAAAGATA 


CTTTAGAAGA 


CGTACTAGG 


1739 


12) XWFORMATION FOR SEQ ID NO: 502: 









SO 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1745 base pairs 

(B) TYPE: nucleic acid 

(C) STHANDEDNESS : double 
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(xi) 


SEQUENCE DESCRIPTION: . 


SEQ ID NO: 


502: 






5 


CTGTACATAC 


AGCAATATCG 


TTAACAAATG 


AAAACAGTAT TTTAGGATTG 


TAAACCATGA 






TAAACCTAAA 


ATACTGTTAT 


TTTTATTACT 


TAAATTTCTT 


CTTCAATGCC 


TTTTCAACAT 


TOO 


10 


AAGGTGGAAC 


GAATTCAGAA 


ATATCTGCTC 


GATAAGCTGC 


AACTTCTTTA 


ACAATACTTG 


1 o rt 
XO u 


AACTTATAAA 


TGAATAATTA 


GTACTAGACA 


TCATATATAA 


CGTTTCAATT 


TCATTGTTCA 






ACTTTTTATT 


CATTGAAGTT 


AAGCGTAATT 


CATATTCAAA 


ATCACTGACT 


GCTCTTAAA P 


"1 rt n 
^ u u 


15 


CACGTATGAT 


TGTTTTAGCT 


CCTACTTGTT 


CACAATAATC 


GACTAGTAAA 




1 £ n 
o o u 




GATGAACCTT 


GACATTAGGT 


AAATGTTTAA 


CAGATTGTTC 


AATTAAATCC 




A o n 




CTAAACTAAA 


CGTACCTTCT 


TTTTTACTAT 


TTTTAAGAAC 


ACAGACATGA 


ATTTCATCAA 


4 Q n 
4 o u 


20 


ATCTATCTGT 


ACTTCTCTCA 


ATAATGTCTA 


AATGACCATA 


AGTAATGGGG 


TCAAAACTAr* 


C /I 




CCGGAATGAC 


CGCTATTGTA 


TGTTCCATGC 


TATTCTCCCT 


TTTCTAATAA 


CAATGTHTPT 


n r\ 




GTCAACCCAT 


AATGGTAACG 


TTTAATCATA 


TTAAACGGTT 


GATAATCtAT 


X X ^ X X ^ai^^ X V7n 


O D U 


25 


TTGCTAAATT 


CACAAACGAT 


GATACCATTT 


TCTTTCAATA 


AATTAAACTC 


T(!3 AAATX A fZT 






TTTAAAGCTT 


TATCAATGAG 


ACCTTTATTA 


TAAGGTGGAT 


CTAaGAAAAT 


GACATCAAAT 


/ o u 


30 


TGAATATCAC 


GTTTTGACAA 


TGCXTTTAAA 


GCTCTATCTG 


CATTATTTTT 


ATAAACTTCA 




GATTGTGCCT 


CTAAATCCAA 


ATTCGCAAGA 


TTTGaTTTAA 


TAACTTTTAC 


AGCTTTAAAA 


-7 U LI 




TTTTGaTCAA 


CAAAGATTAC 


CTTATCCATA 


CCTCGAGAGA 


GTGcTTCTAT 


TCCAAGCGCC 


960 


35 


CCGCTTCCTG 


CAAATAAATC 


TAAACCTATA 


CCTGACACAT 


CATATAAACT 


ATTAAAGATA 


X U ^ V/ 




CCTTCTTTAA 


CTTTATCCAT 


AGTTGGTCTC 


GTATTACGGC 


CTTCCATACT 


TTCTAAAGCT 


X w O U 




TTACTTiTAT 


GTTTACCTGC 


AATGACGCGC 


ATGTTGTTCA 


CACTTCCAAT 


TCATTTAGTT 


1140 


40 


ATTTAATATA 


ATTTATTGAG 


AAAAAGGAGA 


ATGATAAACC 


AATGAAACAA 


ACATTTATTA 


1200 




CACTTGGTGA 


AGGTCTAACA 


GATTTGTTCG 


AATTCATGAC 


GATGATTGAA 


TATAACCATC 


1260 




AACGTATTGA 


TAAAATTATC 


TATTTTCATT 


CACCACAAGC 


TGAAAATAAA 


AAGTCATCTG 


1320 


45 


TAGCAATCAT 


TATGAACCCT 


ACAACTGGCA 


ATCATTTCCA 


AGCATTTTAT 


ATCATGATAA 


1380 




ACGCTATTAA 


ATATCCATAT 


CCAGATTCAA 


ATAAAAAGTT 


TCAAATGATA 


AATGATTGTG 


1440 


50 


CTGAAAAATT 


CGACATACCA 


ATTTTAGGTA 


TCGATGTACA 


GCCCCCTCAA 


GCATTTCATG 


1500 


ATTTATCGTT 


ATATTATAAT 


TATTTAATTA 


GTGTGTTAAG 


GCTCCAAAAA 


TGGATACCAG 


1560 




aACTTCAATA 


ATATTAATTA 


TATATTTCGT 


GTTTCTCTTT 


TTCGTAAGTT 


TTCTTTAAGT 


1620 
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TTTGATCAAC ATCTTGCTCA TTCACATACA TAATTACAAA TTTACGATCT CTATTTGAAT 



(2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503: 





TCGTCTTTAA 


TCTTGCTGAC 


TTTAGAAGGC 


TTACGAGTGC 


GGTTACCATT 


TTTTGCATCT 


60 




TTAACTGATT 


GAACTAAAGC 


TtGACGTGTA 


GATTTATCAG 


CTAAACTAAT 


TGCACCACCA 


120 


20 


ATTACGGCAC 


CAATTAAAAT 


ACCAGGAACA 


AATTTATTTT 


CCATAAAAAA 


CTACCCCTCT 


180 




TTCAAATTTG 


CATcrx-rrAC 


GATGTAGTCT 


ATTAAATTAT 


CACAAGATGA 


TAATACCATG 


240 




TCGTATACAC 


CTTCAAAATT 


ATTCGTGTAG 


TATGGATCTG 


GTACATCACT 


CTCTTCCATA 


300 


25 


TTACTAAATT 


CTAACAGTTT 


GAACAATTGT 


CCCTTAAGAT 


TAGGATTGAT 


AGATTTAATA 


360 




TTATCAACGT 


TACTTTGATC 


CATAGCCACA 


ATGTAATCAA 


AATCATCTGT 


CGCTTCGAAT 


420 


30 


AATTCACTAA 


TCATGCCATC 


AAATGGAATA 




TGAGAATTTT 


TTGTGTACCT 


480 


TCATGAGGTG 


GCTCTCCTAA 


ATTCCAGCTA 


CCAGTACCTC 


TTGAATGTAC 


TTTAATATCA 


540 




TGAATATTTC 


TGTCTTTAAG 


TCTTTGTCGC 


ATGATTGCTT 


CTGCCATTGG 


AGAACGACAT 


600 


35 


ATATTGCCAA 


GACAGACAAA 


TGCTACATCT 


ACGATTTTGA 


TTCCTCCAAA 


CTATGTAGTT 


660 




ATATCCCCAT 


TTTATAGCGA 


CTTTAAACAA 


TAAGAAAGCA 


GATTATATAA 


AATTCTATTA 


720 




AAGTTTATTA 


AATTGTGATA 


CTTTGATAAC 


ATAACTATTA 


TTAGAGGTGA ACATTGTGGC 


780 


40 


TATGACAAAT 


GAAGAGAAAG 


TnTTAGCTAT 


TAGAGAGAAG 


TTAAATATtG 


TTAATCAAGG 


840 




ATTATTAGAT 


CCTGAAAAAT 


ATAAAAATGC 


AAATGaAGAA 


GAATTAACAG 


ATATATATGA 


900 




TTTTGJcTCaA 


yCAAGAGaAA 


GATTGTCGCC 


AAGTGaAGTG 


mCAGCTATTG 


CTGaCGCTTT 


960 


45 


AGGACAATTG 
CAATACGCTG 


CGACACGAAT 
AACTA 


AGGAGTGGGA 


ATTTTGACTA 


ATTACAAAGA 


AAAGTTACAA 


1020 
1035 




(2) INFORMATION FOR SEQ ID NO: 504: 









50 



{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 

AnCCTGACAA GATTCTTATA TTCATAGATA TGAGAGCTAA AAATGAAATC AACAAACAnA 60 

CAATAAAGTA AACGATAATA GCCCATATAC CATTTTGTAA CCATATTACA AATTGTGTTG 120 

TATTATAGCC ATTTCCAGCT AATAATTGCT GGATAAATGC ATTATTGTTT AATGTATTTT 180 

CAAGATTAGC AAtCGATGTG TTATTACTGA ATGAAACAAG TGCTATAAAC ATCGTAATGA 24 0 

CAGTAAGCAC TAATAACATC ACCCAACATA ACCAACCTAG AACTTTTTCA GTTAATCTAC 3 00 

TTACTGGACG TTTAATTTGA GTAAATTGTT CTCCAGTCAT TCGTTACAAC TCCTTATAGT 360 

ACTTATCCCG TTATTATAAC TAAATATACA GTAAATAACT ACTATTTATG ATTTTATTTT 420 

AATGACATTT TGAAATTCAA AAAGTTTTCA TTGTATTCAC TTAAAACTTC AGGTCCTAAA 4 80 

20 TCTTTATAAA CTTCAAGGCG TTCTTGCTCT TTCTTAGTCG GATAAAAACG ATGGTCGTCT 540 

TTAATCTCTT TAGGCAACAA TTGTCGAGCA GCCTTGTTTG GCGTTGCATA GCCTACGAAT 600 

TCTGTATTTT GCTTGTTATT TTTAGCATCT AATAAAAAAT TCATAAATTT ATATQCACCC 660 

TCTTTATTTT GTGCCGTTTT TGGAATTACC ATATTGTCGA ACCATAAATT CGATCCTTCT 720 

TTAGGAATAA CATAATTATA TTTATCCCCT TCTTGCACTA GAGGTGCTGC AACACCACTC 780 

CAAACAACCG CTATGTTACC TTCATTTTGT TGAAGCATCA TGGTAATTTC ATCACCTACG 840 

ACACCTCTTA CTTGTGGTGC TAGTTTGGTT AAATCTCGCT CTGCTTCTTT TAAATGGTGC 900 

GAATTACGGT CATTAAGATT ATACCCAAGT TTATTCAAAC TCATGCCTAT AATCTCTCTA 960 

GCACCGTCAA CTAGTAAAAT TTGGTTTTTA AATTTAGGAT TATACAATGA CTTCCAACTA 1020 

TCAAATGATT CATTTGGATA CTTTTCTTTA TTATATAAAA TACCTACAGT TCCAAAGAAA 1080 

TAAGGTAAAG AATATTTATT GCCTCTATCA AATGACATAT TCATATAATC TGAATCTAAA 114 0 

40 TTTTTAATAT TAGGTACCTT ATTATGATCT ATTGGTAACA ATAAATGATC TCTTTTCAAT 1200 

TTTTGAACTG iiATATTCACT AGGAnAAGCA ACATCATAAT GTGTACCGCC AGTGCGAATT 1260 

TTGGnGTCCA nCGCTTCATT TGAA 1284 
(2) INFORMATION FOR SEQ ID NO: 505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5763 base pairs 

(B) TYPE: nucleic acid 
so (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



30 



3S 



SB 



BNSOOCIO: <EP ^0786519A2_I_> 



1430 



10 



25 



EP 0 786 519 A2 

ATAATTAACA TCTTTTGCTA TATACCACCA GTTTGATACA TAAAATATCG CAGCAATAAT 60 

ATCATGTTTA ACCCTAATGA TATTATCTGA TTTTAATAAT AAGGTTGCTG TCCCTACAAC 120 

CATTAATAAA ACTATGACTG CTGGTAATAA ACGTTTTAAA CGACGTATCC AAAAGCTTTT 180 

CAATTTAATG ATACCTGTGT CATCATACTC TTTGAGTAAT AAGCTTGTAA TTAAATAACC 24 0 

AGAGATCACA AAAAATGTAT CCACACCTAA AAAGCCACCT GCCAACCATT GCTTATTTAA 300 

GTGGTAAATA ATAATTCCTA GAACAGCGaT TGCCCTCAAA CCATCGAGCC CTGGAATATA 360 

TCTCATTTTC TTATACTTTG TAAAACCCTT TGTTTTGTTC ATTTTTGCAT TCTTCCCTTT 420 

75 TAAAACTGTT CTTCTTAGAT GCTTAATTAA ATTTAGTTAT GCTGTTTAAA AGAATATTGA 4 80 

AATGCATATG TATATTATTG AATTACGACA TCATCAAAAT CATATTGACT AAAATACTGT 540 

TAAATTAAAA AAATTACCAA TGATGATTCT TACTTCGAAA TCCAATTTGT AATGCAACTC 600 

20 GGCAACTTAA AAACTATGAA GTATTATGTA TTGTAATATA ACTGTAATAT AAATTCA-ATT 660 

TATTATAAAA ATTTTCAAGA AAATATTCAA CTAGAAAATG AATTGTGCAC TCTTGGAAGT 720 

GCAAGTCACT GTCTTAATTC ATATTTTTTG AAACAAGTTA GATATAAATT TTCAAAATAA 7 80 

AATCAGAAAC TAGAACATAA ATAAGGCTCC CTTCAAAATT TTCATTTTTC AATGTCTACT 840 

TTGAAGGGAG CTTATTCACA ATGAATTATA CTCTACAATG TTATATTGAC TGCGGGCCCA 900 

AACACAGAGA ATTTCGAAAA GAAATTCTAC AGGCAATGCA AGTTTATGTT AGCTCACACC 960 

AAGTGCAATC TTAGCGTAAC GTGACATCAT ATCTTTTGTC CAAGGTGGAC TCCATACGAT 1020 

ATTCACTTCA GTATCCTGAA TTTCAGGAAT CTCTGCTAAT ACTGTTTTAA CTTGaTCAAT 1080 

AATTTGAGGT CCCATTGGAC ATCCCATTGA TGTTAAAGTC ATATCAACTG TACATACGCC 1140 

TTCATCATCA ACATTCACTT TGTATACTAA ACCCAAATTA ACGATATCAA TTCCTAATTC 1200 

AGGGTCAATT ACCATTTCTA ATGCACCTAA GATACTATCT TTCAATGCCT CTTCCATCCA 1260 

40 TATCACCTCT TTAATXSTCAT ATTATTCATA ATATATCAAA TATCCGACMi AACXXTCAATA 1320 

AAATGCTATO ATGTATCTAT ATGAACTAAG CAACTTATGA GGAGAGAGAT ATGCAACCAC 13 80 

ATTTAATATG TCTAGACTTA GACGGAACAT TATTAAACGA TAACAAAGAA ATTTCATCAT 144 0 

ATACTAAACA AGTATTAAAT GAATTACAAC AACGTGGaCA CCAAATTATG ATTGCGACTG 1500 

GCAGACCTTA TCGTGCAAGT CAAATGTATT ATCATGAATT AAATTTAACG ACACCAATTG 1560 

TTAATTTTAA TGGCGCTTAC GTACATCACC CTAAAGATAA AAACTTCAAA ACTTGCCATG 1620 

AAATTTTAGA TTTAGGCATC GCACAAAACA TTATTCAAGG ATTACAACAA TATCAAGTAT 1680 

CGAATATTAT AGCAGAAGTG AAAGATTATG TTTTCATTAA CAATCATGAT CCAAGATTAT 1740 

55 



30 



35 



45 



SO 



1431 



EP0 786 519 A2 

AAGAATCCCC TACCTCAATT TTAATTGAAG CCGAAGAAAG TAAAATACCT GAAATCAAAA 1B60 

ATATGCTTAC TCATTTTTAT GCCGATCATA TTGAGCATCG ACGCTGGGGC GCACCATTCC 1920 

5 

CTGTCATTGA AATTGTAAAA CTTGGTATTA ATAAAGCAAG AGGCATTGAG CAAGTTAGAC 1980 

AATTTTTAAA TATTGACCGA AATAATATTA TTGCATTCGG TGATGAAGAT AATGATATTG 2040 

AAATGATTGA GTACGCGCGT CACGGTGTTG CTATGGAAAA TGGTTTGCAA GAACTTAAAG 2100 

ATGTAGCGAA CAATATTACA TTCAACAATA ATGAAGATGG CATTGGTCGA TATTTGAATG 216 0 

ATTTCTTTAA TTTAAATATT AGATATTACT GTTAATTTAT AACTAATCAT TTTATAATAT 2220 

75 TTTAAAACAA TAGGAGGTAA GTTACGATGC CCAAAATAGT CGTAGTCGGA GCAGTCGCTG 2280 

GCGGTGCAAC ATGTGCCAGC CAAATTCGAC GTTTAGATAA AGAAAGTGAC ATTATTATTT 2340 

TTGAAAAAGA TCGTGATATG AGCTTTGCTA ATTGTGCATT GCCTTATGTC ATTGGCGAaG 2400 

20 TTGTTGAAGA TAGAAGATAT GCTTTAGCGT ATACACCTGA AAAATTTTAT GATAGAAAGC 2460 

AAATTACAGT AAAAACTTAT CATGAAGTTA TTGCAATCAA TGATGAAAGA CAAACTGTAT 2520 

CTGTATTAAA TAGAAAGACA AACGAACAAT TTGAAGAATC TTACGATAAA CTCATTTTAA 258 0 

25 

GCCCTGGTGC AAGTGCAAAT AGCCTTGGCT TTGAAAGTGA TATTACATTT ACACTTAGAA 264 0 

ATTTAGAAGA CACTGATGCT ATCGATCAAT TCATCAAAGC AAATCAAGTT GATAAAGTAT 2700 

TGGTTGTAGG TGCAGGTTAT GTTTCATTAG AAGTTCTTGA AAATCTTTAT GAACGTGGTT 2760 

30 

TACACCCTAC TTTAATTCAT CGATCTGATA AGATAAATAA ATTAATGGAT GCCGACATGA 2 820 

ATCAACCTAT ACTTGATGAA TTAGATAAGC GGGAGATTCC ATACCGTTTA AATGAGGAAA 2880 

35 TTAATGCTAT CAATGGAAAT GAAATTACAT TTAAATCAGG AAAAGTTGAA CATTACGATA 2940 

TGATTATTGA AGGTGTCGGT ACTCACCCCA ATTCAAAATT TATCGAAAGT TCAAATATCA 3000 

AACTTGATCG AAAAGGTTTC ATACCGGTAA ACGATAAATT TGAAACAAAT GTTCCAAACA 3060 

40 TTTATGCAAT AGGCGATATT GCAACATCAC ATTATCGACA TGTCGATCTA CCGGCTAGTG 3120 

TTCCTTTAGC TTGGGGCGCT CACCGTGCAG CAAGTATTGT TGCCGAACAA ATTGCTGGAA 3180 

ATGACACTAT TGAATTCAAA GGCTTCTTAG GCAACAATAT TGTGAAGTTC TTTGATTATA 324 0 

45 

CATTTGCGAG TGTCGGCGTT AAACCAAACG AACTAAAGCA ATTTGACTAT AAAATGGTAG 3300 

AAGTCACTCA AGGTGCACAC GCGAATTATT ACCCAGGAAA TTCCCCTTTA CACTTAAGAG 3360 

TATATTATGA CACTTCAAAC CX3TCAGATTT TAAGAGCAGC TGCAGTAGGA AAAGAAGGTG 34 20 

50 

CAGATAAACG TATTGATGTA CTATCGATGG CAATGATGAA CCAGCTAACT GTAGATGAGT 34 80 

TAACTGAGTT TX3AAGTGGCT TATGCACCAC CATATAGCCA CCCTAAAGAT TTAATCAATA 3 540 
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GTTAGAATTA TGTTGGACTG GTACTACTAT CCAGTCCATT TTTTATGTTT AACATTTTTA 3660 

GAATCAAAAA AGACATAAGG TCTTGGACTA ATAATTGTCC ATGCCTTATG TCATATACTA 3720 

TATGTCTTAT CAATTAGCCA ATACCGAATA ATTTTGATAT AGGsCCTAAC GGTAGAATGA 3780 

CACCTAATAC CATTGTGATG ATAATTAATG CAATTGTTAT CCAAAACATT GTGTGACTTT 384 0 

GTTCATGTCT CTTTCTTTTA GCAATCGACA CTTCCATCAA TCCAACTACT GCAACACCAC 3900 

ACAGCATTTT CAATGTAAGC AACATATGAT TTGCCCCGCC ATTCATAAAT GACTGAATTA 396 0 

ATATCCAAAA TCCTGAAATT AACGTCAACA GCATAAATAA GCGTAAAATC ATGTGCAACG 4 020 

,5 GTTTGAAAAA TGGTGATCTG CCTTGATTTT TTGAAATGTT TAAGTATGTA GCGATAAATA 4 08 0 

AAATAATCGC TAATACCCAA CTTAATATAT GTAAATGTAA CATACTGATT CCCCCCACTT 414 0 

TAATTATTTA TATTATTAAA TTAAAGCTTC TTGGGATTAA TACCCACTTG CTTGTAATTT 4 200 

20 AATCATGATT TGATTATACA CGAATATATA TTCTACCACA CTTCTATATT TGAGAGGAAG 4 26 0 

AACATGACAT TTTATTCCTT ATTAGAATAT TGTGAFVTCTG CTGTAAAATA ATCAACTACT 4320 

TCTTAATATC AATATTTCAC TTTCATCTCA AAATGGTAAC ATTATAAATA ATTTATCTTT 43 8 0 

AACACCTTTT TAGAAAAGCA AGAAAAAACT AACCAATCTA TATAAAGACT GGTTAGCTTT 444 0 

TTAAATGATA ATTATTTAGC GATATAAGTT GTCAGCGTTC CAATATTATC AATAGTCACT 4500 

TTAACTTCAT CACCTGGTTG TAAAAATTTA GGTGGTTGCA TACCTGCACC AACGCCTGCT 4 5 60 

GGTGTACCAG TTGCAATAAT ATCTCCCGGA TGTAGTGCAA CATATTTTGA AATTTCTTCT 4 6 20 

ATTAATTCAT CAATTTTAAG AATCATTTCG CCAGTGTTAC CATCTTGTCT AATTTCATTG 4 680 

TTAACTTTTG TAACAATATT TACATTTTCA GGTAATGGTA GTTCGTCTTT AGTAACGATA 474,0 

TAAGGACCCA TTGGGCAACC GCCAGTTAAA CTTTTTGATA AAAATGCTTG ATCTTGTTCA 4800 

CTTTGTGCTT TGCGATCAGT GATATCGTTA ATAATTGTAT AGCCGTAAAC ATAATCTAAA 4860 

40 GCTAATGCTT TTGGAATCTT TTCACCAGAC TTACCAATAA CAATACCTAA TTCACCTTCA 4 920 

TAATCTAATT GATCAGTAAT ATCTTTATGA TTTGGAATTG TTGCATTATC TCCTGTTAAA- 4 980 

GATGACX3CTG CnTTGTAAA TACATATAAT TTTTCCACTT CATGATTTAA TTCGTTCGCA 5040 

TGATCTTTGT AATTTCTACC AAAAGCAATC ACATTATTCX5 GAGGTGTTAC TGGTGGTAAA 5100 

AATTCAATGT CATTAAATGA AATTTTATAG TCTTCAGCTT TGCCGCTATC TTCTGCTGCT 5160 

ACAACTGCTT TACGTACTTG TTCTTGAAAA TCTAAAGTAT GATTTTGTTG TAAACCAGCT 5220 

AACAATGTTT TAGGATGGAA ATCTCCTTCT GCAAAGTCAG CAAATACTTG TGTTAAATCC 5280 

CATACAGCAT CTTCGCGTTT TACTTTAACG CCATATGAAG TTTTGTCATT ATACTTGAAT 534 0 
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TTCGTTATCA AATAACAAAT AAATAAGTAA GACAATTTTG AAAATGAGTT GTGTTCATTC 54 60 

TGCTACAAGG ACTTTGCACT TAATCGAAAT TATTTTTTAT TCTTTTGAAA ATCAAAATAC 5520 
5 

TATAGTTGCA ATGTACCAAA TTTGAAGAAG TATAAATAAC CTTTAACTTC TTTATTAAGA 
ATCGTTTGAA GCGTATTTTG ATAATATTTC ATCTGTATCT TATATTTATT TTTTAATTGT 
GTACCAATTT CTTCATCTGT CATCCCACGG CGACGATTAA ATGCATCGGT TTTATAGTCT 5700 
ACAAAATAAT GCACACCATC TTTAACAAAG ATTAAGTCAA TCATACCTTG AATAATTGAG 
ACG 

,5 (2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 
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5580 
5640 



5760 
5763 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 

CCATATGATT TTGTGCAATA ACTCTTTTTC TTTCTTCTTT TCGTAAAAAG TTGTACATCG 60 

CTTTTGTGTT TAAGAGACTA TTGTTTTCTT TAGGTTTTTG AACTTCACTC AGTGTATTTC 120 

TAGTTGTTAA CACTAAAATT CCAACTGTTG TATCTTTGTA TCTAGCCATA ACTTTATTCA 190 

GATGTTTGTC ATTTGTAATA ACTACGACAT AATTAAACAC TTCATAATAA TCATTAATTT 24 0 

GATTATCTAA TCTATCCAGC TTATCTAATT CTGTTTTAAT CTCATAGACA ACGCCTTTGC 300 

CGTTTAACAA TATAAAATCA GCaATACTTT TCCCTATGGG CATCTCAGAA AGTGCAGTAG 360 

TTGTATTAAT AGAATGTCGT CCTAGAAGGA GTkTATTAAG TATnGTGTTT TTGTAAAAAT 420 
AT 

40 (2) INFORMATION FOR SEQ ID NO: 507: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1188 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



422 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 
GCTTATGTAT TCTCAAAATA TTTATGTGAT ACGCAAAGGA GACATGGTTA TTCGACCAGC 60 
ATTTGATGAT GACGATCAAA GAAACGGTAG TGAAATAATT CGGTTTGACA AAACGCGTAT 120 
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TTATCTTGGC 


AAGAAAGCAG 


AGACAAACCG 


CATTACTGGC 


ATTTCTAGTA 


AACCACCTAT 


TTTACTAACA 


CCATTATTTC 


CAACTTATTT 


TTTCCCAACA 


CATTCTGACA 


GACAAAATGA 


AAATATTTGG 


TTAAATATGC 


ATTATATCGA 


AAGTATTAAA 


GAATTAAAAA 


ATCGTAAATG 


TAAAGTGACA 


TTTATTAATA 


ATGAATCAAT 


CATTCTTCAT 


GTTTCATACC 


ACAGTTTATG 


GCATCAATAT 


AACAATTCCA 


TTTTTTACTA 


TTACATGGTA 


GATAAACAAT 


CTCGCATGAT 


ATCAAAAAAT 


CCCGACCAAC 


CAATAGATTA 


TAATAAAGCC 


ACATTGAATG 


TGTTTGAAGC 


ATTGACACGC 


TATTCTTTAT 


TTGAAGATAA 


ATAAATTGTT 


TATTTTTAAA 


ATATGCGGAA 


I'CrnTATAA 


ATATAGTGTA 


AATGTTCTGC 


ATATTTTTTT 


AAGGTATCTA 


TTGCAAATTA 


ACTTAATCTT 


GTTATAATAA 


TATTTGTGCT 


TGaTATTCAA 


ACACATACAA 


ATTAATCCAC 


AGTAGCTCAG 


TGGTAGAGCT 


ATCGGCTGTT 


AACCGATCGG 


TCGTAGGTTC 


GAGTCCTACC 


TGTGGAGCCA 


TTGGAAACGT 


ACTCAAGTTG 


GCTGAAGAGG 


CGCCCCTGCT 


AAGGtGTAGG 


TCGCGAAAgG 


CGCGAGGGTT 


CGAATCCCTC 


CGTTTCCGTT 


ACTTGCTAAA 


ATGGTATATA 


CCATTTTAnC 


TTTTTTGTTT 


ACTTATATAT 


AATGAATGAG 


AATTTCACTG 


TTCTTTTATA 


TCAATTTTAA 


AATTCTAAAA 


ACCTTTCCTA 


GATAATCTTC 


TCTAAGAAAG 


GTITTTATAC 


TTGTTGAACT 


TATAATTAAT 


TTATTACATA 


GCAATATTTA 


CCTGTTTTTA 


ACTATAAAAT 


TATCACTACA 


TGAAATACGA 


TAATTCGGaT 


CTCTTAACTT 


CTCTGCaATT 


AATGtACTCA 


TTGgTTTCAT 


CGTATGATTC 


ATGTATAATA 


GCATTTkTTA 


AATAATTC 





(2) INFORMATION FOR SEQ ID NO: 508: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 0 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: 
CCCAACTTCG GTTATAAGAT CCCTCAAAGA TGATGAGGTT AATAGGTTCG AGGTGGAAGC 
ATGGTGACAT GTGGAGCTGA CGAATACTAA TCGATCGAAG ACTTAATCAA AATAAATGTT 
TTGCGAACAA AAtCACTTTT ACTTACTATC TAGTTTTGAA TGTATAAATT ACATTCATAT 
GTCTGGTGAC TATAGCAAGG AGGTCACACC TGTTCCCATG CCGAACACAG AAGTTAAGCT 
CCTTAGCGTC GATGGTAGTC GAACTTACGT TCCGCTAGAG TAGAACGTTG CCAGGCATAA 
TATTAATCCA CAGTAGCTCA GTGGTAGAGC TATCGGCTGT TAACCGATCG GTCGTAGGTT 
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GGCGGTAACA 


CGGGTTCGAG 


TCCCGTAGgA GTCATACAAG 


CAGAAGTGAA 


ATATCGCTTC 


480 


TGTTTTTTTA 


TTACATATTT 


ATTGTTGAGG 


AAGGTTGTCC 


GAGCTGGCCG 


AAGaGCACGC 


540 


CTGGaAAGTG 


TGTAGGCGTC 


ACAAGCGTCT 


CAAGGGTTCG 


AATCCCTTAC 


CTTCCGTAAA 


600 


GGCGCTTAAA 


TTGGTTTTAC 


CCATTTTAAG 


CGCTATTTTT 


ATTTTGGACT 


CAATCCCTTG 


660 


ATATATCTGC 


ATTTGAGCTA 


TTATCCTCAT 


TTTTACACTT 


CTTATTTATT 


TATATCCATT 


720 


TAAAATTTTT 


TAGCCACAAT 


GTGACTAATT 


TTTGaTGAAT 


AATCCTAATT 


TTAGtCACAA 


780 


GATTTTGAAG 


TTTAGTCACA 


AAACAAATCA 


TTCAGATTTT 


TTTCyATAAA 


TTTAGTTTCA 


840 


(2) INFORMATION FOR SEQ ID NO: 509: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1373 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 






TTCTATGAAA 


CTATCAGCTG 


TTTCTTTTTC 


CTTGGTAAAA 


ATCGTT5TAA 


CGCCATGATA 


60 


CATGTATTCT 


CTTTCAaCAC 


CCGATTGGTC 


ATAAGCATCG 


AAAACACAAA 


TAATTTCATC 


120 


TGAAATAACT 


GCATTATAAT 


TTGCAATTGC 


ATCTATTAAT 


TGCATTCTAG 


CTTCTTCTAA 


180 


ATTCTCTTTT 


GCAATGGCGC 


TTAGCGTTGG 


TGATTGTCCT 


ATCATATTGT 


ATCCATCAAT 


240 


GATTAAGTAA 


CGTTCTTTCA 


TTATATTTCT 


CCAACATCAT 


GTCTTTTTCG 


AAATACTTCG 


300 


TACATCATTA 


AACTTGCTGC 


AACCGAAGCA 


TTCAAACTGT 


TTACATGTCC 


AACCATTGGA 


360 


ATCTTAATAT 


AAAAATCGCA 


TTTATCACTT 


ACTAGGCGAC 


TCATACCCTG 


TCCTTCGCTA 


420 


CCAATTACAA 


TAGCCAATGA 


CATGTCCGCT 


TCTAGATTTC 


TATAATCTGT 


TGCATTATTA 


480 


GCTTCAGTGC 


CAGCTACCCA 


AAAGCCATTA 


TCTTTTAGTT 


CATCGATAGT 


TTTAGCTAAA 


540 


TTTGTCACTC 


GAATAACTGG 


TACATGTTCA 


ATTGCACCTG 


TTGAGGCTTT 


TGCAACTGTT 


600 


TGCGTTAGTG 


TAACTGAACG 


ACGTTTAGGA 


ATAATAACAC 


CATCAACTCC 


CGTTGCATCG 


660 


GCTGTTCTTA 


AAATTGATCC 


CAAGTTATGT 


GGGTCTTCTA 


AGCCGTCTAA 


TATAAGTACT 


720 


GTCAATAAAC 


CTTCTTTTTC 


TT'in-l^SCTGT 


TTTAAAAATT 


GATCGAAGTC 


AGCATATTCA 


780 


TATGGTGCAA 
GATTTTGGTA 


TAAGCGCTGC 
CAGTTTGAAC 


AACACCCTGA 


TGTGGTGCAT 


TTGCTAAAAA 


ATCTAATTTA 


840 


AATGATTTTT 


TGATCrriTG 


CATTT-riTAA 


AATTTCATTA 


900 


ATTTGTTGCT 


TTTTAATACC 


TTCTTGAATC 


AATATCTTAT 


TTAyCGGATG 


CCCAGTAATA 


960 
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TTCGTTTACT ATTGTTATTA TTTTATTTAA TAATGCCTCT AATCGTTCTT CTCTTTTTTC 1080 

TAAATAaAGA AAACCTATCA CTGCTTCTAT CkCTGAACTT TTACGATATG TTTGAACATC 1140 

^ AGTGTTTTTA GCTTTAGTAT GACTTTTCGC GTTACGCCCT CGCTTCAAAA TATCCATTTC 1200 

TTCGTCTGTA AACCATTCTT GCTCCATTAA ATATTCTAAC GTTTGCGCCT GACTTTTGGC 1260 

AGATACATAT TTTTTAGACA TTTGATGTAG TTTATTAGGC TTACTTTTAA GCTTTAAAAC 1320 

10 

GATATAGGTA CGTACATATT GATCTAaGAC TGCGTCnCCC ATATATGCTA AGG 1373 

(2) INFORMATION FOR SEQ ID NO: 510: 

,5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 
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TCAAGTGCCA 


ACTACACAAC 


TGATTTACAT 


TCTTTAGGTC 


AATATGTACA 


AGAAGGCCGT 


60 


CGCTTCTTAT 


TCGAAACAGT 


TGTAAAAGTA 


AATCATCCTA 


• 

AATATGATAT 


TACTATTGAA 


120 


AAAGATAGTG 


ATGATCTAGA 


CGGATTAAAT 


TATTTAGCTG 


GTAAAACAAT 


CGACGAAgTT 


180 


AACACAAAAG 


CATTCGAAGG 


TACATTATTA 


GCGCATACTG 


ATGGTGGTGT 


TCCTAACATG 


240 


GTAGTGAACA 


TTCCACAATT 


AGATGAAGAA 


ACTTTCGGTT 


ACGTCGTATA 


CTTCTTCGAA 


300 


CTTGCTTGTG 


CAATGAGTGG 


ATACCAATTA 


GGCGTAAATC 


CATTTAACCA 


ACCTGGTGTA 


360 


GAAGCATATA 


AACAAAACAT 


GTTCGCATTA 


tTAGGTAAAC 


CTGGTTTTGA 


AGaCTTGAAA 


420 


AAAGrATTAG 


rAGAACGTTT 


ATAAAATACA 


TTACITCAAA 


GATTAGTGAA 


GTTTGAAAAG 


480 


ATAGAACTAG 


ACGTTAACTA 


TTTAAAGCAT 


ATTTTCGAGG 


TTGTCATTAC 


AAATGTAAAA 


540 


ATGTAATGAC 


AACCTCGTTT 


TTATTTATAT 


GCAAGAACTA 


GGTTACTAGC 


TAaTGTGACA 


600 


AGATGTTAAG 


AGAAAATTAA 


aGATAAAATA 


ATATCTGCCT 


TACAATAATA 


TTGTTATACT 


660 


ACTAGnGACT 


GATTTATTAG 


CATGATTACA 


TGTTAATGTT 


TCTTTACTTA 


GTAATTA 


717 



(2) INFORMATION FOR SEQ ID NO: 511: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2700 base pairs 

(B) TYPE: nucleic acid 
so (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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AATCTAATTT TTCCTCCTAA TCTCGGATTG GATAGTGGTC AATTTTTCTG GCCCGCCATT 60 
TTAGCGTTTG TTCTAACTGG GATTGGTTTA CCATTATTAG GTGTGATTGT AGGTGCACTT 120 
GATAAAGAAG GATATATTGG CGCATTAAAT AAAATTTCAC CTAAATTTTC AATATTGTTC 180 
TTAATCATCA TTTATTTGAC TATAGGACCA CTTTTTGCAA TACCTAGAAC TGCATCTACA 24 0 

TCTTTTGAAA TGACAATTAC ACCAATTATA CATAGCAATA GTAGTATCGC TTTATTTATA 300 
TTTACGATTA TCTACTTCAT AGTCGTTTTG TATATTTGTT TAAATCCATC TAAGTTAATC 3 60 

GATCGTATTG GTTCATTATT AACACCATTA TTATTGATTA CTATTTTAGC GATGATTATT 4 20 

75 AAAGGATACT TAGACTTTAG CGGTAATAGT GCTGGAAAGG GCAATGAAGC ACTATATCAT 4 80 

TCTAATTTTT CAAGTTTTGC TGAAGGCTTT ACACAAGGCT ATTTAACAAT GGATGCCATT 54 0 

GCAGCAATTG CTTTTTCAAT GATTGTTGTT AATGCAGTAA AACTAACAGG CATTACTAAA 6 00 

ACAAATCAAA TATTCAAACA AACTTTGACT GCTGGTTTAA TTGCAGCCGT AGCTTTAATT 660 
TTCATATATA TTTCATTAGG TTATATTGGT AATCATATGC CAGTAAGTGA CATGACGTTA 720 
GATCAATTGA AATCCAAAGA TCGAAACATT GGGACATATT TATTAACGAC AATGGCTTCA 7 80 

ACAGGATTTG GTTCATTCGG AAAATATTTA TTGGGCATCA TTGTGGCGCT GGCATGTCTA 84 0 

ACTACAGCAT GCGGGCTTAT TGTTGCAGTT TCTGAATATT TCCATAGAAT CGTACCTAAA 900 
GTATCATACA AAGCATTTGT ATTAGTTTTC ATTTTAATGA GTTTTATTAT TGCTAACCAA 960 

GGTTTAAATG CTGTTATCTC AATGTCAATT CCGGTATTAA GCATTGTATA CCCAGTAGCA 1020 

ATAACTGTTG TATTATTAAT TTTAATTGCC AAATTCATAC CGACAAAACG CATTTCACAA 10 80 

CAAATTCCAG TTATTATCXST ATTTATATTG TCGATTTTCA GTGTTATTAG TT^GTTAGGT 114 0 

TGGcTGAAAA TTAACTTTAT AGAATCATTG CCTCTAAGAG CGTATTCTTT AGAGTGGkTC 1200 

CCAGTAGCAA TTATTGCAAC GATATTAGGC TATCTAGTCG GCATATTTGT AAAACAAGAT 1260 

40 CCAATTAAAT ATCAACAGGA ATAACGAATA ATATAAAAGA GGTTGGGACA TAAATCCCTA 1320 

AAAAAACAGC AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT 1380 

ACAATACTTC GTATTGAATG GCTTCX3CTTT CCTAGGGTGC CGTCTCAGCC TCGGTCTTCG 1440 

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACGT GTAATTTTAC 1500 

TTTGAAATAC TTTAAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA 1560 

ATAAAATAAA TTTTCTGTGT TGGATCCCTT CGTATAATTT AATAAATACT ACTAAACTAA 1620 

ATTAACGAGG TGCCTTATGT ATAAAAATTA TAACATGACC CAACTTACAC TACCAATAGA 16 80 

AACTTCTGTT AGAATTCCTC AAAATGATAA TACGCGATAT GTTAATGAAA TTGTTGAAAC 174 0 
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AAGATATGCA 


TACCX3TAATG 


ATAGATATAG 


TTTTAAACGT 


GACTTCAAGC 


TATATGAATG 


TGATGACTGT 


TCATCATGTT 


CTTTGAGACA 


TCAATGCATG 


AAGCCAAATT 


CGAAATCCAA 


TAAGAAAATT 


ATGAAGAATT 


ATAATTGGGA 


ATACTTTAAA 


GyCCAAATTA 


ATCAAAAGCT 


TTCTGAACCA 


GAAACGAAAA 


AAATCTATAG 


TCAAAGAAAA 


ATTGATGTAG 


AGCCTGTTTT 


TGGATTTATG 


AAGGCTATTT 


TGGGTTTCAC 


TCGAATGTCA 


GTTCGAGGAA 


TAAATAAAGT 


TAAACGAGAG 


CTAGGTTTTG 


TATTAATGGC 


ACTTAPiTATA 


AGGAAAATAG 


CAGCTCAACG 


AGCTGTACAT 


TATAAAATAC 


ATATCAAAAA 


AGCTGATTTC 


TATCAAATAA 


TTAATAGAAA 


TCAGcrrrrr 


TACATTGCCT 


AAGAATTTAA 


TGTCCCAAGC 


ccrrriTATC 


GAATAACTTA 


TTGTAAACCT 


TGTCTTTCTT 


GGTTATTGTT 


TTCGTTATTT 


TTTTCGTGTT 


TTlXn-lTCCA 


CTCTTTTTGA 


GTCATTACAT 


CGTCAACTTG 


CATGTTAACT 


TCAACTACAT 


CTAAACCAGT 


AATATATTTA 


ACTTGTTCTT 


TAACTAATTC 


AGTTACTTTA 


CGGAAGATTT 


TTGGTGCTGA 


TTCACCATAT 


TCTAAGATTA 


CTTTTAAGTC 


TACAGCAGCT 


TGTTTTTCAC 


CAACTTCAAC 


AGATACACCT 


TGAGTAACAT 


TGTTGCCACT 


TGAGAATGCA 


TTAGTGAATG 


TATCAGTTAA 


GCCACCTTTC 


ATATCTAAGA 


TACCTTTAAC 


TTCACGTGCA 


GCGATACCAG 


CAATTTTTTC 


AACAACTTCA 


TCAGAGAATG 


TTAATTTGnT 


TTTGAATTGA 


GGCTCCTGAT 


TTnGTTCnTG 



(2) INFORMATION FOR SEQ ID NO: 512: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 




ATCAAATACA 


ATAATTAAAA 


TAaTGGCTAA 


TACaCCTAaA 


ATaATAATcG 


AGTTaGAGTA 


GCCTAAGCGA 


yCACCcTAAC 


aGTArACATT 


CTAGGCATAT 


ATTTaTCTTT 


AGACATTGAA 


GCCGCTAACA 


TCGGAAAAGC 


TGTAAATCCA 


GTGTTGGCTG 


CTAATACCAA 


AATCATCACG 


GTCGTTGCCT 


GTACAAAATA 


GAAGGCAGCA 


TTATCACCAA 


ATATTTGCAT 


AGCTAATTGT 


GATAAAACTG 


TCGTTTCCGT 


TTGTGGCAAA 


ATTCCATAAA 


CATATGCTAA 


ACCAACGATA 


CCAACTAATA 


AAAACGCTAA 


AATTGAACCC 


ATAGCAATTA 


ACGTTTTCAC 


AGCATTTriA 


GCACTTGGCT 


CTCTAAAATT 


GGTTACCGCA 


TTTGAAATAG 


CTTCAACACC 


CGTTAACGAT 


GAAGCCCCTG 


ATGAAAATGC 


TTTTAATAsC 


AAGAATAATG 


TTACTCCAGG 


AACCGCAGTT 



07665 19A2_I > 



1439 



EP 0 786^-51^1^^2 



5 



75 



AATATTAATA 


TCACTAACCC 


TATAATGAAA 


AGATATACTG 


GATAGGATAA 


TACGGTGGCA 


600 


GATTCAGTTA 


AACCACGTAA 


ATTTAATATT 


AAAATAAAAA 


GTACAAGTAw 


ACATGCAATC 


660 


AGTrCJcTTAT 


GCCCATATAA 


ACTTGGGAAT 


GCAGCAACAA 


ATGCATCAGC 


ACCAGATGAT 


720 


ATmCTAACAG 


CGACAGTCAG 


TATGTAATCG 


ACTAATAATG 


AGCCTCCTGC 


AAGCAATCCC 


780 


CATTTTTCTC 


CTAAATTGGT 


CTTGGACACC 


ATATACGCGC 


CGCCACCTTT 


AGGATATGCA 


840 


TAAATAATTT 


GCCTATAAGA 


CATAATTAAA 


GCAGCTAATA 


AGATCAGAAC 


AGCACCTGCA 


900 


ATCGGTAAAG 


TATACCAAGT 


TGCAACTGCA 


CCCACTACTG 


ATAGTGTAAT 


CAGTATTTGT 


960 


TCGGGACCGT 


AAGCCACTGA 


AGATAATGCA 


TCCGACGAAA 


GAATCGCTAA 


CCCTTTAAAC 


1020 


TTCGATATCT 


TTTCGTCTTT 


TAGTTCTCTG 


'rrrriTTTAG 


GTTGCCCTAT 


AATAAGtCTT 


1080 


TTAAATTGAT 


kGAcATAAtC 


TCCTATTCCT 


TTTTATAGTT 


TTnGATGGAA 


ATCAC 


1135 



^0 (2) INFORMATION FOR SEQ ID NO: 513: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 

30 

rrATGGTAAA ACTTTAACAC AAGCATTACA TAAATCGCAA TCATTfAtGT GGCAAAAACG 60 

TTGAATTTAA GCCATTTCTA TTCATTTTAC GTAATTATTA GCCGTATATG TGTAATAATA 120 

CACATTTTAT TCAGATTTTT TATCGCGCTC CATTAAATCT TTTACGCATT CTTTTACTGA 180 

35 

GATATTTTCA AATAATACTC TATATAATGC ATTTGTAATT GGCATATCCA CATTTTTTTC 240 

TTTAGCTAAA TGATAAACTG ATTTAGTTGT ATAAATACCT TCAACAACCA TATTCATTTC 300 

40 AGATAATGCT TGATCCATTG ATTCACCTTG TCCAAGTTTA TATCCTAATG TGAAATTCCG 360 

AGAATGTGTT GATGTGCAAG TAACGATTAA GTCACCGATA CCACCTAAAC CTAGAAATGT 4 20 

CATAGGATCG GCACCTAACT TTTCACCTAA TCTACTAATT TCCGCTAAGC CACGAGTCAT 480 

TAATGCAGCT TTTGCATTAT CACCGTAGCC AATTCCAGCT ACGATACCAC TTGCTACTGC 54 0 

GATGATATTT TTCAATGCAC CACCAAGTTC AACACCAATC AAGTCATCAT TCGTGTACAC 600 

ACGCAAATAA TCATTCATAA ATAAATCTTG CGTTAATTTA CTTACACTTT TATCTTTTGA 66 0 

SO 

TGAAGCAGCA ACTGTAGTTG GTTGCTTGAC TACAACTTCT TCCGCATGAC TTGGCCCTGA 720 

CAACACGCCA ATACCTGCAT TATATTCAGG TGAAATAGAA TCTTCAATCA TTTCTGACAC 78 0 
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CAGCTTATCA TTAATTTGAG AAGCAACTTC TCGCATTGCT TTAGTAGGTA AAGCCATTAA 
GTAAATATCT GCAAATTG 

(2) INFORMATION FOR SEQ ID NO: 514: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 
CAATACTCGG TATTATAGAT AGTCCTACTA CATTTATATG GGTTTGTATC ATTACATTAA 
TTGCACATCA ACTTGAAGGT AATATCATTA CACCAAATGT AATGGGTAAA TCTTTAAGTA 
TCCATCCTTT AACAATTATC GTTGTTATTT TAGCAGCAGG AGATTTAGGT GGCTTTACAT 
TAGTTCTGAT TGCAGTGCCA TTATATGCTG TACTTAAAAC GGTTGTTAGT AATATTTTCA 
AATACCGCCA ACGCATTATT GATAAAGCAA ACAGTAATGT TAAGGACTAA TTCTGTGGAT 
GTCTTTTAAG AATATAAGAT ACTATCGCAT CAAAAGTTGA AACTACAGCT TTTGAGGCGA 
TTTTTTTGTG CATAAAAAAT CAGTCmAATG AAATATCAAA TAATTTTCCA TCAGTCCGAT 
TATTATAAAA GCAAAAAAGC TTTGCTCACA TATATAATAA CGTGAACAAA GCTGTTGAAT 
GATATTATTT AATTGCGTGG AATCCGCTAT CTACATGAAT ATTTTCACCT GTAACGCCAC 
TTGATAAATC ACTTAATAAG TAAGCCGCAG TTTTACCTAC TTCTACT 
, (2) INFORMATION FOR SEQ ID NO: 515: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 
GGTTCATTCC AAAAAAGTAC GCGATAATTA GCGAAGAAGA ATTTGAAAAT TTAAATGTTG 
TTAAACCAAA TAAAAATAAT GTTTTCTGGT CAGTTATAGG AAGTTCGGTT TTGTTTGGAG 
TTACTTTAAG GAAATACATA CATGTTTTTG ATGTTCAATT AGATAAGCTA GTTGTAATGA 
TATTGTGTGC TCTCGCTTTA ATTTGTGTTA TAGTTTTTTA TTTTAACTTA AATAGAAAGC 
TTAAGTTAAA AGTGTTTGAT ACAAATATTG AAAAAnATAA GAGAGTTATA TAAAwAcCAA 
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TTTCATTAAT TGCCCTTATG ACAATCGAAC CTCAAAATAT AATAATATTT ATTTATTGGA 
TTATGATGAC AATGCTTTTC TTTTTGTTAA ATATGACTTC GATAGGTAAT GAAAAAGTTC 
GCGTTATAAT GAAAAATAAT TGATTACATT TAAAATATTC TAAATGTTGT CGACACAATC 
CTTTTAAGAC GCTAGTAGAA TTTAAATGAC TTCTAATGTA TATGAAAGTG TATCAATATA 
AAACCAATTG AAAAGAAGTG GAGACATTGC TTTGTGAAAC TGAAAATATT AATAAGAATC 
CCAAATATAG AATTATCAAA TACAAAGATG AATATTTGAT GATTGATTTA GTAAGTACAT 
GGTTAGCACT CTTTTTCCCA ATGATTAATT GGCTGATTCC AAAAAAGTAC GTCAAAATCA 
GCGAAAAAGA TTTTGAAACT TTAAACATTG TG 
(2) INFORMATION FOR SEQ ID NO: 516: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 
GTTCTATTAT TTCTTCCAAT GGTAATGGAT TTTCGTAAAA TGATTGATAT AAGTTGATAA 
TCTCAATAAC TTTTAATTGA TCTGGAAACA TCGTTTTTTG GAACATTATA CTGCGATTTT 
CACTTTGTAA TAACTTGTCT TTATCAAATA TCTCACCAGA ATTAGCATTA ACATTACCAA 
TTAATATATC AATTAACGTT GACTTTCCAG CACCATTTTT TCCAATTAAA GCGATACATT 
TACCTTGTTC AATATCGAAC GAAATATTTT TTAGAACACA TCTTTTATTA AATGACTTGT 
TGATATTAGA TATTTGAATC AATTTAATCA CCTCTATTTT TTTCTTAATT TAATATTAGT 
AAATTTATTA GATTTAAAAT AGAACACTTT GTCATAGATT TGAAATGACA AATGTCATTA 
TTAGATkTAC ATAATATATT TATCGTtATT TTAATTTTGG GCAAAATAAA AAGAGCCTCT 
ATAATCGrGc TCCTTACAAA TAaATTATAA AAttGGCGAA CTAAAT 
(2) INFORMATION FOR SEQ ID NO: 517: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS r dovible 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 
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GGAACACCTA AAGAAAGAAA 
AAAGAAAATA CTAAATGGTT 
GTAATTGATG AACTGTCTAC 
AAGAAATTAC CACTCATTAA 
TTACAGGATT TATGGGCTCA 
TTCAGTCGTT ATCGAGAAAG 
AACTGGGAGC TAAGAGACGG 
TTAAGCATGA AAGCGAAAGA 
GTAGTCTTAT CTGAAAAAGA 
GAATCGGAAG AAGAAGGAAC 
CTTCAACTAT CTAACGGTGC 
AAGAAGTTAG ATAAGTTAGA 
TTTTATAACT TCaAACATGA 
TTaGAGGATT CAAACTATAA 
CATCCAGCAA GTGCAGGGCA 
TTTGGACTTA CATGGTCATT 
GGACAAAATC ATACGACTAT 
GTATATAAAG CTTTACAAAA 
GCAA6AATAG CTAAGCATAA 
AAGCCAGGTA CATTTAAATA 
GAGATAAATA GATTGAGAAT 
GTGTATGGAC CGTTGCAAAA 
TTATTGACTA ATAAGATGTT 
TACTTAAAGT TACCTGAAGA 
AAGAAGCTAA AGATAGAGCA 
ACTATACGAA AGAACTTTGT 
GCAAAGATTG TGCAAAAGGC 
TAAAGTTATC TAAAAGTTAT 
TTTGTTATGC AATCAAAGAG 
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TGATGCATTA AACACAGAGG 
ATGCGATCAA TATAAAAAAG 
ATTTAAAAGT CCTAAGAGTC 
TAGATTTATA GGATTAACAG 
AGTTTATTTG ATAGACAGAG 
GTACTTTAAA CCAACACATC 
ATCTGAAGAA AAGATATATG 
TTATCTAGAT ATGCCTGACA 
AAGAAAAGTA TATGCAGAAT 
AGTTGTAGCT CAGAATGGGG 
AGTTTATACA GATGATGAAG 
GGAAATTATA GAGGAGTCTC 
TAAAGAAAGA ATACTTCaAA 
AGAACGTTGG AATAGTGGAG 
TGGATTAAAC TTACAACAAG 
GGAATTATAC CAACAAGCAA 
TATTCATCAC ATTATGACCG 
TAAAGAACTA ACGCAAGAAG 
GTAATGGAGG TATAAGATGG 
TATTGAGTCA GAGATATATA 
GGAGATACTT AACCCAACGA 
AGGTGAACCA GTTAGAACAA 
ACGAAACCTA GAAGAAATGG 
TCATAAGAAA GTAATTAGGC 
AATAGGAGAT GCATGTCACA 
TAAAGCGGTA cGwtATCATG 
CTACAAATCT GTAGTAATAT 
ACGACACAAG TACACGAGGC 
GTGTAAGAGA TGACCAAGCA 



CTGATATCTA TGTAACCAAT 
AATGGCCATT TGATATGGTT 
AAAGGTTTAA ATCTATTAAA 
GAACACCTAG CCCAAATAGT 
GCGAAAGACT TGAGTCTTCA 
AAGTTAGCGA ACATGTTTTT 
AACGAATAGA AGATATATGT 
GAGTTGATAC TAAACAAACA 
TAGAAAAAAA CTATATTTTA 
CATCATTAAG TCAAAAACTA 
ATGTAAGACT TATA.CATGAT 
AAGGCCAACC AATATTATTG 
GGTTTAAGGa AGCAACCACA 
ACATTAAGCT GCTTATAGCA 
GTGGGCACAT TATTGTTTGG 
ATGCAAGATT ATATAGACAA 
ATAACACAAT AGATCAAAGA 
AATTGATGAA AGCTATTAAA 
GAAAGGCATC ATACGATATT 
ACCTACAAGA GAACAAGAAA 
AAGAGCTAGA CACTAACATT 
CTGAACTAAT GGCAACAAGG 
TCGAAGCAGT TGAAAGTGAA 
TAAAGTATTG GAATAGAGAT 
TGCATCGTAA TACAGTTACT 
CAGGTATCAA ATAACATTGT 
GATAGTATCG GAAAGATGTA 
ACATCGCTAT GCGtGTGTCT 
TAATAACATT TATAAGCATG 
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AGATAGCATT AGATAGGGAT AATCATCTTT GTCAAATGTG TTTACGTGAA GACATAGTAA 1920 

CAGATGCAAA CATAGTGCAT CATATTATTT ATGTTGATGA AGATTTTAAT AAAGCTTTAG 1980 

ACTTAGATAA TTTGATGTCA GTTTGTTATA GCTGTCATAA CAAAATTCAT GCAAATGATA 204 0 

ATGACAAAAG TAATCTTAAG AAAATTAGAG TATTAAAAAT TTAAATAAAA AAATAATTTA 2100 

TTTTTATAGC CCCCTACCCA TCGGCTTAAA ATGTTTTTTC GACGGGTACC GGCGGGGGCC 2160 

CTTCGCTTGC AACGCGGATA AACTTTTATG AAAGGGGGTC TTTATATGAA ATTAACAAAA 2220 

AAACAGCTGA AAGAATATAT AGAGGATTAT AAAAAATCTG ATGACATATT AATTAATTTG 22 80 

TATATAGAAA CGTATGAATT TTATTGTCGG TTAAGAGATG AACTTAAAAA TAGTGATTTG 234 0 

ATGATAGAGC ATACAAACAA GGCTGGTGCG AGCAATATTG TTAAGAATCC ATTAAGCATA 2400 

GAACTGACAA AAACAGTTCA AACACTAAAT AACTTACTCA AGTCTATGGG TTTAACTGCA 24 60 

20 GCACAAAGAA AAaAGATAGT TCAAGAAGAA GGTGGATTCG GTGACTATTA AAGTTTTAAA 2520 

TGAACCTTCA CCAAAACTAT TAACAACATG GTATGCAGAG CAAGTCACTC AAGGGAAAAT 2 580 

AAAAACAAGC AAATATGTTA AAAAAGAATG TGAGAGACAC CTTAGATATC TAGAAAATGG 2 640 

^5 AGGTAAATGG GTATTTGATG AAGAATTAGC GCACCGTCCT ATTCGATTCA TAGAAAAGTT 2700 

TTGTAAACCT TCCAAAGGAT CTAAACGTCA ACTTGTATTA CAACCATGGC AACATTTTAT 2760 

TATTGGCAGT TTGTTTGGTT GGGTTCATAA AGAAACAAAA CTGCGCAGGT TTAmAGAAGC 2820 

TTTGATATTT ATGGGGCGAA AAAATGGTAA AACAACTACT ATAtCTGGTG TTGCTAACTA 2880 

TGCTGTTTCT CAAGATGGAG AAAACGGCGC TGAAATCCAT CTTTTAGCAA ACGTAATGAA 2 940 

ACAAGCTAGG ATTCTATTTG ATGAATCTAA GGCGATGATT AAAGCTAGCC CAAAGCTTAG 3000 

AGAAAATTTT AGACCTTTGA GAGATGAAAT TCATTACGAT GCAACTATAT CTAAAATTAT 3 060 

GCCACAGGCT TCAGACAGTG ATAAGTTGGA TGGTTTAAAT ACACATATGG GCATTTTTGA 3120 

TGAAATTCAT GAATTTAAAG ATTATAAATT GATTTCAGTT ATAAAAAACT CAAGAGCGGC 3180 

AAGGTTACAA CCCCTTCTTA TCTACATTAC GACAGCAGGG TACCAACTAG ATGGACCACT 324 0 

TGTTAATATG GTAGAAGCGG GAAGAGACAC CTTAGATCGA ATCATCGAAG ATGAAAGAAC 3300 

4S TTTTTACTAT TTAGCTTCTC TCGATGATGA CGATGATATA AATGATTCGT CGAATTGGAT 3360 

TAAAGCAAAT CCTAACCTAG GTGTTTCTAT CGATTTAGCT GAAATGAAAG AAGAGTGGGA 34 20 

AAAGGCTAAG AGAACACCAG ATGAACGTGG AGATTTTATA ACCAAAAGGT TTAACATCTT 34 80 

SO 

TGCTAATAAT GATGAGATGA GTTTTATTGA TTATCCAACA CTTCAAAAAA ATAATGACAT 354 0 

TATTTCCTTA GATGAGTTGG AAGGTAGACC ATGTACTATA GGTTATGATT TATCAGAAAC 3 600 
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AACACATTCT TGGATTCCTA AGCATAAAGT TGAATATTCT AACGAAAAGA TACCCTATAT 3720 

AGAATGGGAA GAAGACGGAT TACTAACAAT ACAAGATAAT CCTTATATAG ACTACCAAGA 3780 

^ TGTTTTAAAT TGGATAATAA AGATGAATGA GCATTATGTT GTCGAAAAAA TCACTTATGA 3840 

TAGGGCGAAT GCTTTTAAAT TAAATCAAGA GTTAAAGAAT TATGGCTTTG AAACAGAAGA 3900 

AAcwAGACAA GGGGCTTTGA CCTTGAGCCC TGCaTTGAAG GATCTAAAAG AAATGTTTTT 3960 

10 

AGATGGGAAA ATAATATTTA ATAATAATCC TTTAATGAAA TGGTATATCA ATAATGTTCA 4 02 0 

GCTGAAACTA GACAGAAATG GGAACTGGCT GCCATCTAAA CAAAGCAGAT ATCGTAAAAT 4 080 

AGATGGTTTT GCAGCATTTT TAAACACATA TACAGATATT ATGAATAAAG TTGTTTCTGA 414 0 

CAAGGGTGAA GGAAACATAG mATTTATTAG TATTArAGAT ATAATGCGTT AAGGAGGTGA 4 200 

ATGTTATCGC AAAAGAGAAT ATTGTCACAC GCATAAAGAA AAAATTGATA GACAATTGGA 4 260 

20 TTGaTCAGTC AGCTTCTAAG CTTTATGACT TTAGCCCATG GAAAAATAAA TCTTTTTGGG 4 320 

GTGTAATCAA TAATACGCTT GAAACTAATG AAACGATATT TTCAGCTATT ACnAAGTTAT 4 3 80 

CTAATTCGAT GGCTAGTTTG CCCTTGAAAA TGTATGAAGA TTATAAAGTA GTTAATACAG 4440 

AAGTATCTGA TTnACTTACA GTGTCACCGA ATAATTCTCT GAGCAGTTTT GATTTTATTA 4 500 

ATCAAATTGA AACAATCAGA AATGAAAAAG GTAATGCATA TGTG 4 544 
(2) INFORMATION FOR SEQ ID NO: S18: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 
CATGGCAGAA TATTGAAGCT GTAAAAAAAG GACATGTAAT TTCATATAAA GCAGAAGATT 60 

ATTGGTTCAC AGATCCTATT ACATTAGAAC ATTTGAGAAG TAAATTAAAA AAAGAAATTT 120 

TAAATAAAAA ATAATAGAAA TAAGTTGTAA AAATTTTCTT ATGCATTGGT ACTAATGTTT 180 

TTAAGGAGTG ATTAAATGAA GCAACTGGTT GGAATTCCCG AATCAATGTT AATTCCTTTG 240 

ATAGCTCGAG CAAAAGAGTA CGAAAACGAA AAACCAATAA TAAAAGACGC ACTATCAAAA 300 

AAAATATTTG ATGGTTTAGA TGATATGTAC AAAAATGTTA CATGTGATGA CATGTCTCAA 360 

ATTGGAATTA GTATACGTAC TGTGATAATA GATTGTGTTA CTAAGAGGCT TATCAAGGAT 4 20 

AATAAAGATT TAATCGTGGT CAATATAGGT TGTGGCTTAG ATACAAGGTT TCAAAGATTT 4 80 



NSOOaO- 0786519" ' - 



1445 



10 



2S 



EP 0 786 519 A2 

ACATTTTTTA AAGAAAGTAA TAGTTATAAG ATGATATCTA AATCTATGCT AGATTACAGT 600 

TGGATTGATG ATGTCAAAAA TTATAAATTT TTTAATAGTA AGTCAGATAT ATTGTTTATT 660 

ATTGAAGGTG TATTGaTGTA TTTTGATGAG AGTGTAATGA CTCAATTATT GGACACTATT 720 

ATCAAAAAGA TGGGAGATCA TAATTTGACA TTTGCGATTg AATTTTGCTC aAAAACAATT 7B0 

GCGAATAATA CmAAGAGACA TCAATCGGTA TCCAAGTTAT CCTCACCACC TGTTTTTAAA 840 

TATGGGTACC ATGATTTAnA AAAATTGGAT GAnATTTACC C 881 

(2) INFORMATION FOR SEQ ID NO: 519: 

^5 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3122 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 

TGAATaaAAA TATATTAATA GATAAACACA AATGTGTCcA AATACCCCTA GAGGTATTTG 60 

ACnAGTTCCA TCCAACTGTT TAAAATACCC CTACAGGTAT TTTTAGCS3AG GTTATTATGA 12 0 

AACAATACGG AGAAAaGTTT ATCGATGAAT TTAGTAAAGC AGAATTGGAA AAACTAGCCA 180 

AGCAAGGGCA ATTAATTGAC GTTAGAACAG AAGAGGAGTA TGCATTAGGA CATATCAATG 24 0 

GTTCCATACT TCATCCTGTT GATGAGATTG AGTCATTCAA TAAAGAAAAA AATAAAACCT 300 

ATTATGTAAT CTGTAGAAGT GGTAACAGAA GTGCTAATGC TAGTAAATAT TTAGCTAAAC 360 

AAGGTTATAA CGTTATAAAT CTTGATGGTG GTTATAAAGC TTATGAAGAA GAAAACGATA 420 

GTTATGATAC ACAAGAAGAA TATAAAAGTA TAGAAATTAA AGCAGATCGT AAACAATTTA 480 

ACTATCGTGG TCTTCAATGT CCAGGGCCAA TTGTAAAAAT TAGTCAAGAA ATGAAGAATA 54 0 

40 TTGAAGTAGG TGACCAAATT GAAGTCAAAG TCACAGACCC TGGATTCCCT AGTGACATTA 600 

AAAGTTGGGT GAAACAAACA AGGCATACTT TAGTTAAGCT TGATGAAAAT AACAATGGAA 660 

TTAATGCGAT TATTCAAAAA GAAAAAGCAA AAGATTTAGA TATAAATTAT TCTGCTAAAG 720 

GTACTACAAT TGTATTATTT AGTGGAGAAT TAGACAAAGC TGTAGCAGCG TTGATTATTG 780 

CAAATGGTGC TAGAGCTGCT GGAAAAGATG TAACTATCTT CTTTACTTTT TGGGGGCTTA 84 0 

ATGCATTAAA AAAAGTGCAA ACAGTTAATG TTAAAAAGCA AGGTATTGCA AAAATGTTTG 900 

ATTTAATGTT GCCCAAAAAG AATATACGAA TGCCTCTTTC CAAAATGAAT ATGTTTGGTT 960 

TAGGAAATAT GATGATGCGC TACGTAATGA AAAAGAAAAA TGTTGATTCA TTACCAACAC 1020 
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TCATGGGTAT TCAGAAAGAA GAACTTAGAG ATGAAGTTGA GTACGGTGGT GTAGGCACTT 114 0 

ATATTGGTGC TACTGAAAAT GCGAATCATA ATTTATTTAT CTAATTAAAT CTATTAATAA 1200 

5 

AAGGAGTTGT TATCATGTTT TTTAAACAGT TTTACGATAA TCATTTATCT CAAGCATCAT 1260 

ATTTAGTGGG TTGTCAACGT ACAGGAGAGG CAATAATAAT AGACCCTGTT CGTGATTTAT 1320 

CGAAATATAT AGAAGTTGCA GATTCTGAAG GTTTAACAAT TACACAAGCT ACAGAAACAC 1380 

10 

ATATTCATGC TGATTTTGCT TCAGGAATTC GTGATGTGGC TAAACGCTTA AATGCAAATA 144 0 

TATATGTGTC TGGCGAAGGT GAAGATGCAT TAGGGTATAA AAATATGCCA TCAAAAACAC 1500 

;5 AATTTGTTAA ACATGGAGAT ATCATTCAAG TAGGCAATGT TAAATTAGAA GTTCTGCATA 1560 

CTCCAGGACA CACGCCTGAA AGTATTAGCT TTTTACTCAC TGATTTAGGT GGTGGTTCAA 1620 

GTGTTCCGAT GGGATTATTT AGTGGTGACT TTATTTTTGT TGGTGATATA GGTAGACCTG 1680 

20 ATTTAtTAGA AAAATCTGTT CaAATAAAgG GTTCTACAGA AATTAGCGCG AAACAAATGT 174 0 

ATGAGTCCGT TCAAAATATT AAAAATTTAC CAGACTATGT TCAAATCTGG CCGGGTCATG 1800 

GTGCTGGAAG CCCTTGTGGT AAAGCATTAG GTGCCATACC TATATCTACA ATAGGTTATG 1860 

25 • 

AGAAAATTAA TAACTGGGCA TTTAATGAAA TTGATGAGAC TAAATTTATT GAATCATTAA 1920 

CATCAAATCA ACCAGCACCA CCGCATCATT TTGCACAAAT GAAACAAGTT AATCAGTTTG 1980 

GTATGAATTT ATATCAATCA TATGATGTTT ATCCTAGTTT AGATAATAAG AGAGTAGCAT 2040 

30 

TTGATCTTCG TAGCAAAGAG GCCTTTCACG GTGGCCACAC AAAAGGAACA ATCAATATAC 2100 

CATACAACAA AAACTTTATT AATCAAATTG GTTGGTACTT AGATTTTGAA AAAGATATAG 2160 

ATGTAATTGG AGATAAATCT ACTGTTGAGA AAGCGAAACA CACTTTACAA TTAATTGGGT 2220 

35 

TTGATAAGGT AGCAGGCTAT CGTTTGCCAA AATCAGGCAT TTCAACCCAG TCCGTTCATA 2280 

GCGCTGATAT GACAGGTAAA GAAGAACATG TATTAGACGT ACGTAATGAT GAAGAGTGGA 2340 

40 ATAATGGACA CTTAGATCAA GCAGTTAATA TTCCGCATGG TAAATTATTA AATGAAAATA 2400 

TTCCTTTTAA TAAAGAGGAT AAAATATATG TACATTGTCA GTCAGGTGTT AGAAGTTCAA 2460 

TTGCAGTGGG TATATTGGAA AGCAAAGGTT TTGAAAATGT GGTGAATATT AGAGAAGGCT 2520 

''^ ATCAAGATTT TCCAGAATCA TTAAAATAAT TTAAGGATGT GGAAAAAATG AATAAGCATT 2580 

ATCAAATTGT TATTATTGGT GGCGGTACAG CAGGTGTTAC CGTAGCATCA AGACTATTAA 2640 

GAAAAAATCA AAACTTAAAA GAGAAAATAG CAATTATAGA TCCAGCAGAC CATCATTACT 2700 

SO 

ATCAACCATT ATGGACGTTG GTTGGTGCAG GGGTATCTAG TTTGAAAAGT TCTCGTAAAG 2760 

ATATGGAAAG TGTTATACCT GAAGGTGCTA ACTGGATAAA ACAGGCTGTT TCAAGTTTTC 282 0 
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TAGTAGCTCC AGGATTACAG 


ATTAATTGGT 


CTTCAATTAA 


AGGACTAAAA 


GAAAATATAG 


2940 




GTAAAAATGG 


TGTTTGCTCT 


AACTATTCAC 


CTGACTATGT 


TAACgAAACT TGGAACCAAA 


3000 


5 


TTTCTAATTT 


TAAACAAGGA 


AATGCCATTT 


TTACGCATCC 


AAACACTCCT 


ATAAAGTGTG 


3060 




GAGGTGCgCC 


TATGAAAATT 


ATGTATTTAG 


CTGAAGATTA 


TTTTAGGAAA 


CATAAAATCC 


3120 


10 


GT 

(2) INFOEiMATION FOR SEQ ID NO: 52 0: 






3122 


IS 


(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








20 


(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: 








ATAnAGATAT 


AnATnAATAT 


ATTGAGGTCA 


AACGATGATA 


ATTAAAATTT 


TAACAATTCT 


60 




ATTACTACTT 


TGTATATTGA 


GCTATTTGGT 


TACAAATAGA 


AAGAAGCCTT 


TTCTGTTCTT 


120 


25 


AAAGACACTC 


TTTATGGGTG 


TGGTATTTAT 


CTTTATAGGA 


TATATTTCAC 


TGGCAATATC 


180 




TGCCGTAATT 


ATTTATGGTA 


TTATTCAATT 




GATTTTGGTA 


GTTTTTTCTT 


240 


30 


AATGGGTATT 


ATATTGATCT 


TGATTTCAAG 


TATATTCCAA 


TTATTTATAG 


TTAGATTACT 


300 


TTTTAGAAAA 


AAGAATGTCG 


ATTTGACAGA 


GGTTGTCGTT 


TTAsAsCATT 


TAATTCAATG 


360 




GTTCTTAGTT 


TACTTTGCGA 


TCTATCAAGC 


AGTAAATGAA 


AAAATGGACA 


TTAATGATAT 


420 


35 


TAATATCGAC 


AATTTCCAAT 


CTGTCTTTTT 


TGACGTGTCT 


AATTTGAATT 


TAGTAATTCT 


480 




ACCAACGTTA 


ATCATTAGCT 


GGGTCACAAT 


ATTTAACTAT 


AGAATGAGAA 


GTTACAAATA 


540 




AAATCTATGA 


GATTATACCT 


TCAGACACCA 


ACATTCAAAT 


GGTGTCTTTT 


TTGTTGTGTG 


600 


40 


orrrrATTTT 


TGAAATTCGA 


AAAAGTAGAG 


GCATGAATTT 


TTTGACTAGT 


GTATAAGTGC 


660 




TGATGAGTCA 


CAAGATAGAT 


AGCTATATTT 


TGTCTATATT 


ATAAAGTGTT 


TATAGTTAAT 


720 




TAATAATTAG 


TTAATTTCAA 


AAGTTGTATA 


AATAGGATAA 


CTTAATAAAT 


GTAAGATAAT 


780 


45 


AATTTGGAGG 


ATAATTAACA 


TGAAAAATAA 


ATTGATAGCA 


AAATCTTTAT 


TAACATTAGC 


840 




GGCAATAGGT 


ATTACTACAA 


CTACAATTGC 


GTCAACAGCA 


GATGCGAGCG 


AAGGATACGG 


900 


SO 


TCCAAGAGAA 


AAGAAACCAG 


TGAGTATTAA 


TCACAATATC 


GTAGAGTACA 


ATGATGGTAC 


960 


TTTTAAATAT 


CAATCTAGAC 


CAAAATTTAA 


CTCAACACCT 


AAATATATTA AATTCAAACA 


1020 




TGACTATAAT 


ATTTTAGAAT 


TTAACGATGG 


TACATTCGAA 


TATGGTGCAC 


GTCCACAATT 


1080 
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TCAAAATCTT GTGAGAGAAT TTGAAAAAAC ACATACTGTC AGTGCACACA GAAAAGCACA 1200 

AAAGGCAGTC AACTTAGTTT CGTTTGAATA CAAAGTGAAG AAAATGGTCT TACAAGAGCG 1260 

AATTGATAAT GTATTAAAAC AAGGATTAGT TAAATAAAAC TTCAATCGTT GCTGTTATCT 132 0 

GGAAATAATT AATTAAATGT TATGTTAATT TTTGTTAATG AAAAAAGTAA TCTATTTAAT 1380 

GACAGGTTAA TGTAATTGTC CTGAAATTGA CTATATACTC AGTAAGTATC AATTTTAAGG 144 0 

AGAGCTTATA ATGAAATTTA AAAAATATAT ATTAACAGGa ACATTAGCAT TACTTTTATC 1500 

ATCAACTGGG ATAGCAACTA TAGAAGGGAA TAAAGCAGAT GCAAGTAGTC TGGACAAATA 156 0 

TTTAACTGAa aGTCAGTTTC ATGATAAACG CATAGCAGAA GAATTAAGAA CTTTACTTAA 1620 

CAAATCGAAT GTATATGCAT TAGCTGCAGG AAGCTTAAAT CCATATTATA AACGTACGAT 1680 

TATGATGAAT GAATATAGAG CTAAAGCGGC ACTTAAGAAA AATGATTTCG TATCAATGGC 174 0 

TGATGCTAAA GTTGCATTAG AAAAAATATA CAAAGAAATT GATGAAATTA TAAATAGATA 1800 

ATAAATAAAA CAGGTTGAGA CAAAAAATGG TCTTAACCTG TTTTCAATTT GCATATGTGA 18 6 0 

TAAATTCTAT ATCAAAATGC TTATGTATAA TGAATGACAT TTAAAAGTAG GGGAGACAAA 192 0 

25 TATAAATACA ATAGTTCCTA GGATTACTCT CAAAATAACT ATATCAATTA TTTACTTTGC 1980 

TCTCCTATTT TTTAAAATAT GTACATGTTT AAACAATCAA AAGTGTACAA TATTAAATTA 204 0 

TCATTTCCAG TTCTAGTGCT ATATTGGTAG TAGTTGACTA AATGAAAATA AGCTTATAAC 2100 

AAGTTTTTTC AATACTCGTG GGGCCACAAC AGAGAGAAAT AGGATCACCA ATTCCAACAG 2160 

ACAATGCAAG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCACCAATTT CAACAGACAA 2220 

TGCAAGTTGG GGTGGGGCCC AACACAGAAG CTGGCGAAAA GTCAGCATAC AAAAATGTGC 2280 

AAGTTGGCGG GGCCACAACA GAGAGAAATA GGATCACCAA TTCCAACAAA CAATGCAAGT 234 0 

TGGCGGGGGC CCCAACATAG AAGCTGGCGA AAAGTCAGCT TACAAAAATG TGCAAGTTGG 2400 

CGGGGCCCCA ACATAGAGAA ATTGGAACCC CAATTTCTAC AGACAATGCA AGTTGGGGTA 2460 

GGACATCGAT AAAGAAATAC TTTTTCTTTA GCAATTAGTA TTTCTTATGC ATGAGCTTTA 2520 

CTCATGTATT CATTTTTTAA GTACaCATTA GCTACAGCTA ATGATAAAGA ACCACTACAT 2580 

45 AATAAATCAT TAGTGGTTCT TTATCATTTC TATCTCACTC TTTTACTGGA AGAAAAAGTT 264 0 

TACGTTTGTA GAACATGCCA CAATACCAAA AATAATTAAG AAAAATAAGA CGATAAGCAT 2700 

GATGACACTT TTCAAACAAC CTCTATCAGT TTCTTTCGAT ' riTCXTrCn ' GAACCTTTTT 2760 

50 ATAATCTTCA AGTAGTTTTG CGGCTTTTTT ATTTATATGT TTATTCATGA TGTTGACTCC 2 820 

TTATAATATA TGTTTAATTC ATTAAAATAG TTGAAAACAT GACTTGAAAT AAAGATATAA 2880 
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AATTTGATGA 


TGATATTTGC 


TTTTTATTTT 


CCAAATGGAA 


TTTACTTAAA 


CTGATGCATT 


3000 




AAAATATTAA 


TGAAGCACTA 


GAATACATAA 


ATGAATAGTa 


ATGGTGcACA 


GTATAGAATA 


3060 


5 


ATTAAGGCTA 


TATTAAGTAT 


AAATATCGTT 


AACTGTAAGC 


TATCTTTAGT 


TTTAATATAA 


3120 




ACTATTAGGA 


TAATCGACGT 


AAGAAGAATC 


ATATATATTA ATGATGAAGA 


AGTCCATACA 


3180 




AAATCCGCAT 


CATTTGTTGT 


TAATAATGGG 


ACTATAATTA 


ATCCGAAATT 


AATCATGCAT 


3240 


10 


GCTATATATA 


CTATAATGTT 


ATACACAATG 


TTAATTTTTG 


TTCACCACCT 


TATACTTCTA 


3300 




TTTTAAAAAC 


TTCTTTATAA 


TGATATATTG 


TTTAATGTTG 


AAATAATTAG 


ATTATCTAAT 


3360 


15 


TTTCATTTGC 


TTTACATGTA 


AAAGGCTATA 


TATAGTATGC 


TCTTTATGAT 


TCTAAATGCT 


3420 


TTTTAATATT 


TAATGCTCAT 


CAACATTTGG 


ATTTTGAATA 


TTCAATTCAA 


AAACTTTATT 


3480 




AGCTACGTCA 


ATTGTAAAAT 


CAGAACCATA 


GTTGACATGA 


GCTACTTTTA 


ATTTTCCATC 


3540 


20 


TAAATAATAG 


ATTGCGATTG 


CAACATCGTA 


AAATTCGTCA 


ATGACAAATA 


AACTCTTTTC 


3600 




GTTTGTTACA 


ACCTCATGCT 


CTCCTGAGTA 


TACAACGTTA 


ATTTCCCAAT 


CATTAAAAAC 


3660 




CATTTGTTAA 


CCTCCTTGAA 


CATTTAAATT 


GATTCAACTT 


AAGTTTAACT 


TATTCATACA 


3720 


25 


ACTTCGTACA 


ATATCTAGAT 


GAACATTAAT 


TGTATTTCTA 


GAAATCTTTT 


TCAATTATAT 


3780 




GTACTAATTA 


TACT* PTT AA A 


X X 1 \« X X X 1 


CAGTATAGTT 


TTAAAACGAT 


TTTAAAATAA 


3840 




TTCTGCAAAT 


ATATTAACAC 


ATAATGTGTT 


CAAAAAGTTT 


TGAACAATTT 


CAAAACTTTT 


3900 


30 


ATATAAAGGG 
GTCATGTTAG 


nTTGACAACA 
CCAATTTTTA 


TGGATTCAAA 
AG 


TnTCTTATTT 


TAAAAATTAC 


CTCATATAGT 


396 0 
3982 



(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521: 
AGCTTGGaTG ATTTAATaTG GcCCtTTCCC AaCcTTAGAT AATGAAAGAT tTGATAATCC 60 

45 

TGAATATAAA GAAGCTATGA AAAAATATcA ACAGAGATTT ATGGCTGmAG ATGAGGCTTT 120 
GAAGAAATTT TTTAGTGAAG AGAAAAAAAT aAAAAATGGA AATACTGATA ATTTAGATTA 180 
so TCTAGGATTA TCTCATGAAA GATATGAAAG TGTATTTAAT ACTTTGAAAA AACAAAGTGA 24 0 

GGAGTTCTTA AAAGAAATTG AAGATATAAA AAAAGATAAC CCTGAATTGA AAGACTTTAA 3 00 

55 
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GTTAGGTAAA ACATTTTATC AAAACTATAG AGATGATGTT GAAAGTTTAT ATAGTAAGTT 4 20 

AGATTTAATT ATGGGATATA AAGATGAAGA AAGAGCAAAT AAAAAAGCAG TTAACAAAAG 4 80 

^ GATGTTAGAA AATAAAAAAG AAGACTTAGA AACCATAATT GATGAATTTT TTAGTGATAT 54 0 

AGATAAAACA AGACCTAATA ATATTCCTGT TTTAGAAGAT GAAAAACAAG AAGAGAAAAA 600 

TCATAAAAAT ATGGCTCAAT TAAAATCTGA CACTGAAGCA GCAAAAAGTG ATGAATCAAA 660 

10 

AAGAAGCAAG AGAAGTAAAA GAAGTTTAAA TACTCAAAAT CACAAACCTG CATCTCAAGA 720 

AGTTTCTGAA CAACAAAAAG CTGAATATGA TAAAAGAGCA GAAGAAAGAA AAGCGAGATT 780 

TTTGGATAAT CAAAAAATTA AGAAAACACC TGTAGTGTCA TTAGAATATG ATTTTGAGCA 84 0 

TAAACAACGT ATTGACAACG AAAACGACAA GAAACTTGTG GTTTCTGCAC GAAGAAAGAA 900 

ACCAACATCA CCGACTACAT ATACTGAAAC AACGACACAG GTACCAATGC CTACAGTTGA 960 

20 GCGTCAAACT CAGCAACAAA TTATTTATAA TGCACCAAAA CAATTGGCTG GATTAAATGG 1020 

TGAAAGTCAT GATTTCACAA CAACGCATCA ATCACCAACA ACTTCAAATC ACACGCATAA 1080 

TAATGTTGTT GAATTTGAAG AAACGTCTGC TTTACCTGGT AGAAAATCAG GATCACTGGT 114 0 

25 TGGTATAAGT CAAATTGATT CTTCTCATCT AACTGAACGT GAGAAGCGTG TAATTAAGCG 1200 

TGAACACGTT AGAGAAGCTC AAAAGTTAGT TGATAATTAT AAAGATACAC ATAGTTATAA 1260 

AGACCGAATA AATGCACAAC AAAAAGTAAA TACTTTAAGT GAAGGTCATC AAAAACXTTTr 132 0 

TAATAAACAA ATCAATAAAG TATATAATGG GCA 13 53 

(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 



GTCTGAGTCA GGTGCTGTTT GGTTAGATGC TGAAAAAACA AGTCCTTATG AATTTTATCA 60 

45 ATTCTGGATT AATCAATCAG ACGAAGATGT AATTAAATTC TTnAAATACT TTACTTTCTT 12 0 

AGGAAAAGAA GAAATTGATC GCTTAGAACA ATCTAAAAAT GAAGCACCGC ATTTACGTGA 180 

AGCTCAAAAA ACATTAGCTG AAGAAGTAAC TAAATTTATT CATGGTGAAG ATGCATTAAA 24 0 

50 TGATGCAATC CGTATTTCAC AAGCATTATT TAGTGGTGAT TTAAAATCAT TATCAGCGAA 3 00 

AGAATTAAAA GATGGATTTA AAGATGTGCC TCAAGTGACA TTATCAAATG ACACAACAAA 3 60 
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TGTTAACAAT 


GGTGCGATTT 


ATATTAATGG 


TGAGAGACAA 


CAAGATGTTA 




480 




AGCACCAGAA 


GATAAAATTG 


ATGGCGAATT 


TACGATTATT 


CGTCGCGGTA 


2VGAAAAAATA 


c jt n 
540 


5 


CTTCATGGTT 


AACTATCAAT 


AAATATAATT 


GCATAGCTAA 


ATAAATTAGA 


GCCTACTCAT 

^7^^ X^»W X ^ap^^ X 


600 




ATTCATTCCT 


AAGAATGTAA 


TGAGTAGGCT 


CTTAATGTAC 


TTTTCTGTCT 


GTAAATTAXP 


c £ n 
bo 0 




TAAAGAAATC 


TCTATCGCCT 


TGGCCAGGTG 


ATTGACGTTC 


TGAT CGGCTT 


TGACGTmvS 

X NJ'^W^? X X X X 


f Zu 


10 


GTTGTTCTTT 


TTGTTGTTTT 


AATTTCACTT 


TAACTTCTTT 


TGTTTTACCA 


TCArGGATAJV 

X Vm'^WvJVJ^^ X 


ion 




CGGTAACAGT 


GACTGATTCA 


CCAGGTTTTT 


TATTTTCATA 


T AAAT AG CTT 


CTTAAATrAA 


PAH 


IS 


CATCATCTTT 


AATTTTCTTG 


CCATCAATTT 


CTGTAATAAT 


ATCACCTTTT 


TTAAGATCVaA 


Q n n 

3 U w 


TATCACTATC 


AGCTTTGGCG 


ACATAAATAC 


CGTCTTCTCT 


ATCAGTATGA 


AGTrGrTPGr* 

X 1 V7\» X V^Vj\v 


y D u 




GCTCTTCTTC 


AGGAATATCT 


TTCAAATTAA 


TTAAACCAAT 


ACCAATCGAA 




1 r\ o 


20 


TTTTACCATG 


TTTTACAAGT 


TGTTCAATTG 


TTACTTTAAC 


X X X X 




1 rt fi rt 




ACCCGATACC 


TTCAACTTGT 


GTCGCAGCAA 


TTTTCATTGA 


GTTAATACCA 


ACTAAATTAr' 






CATTAATATC 


TACTAATGCG 


CCACCTGAGT 


T AC CTGGGTT 


AATAGCAGCA 


X ^ X u X i. X vKnn 


T o A r* 


25 


GAACGCTAAC 


TTTTGTATTG 


CCACCAGTTG 


TCTCAGCGTC 


AATCGTACGT 




X z b U 




AAATGATACC 


AGATGTTACA 


GAGTTAGCAA 


ATTGTAATCC 


TAATGGGTTA 




X J ^ U 




ATACGCTATC 


GCCAGTTTGT 


ACTTTTGAAG 


AGTTGGCAAA 


TTGAAXCGCT 


TTAAT* A f^fTT 

X X^W\Xn^w^X A 


1 ft 
x^ oU 


30 


TTGTATTTTC 


AATTTTAAGT 


ACAGCAATAT 


CAGTTACTGC 


ATCTTTACCA 


ACPAATTTf G 


1 A d o 

X *« ** 




CTTTTAACTTG 


TTTTTTATTA 


TGTAATTGGa 


CTCTAATTTC 


ATTTGCGCCA 


TCAATAACAT 


X ^ U VI 




GATTGTTTGT 


AACGATATAA 


GCTGAATTGT 


TGTTTATTTG 


aTAGATAACA 


CCTGAACCTA 


1560 


35 


CTCCAGCTTC 


AGATGGTTTA 


GATGATTTGC 


CyTTTAATAA GTCGTCTACA 


CTTGATGCTT 


X O £ V 




TTtGCATGDc 


AATAACTCCA 


ACAATTGTAG 


GGGAGACAGA 


TTTTATCATT 


TCATGAACGG 


X O O v 


40 


TACCGAATTT 


CTTGCTTTGA 


CCGTCTAATT 


GATTGCCACC 


TTTATTATTT 


GTTGTCTGAA 




CAGTTGAACC 


ATCTTTATTT 


AAAATTGTAC 


TATTTAATAC 


TTTGCCTATA 


CCAAGTACTTA 


X O u u 




GAAGTGCACC 


AATAATTCCA 


GCAATCAATG 


CAACGATGAC 


TGTnTTAAAC 


CATGGAAATT 


X o o w 


45 


TAGGTCTTCT 


GTATCTAGGT 


GTTTGGCTAT 


GGTTTGTTGT 


AGAATGATCT 


GTATGATTAA 


1920 




AATCTGACAT 


ACTTAACCTC 


CATTATATGA 


TTTATATATG 


CTTTAATTAT 


GTCTTTTnTT 


1980 




TATAATT 












1987 


SO 


(2) INFORMATION FOR SEQ ID NO: 523: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 
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(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523: 





TTAGAAGTCA 


AATCATTAnT 


GGCGTCnTAT 


CGAGTATACT 


ATTAACTTCA 


ACTATTTTAG 


60 


10 


CAATTGCATA 


TATTTTAATG 


TGGTTTAACG 


GCCATATGAC 


aCTAACTTTG 


ACCTTAACGA 


120 




CAATAATTAC 


AAGCTGTTTA 


ACCTTATTAA 


TATGTAGTAT 


TTTTATTAAT 


CCACTTATAC 


180 




AAAAAATTAA 


GCAGTTTAAT 


ATAAAAACTA 


AGCAATTTGC 


TAACGGAAAT 


TACGCAAGCA 


240 


15 


ATGATAAAAC 


GTTTAATTCA 


CCAAAAGAAA 


TTTATGAATT 


AAATCAATCT 


TTTAATAAAA 


300 




TGGCTTCTGA 


AATTACGCAA 


CAAATGAATC 


AAATTAAATC 


CGAACAACAA 


GAAAAAACAG 


360 




AACTGATTCA 


AAACTTAGCC 


CATGATTTAA 


AAACACCTTT 


AGCAAGCATT 


ATTTCATATT 


420 




CTCAAGGACT 


ACGTGATGGT 


ATAATCACTA 


AGGATCATGA 


GATTAAAGAG 


TCATACGACA 


480 




TATTAATTAA 


ACAAGCAAAC 


AGATTATCAA 


CATTATTTGA 


TGATATGACT 


CATATTATCA 


540 




CTTTAAATAC 


AGGTAAAACA 


TATCCCCCAG 


AATTAATACA 


ACTAGACCAA 


TTACTTGTAT 


600 


25 


CAATATTGCA 


ACCATATGAG 


CAACGTATCA 


AACATGAAAA 


CCGCACATTA 


GAAGTGAATT 


660 




TCTGTAACGA 


AATTGATGCA 


TTTTATCAAT 


ATCGAACGCC 


ACTTGAGCGT 


ATTTTAACAA 


720 


30 


ACTTACTTGA 


TAATGCGCTA 


AAATTTTCAA 


ATGTTGGTAG 


TCGCATTGAT 


ATTAATATTA 


780 


GTGAAAACGA 


AGATCAAGAT 


ACTATCGACA 


TTGCTATTAG 


CGATGAAGGT 


ATTGGCATTA 


840 




TACCAGAACT 


ACAAGAACGT 


ATATTCGAAC 


GTACATTCAG 


AGTAGAAAAC 


TCTCGTAATA 


900 


35 


CAAAAACGGG 


TGGTTCTGGA 


TTAGGCTTAT 


ATATAGCTAA 


TGAACTCGCG 


CAACAAAATA 


960 




ACGCAAAAAT 


CAGTGTAAGC 


AGTGATATAG 


ATGTAGGAAC 


TACGATGACT 


GTAACATTAC 


1020 




ACAAATTAGA 


CATTACGTCA 


TAATCCGATT 


TATTTATAAA 


ATAAAATGCA 


AAGACTAAAA 


1080 


40 


AGAAGCTCCC 


ATTAATGAGG 


GCTTCTTTTT 


TTGTTTATTT 


AGAATAAACT 


TTATGGGTAT 


1140 




CCTTCTCATC 


ATTTTCAAGA 


CTTGAAAGAT 


TTGTAGCTTG 


AATAATATAT 


TTAGGACGTG 


1200 




CCTTAACTTC 


ATAATATATC 


CTGCCAATAT 


ATTCACCTAC 


AACACCAATT 


GAAATTAACT 


1260 


45 


GTATGCCGCC 


TAATAATAAA 


ATAGCTGCAA 


TCGTTGAAAA 


ATATCCCGGA 


ATATTAACAC 


1320 




CAGATATCAT 


AATATTGATG 


GAATAAATAG 


A 






1351 



(2) INFORMATION FOR SEQ ID NO: 524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 
TTGTTGTCAG TTTAACGCAA CGTGTTACCG ACTTTTCAAG TACACAAcAA TATATATGTC 
TATTCCATTT TTAGCCCcTG CCATTTTCAT CATTGGTGGT ATTATGTTGT TTATTTCAAC 
ATTTAATAGC TTAGATGAAA CTGCTGAAAA TAATAACAAA ATAAAGAAAC TAATGATTAA 



AGGACTTATC ATTATTAACA nTCATTTAT CGTTATGATG GTTTTAACAC CATATTGGTA 240 

CTTGTATTTA ATCGTCTATC TTATTTTCTT GXTGITTTTA ITGTGGCAAA AGGTTTATAA 3 00 

ATTTTAATAC CAAAACTATT AAACACTTCT GATATTCTTA GTTCAAAATA TCAGAAGTGT 360 

TTTATAGTGT TATCTAGTTC AGATAAATAT TTCCTTACTT AAAAAAACGC CCTCCTCTTA 4 20 



TTTTGACCCC nAT 

(2) INFORMATION FOR SEQ ID NO: 525: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 



30 


CCCCCACTAT 


GATATGCTTG 


GCCTATTGCG 


TCAATCCCCT 


TATATTTCGG 


CAATAGAGAT 


60 


GGATGTATAT 


TCAATATTTT 


ACCTTCAAAT 


GAAGCTAATA 


AGTCTGGACC 


AATTAGACGC 


120 




ATGTAGCCAG 


CTAGAATAAT 


CCACTCTACC 


TTATCTTCAT 


TTAATAATGT 


TACTAAATGT 


180 


35 


TGTTCATACG 


CTGCTTTTGA 


ATCAAATTGT 


TTTGGTTCAT 


TAATATAAAC 


AGGAATATCG 


240 




TGcrrrrrixi 


CTCTATCTAT 


ACAAAACGCA 


TTTTGATGAT 


CCGTATATAG 


CGCCGTAACT 


300 




TCAATATTTT 


CAAGTTTTCC 


TGATTCAACA 


TGCTCAACTA 


TATTTkCAAA 


GTTACTTCCT 


360 


40 


GAACCTGATG 


CAAAAATCGC 


AATTTTAACC 


ATTGTTATAC 


CCCCAACAAT 


TCAATTGCAG 


420 




TTGACTCATT 


TTTCACAATA 


TGACCAATTT 


GATAGGCTTC 


CACATTTTGT 


TCTGCTAAAA 


480 




TCTTCAAAGC 


GCGTGATACA 


TCTTTTTCAT 


CAACGATAAC 


CGTATAGCCA ATACCCATAT 


540 


45 


TAAAAATGTT 


ATACATTTCA 


TTTGTGTCTA TATTGCCTTG 


TTGTTGTAAC 


CAATCAAATA 


600 




TTTTTGGTGT 


TGGAAATGAT 


GTAGTATCAA 


TTCTAGCAGC 


ATATCCGGCT 


GGCAATGCAC 


660 




GTGGAATATT 


TTCATAAAAA 


CCTCCACCAG 


TAATATGATT 


CATTGCCTTA ATAGAAACTT 


720 


SO 


crrrrrrcAA 


AGCAAGTACA 


GGTTTGACAT 


ATAATTTAGT 


TGGTTCTAAA 


AAGACATCGA 


780 
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GCACTAAACT GTATCCATTT GAATGAATGC CACTTGACGC AAGCCCTATA ACAACTTGTC 900 

CCTCTTTCAC TTCTGAACCA TCTACATAGT CATCCTTTTC AACTGCTCCA ACAGCAAATC 960 

CAGCTACATC ATATTCGCCT TCGTGATACA TTTCACCCAT TTCAGCAGTC TCTCCACCGA 1020 

TAAGTGCAGT ATTCGTTTCA ACACATGCAT CACTAATACC TTTAACAATT TGTTCAATAA 1080 

CTTCAGGAAC AACTTTGTTT GTAGCAATGT AATCTAAAAA ATATAATGGT TCTGCACCTG 114 0 

TCGTTAAAAT ATCATTAACA CACATTGCGA CTGCATCGAT ACCTATCGAA TCATGTTTAC 1200 

CATAGTCGAT AGCTAGTTTT AATTTCGTAC CTACTCCGTC TGTTCCAGAA ACTAAAACTG 1260 

GCGCTGtCAT AnTAATTGT GATAAATCAA ATGTAGCACC GAATCCACCT AAACCACCGA 1320 

TAACTTCTTT ACGCATCGTA CGCTTAACaw GaCtAGaCaT TCtTTCtACA GCTTCATAAC 138 0 

CAGCATGAAT ATTTACACCA GATTGTTCAT ATGCTTTAGA CATTTAAATT TCCCTCGCTA 144 0 

TCAAAAAAGT GTTTGmTT AGAAATATAT TGCTTTTGTC GATGACTTAA ATGCGCTTTA 1500 

TAATTTGCTT CATAATCATA TAACCCTGCA GGATAATCTC CAGTGAAACT TTCTACACAT 1560 

AAGCCACTAT ATGGCGCGTC ATAATCTAAA CCAATTGATT CAATTAACCC ATCTACAGAT 1620 

25 AGATATGCTA ATGAATCAGC GCCAATATAA TCTTTAATTT CTTCAGGTGA TTTGCTTGCA 16 80 

GAAATTAATT CTGCCGTAGT TGAAACATCG ATTCCGTAAA AACTTGGAAA CATAAATTCC 174 0 

GGTGATGCTA TACGCACATG CACTTkATTT GCACCAGAAT CkTTTAACAT TTTCACAATG 1800 

30 CGTCGAATTG TCGTACCGCG nACAATGGAA TCATCAACAA GACTG 184 5 
(2) INFORMATION FOR SEQ ID NO: 526: 



20 



35 



40 



45 



SO 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 

TCATAAATnG TAAATGTTAA TCGTCATATA ATATTAATTT ACAACACCAT TTTGGTTATT 60 

TGAAGCTTGT GGCGCTTGTT GTGTGCCACC TTGATTTTGA TTTGAGTTTT GATCTGTAGC 120 

AGGTTGTTGT TGATTGCTGG AmtCACTGTT ATTAGTTGAm TCACTGTTCT CGTTAGATGT 180 

CTTATCTTTA TCTGTCGTAT CATTTTGTTC TTTTCTCAAT AAACTATTAT CTAAAGGCGT 24 0 

TAATGGTATT AATGAACCAT AATGATTAAT GACACGTTGA TCTAAGAAAT CATTTTTATC 300 

ATTAATAGGT GATAATTCTA AGTCTTTACG AAGTAAGTTT GCATATTTTT GAATGCTTTC 360 
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5 



10 



15 



20 



AGXTTTAATT 


TCAACATCAT 




\JA 1111 1 1 


AAAGCTTTAA 


TTTCAGAGTT 


480 


AGTTAAATTA 


TGCTTTGCAT 


XXX Xr\^.^xA\. 


A UTTTYt a ATP 
An X X XVjMM X V* 


AUlj 1 xaTCAA 


GTTTATCAAT 


540 


AGAATCAACT 


TCCTGTGCTT 


TTTGGA AT A A 
X X X VtfVxnnXAA 


x\nx \.« X XnnX X 


u u ^^^^^^^^^ ^P^P^P 

AAl lULAl X 1 


w* WVjTTGTC C 


600 


ACGTTTTAAG 


TCTGAATCAT 


G A TGTOT ArST" 


TPTHrSPli. aPT 


v»u 1 AAAtSCtrr 


CAT CACCaTT 


660 


TAATTTTTGG 


xr^v_VvW X X X X X 


X«/\l X X X>v%X 


PTTaPPaPTH 


X \»A X L. 1 1 vjr 1 


TAGGTTCATT 


720 


TAAGTCGTAT 




AV? X n X /I X Vd^^ 


AL.V»A/\\J»^ 1 (JA 


TTTACAGCCT 


CGACAAATGC 


780 


XXX wr\ 4, f \ X X vj 




xv\ X Ari X \^f\J\\^ 


ALKj 1 Av^A 1 1 V- 


ATGGTAGCTT 


CTACCGAATC 


840 




ATTfiG A CP A r* 


a T a TYIP TYS 
V..>n.l>ix\j^«A lAj 


l\7lJA 1 V9ViV7 1 A 


AT CTT ATCGT 


AATAGCCaAC 


900 


TTTAGGAATG 


TAGCTGATAG 


TATCACGTGG 


AATACTAAGC 


ATTCTAATTT 


GATGTTTTGA 


960 


TTGATTAAAA 


GTAGTTAAAA 


TCATAGCGTC 


TGATCTAGAG 


TGTTCAGCAT 


CCTGTCCTTT 


1020 


TTTTCTTCTT 


CCATCGTTAT 


CATCGATACC 


TAAGAAAAGA 


ATAGAGATAG 


GTTGTTCTTC 


1080 


GGGATGACTT 


TATTATCTCT 


TAAGTTGGAT 


TGACGTTAGC 


ATTTTTGCTG 


TCTTGAGAAG 


1140 



^ 1141 
(2) INFORMATION FOR SEQ ID NO: 527: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527: 



40 



45 



50 



GCACCATTAG 


CATTGGGCnC 


ACTGATTGGA 


GTAnCAGTTG 


TTGAAAATTC 


GGCGCC/UVCA 


60 


AGTAAACmGG 


CmCaGGCaGC 


mATamCCCCa 


TATTATAcTT 


ATAATGGTTA 


TATTGGTAAT 


120 


AATGCTAATT 


TTATTCTGGA 


TAAGAATTTT 


ATTAACGCGA 


TTAAGTATGA 


TAATGTGAAA 


180 


TTCAATGGTA 


TTAAATTAGC 


TAAAACGAAT 


ACGATAAAAA 


AAGTAGAGAA 


ATATGATCAA 


240 


ACrriTAAAG 


GTGTTTCTGC 


AAAAGGAAAC 


GAAGCAAGTC 


AATTGCAATT 


TGTAGTTAAA 


300 


AATAATATTT 


CATTAAAAGA 


TATCCAAAAA 


GCTTATGGCA 


AGGACTTGAA 


AAAAGAAAAT 


360 


GGTAAAACAA 


AGGAAGCTGA 


TAGCGGTATT 


TTTTACTATC 


AAAATGCTAA 


AAAGACATTA 


420 


GGCATCTGGT 


TTGTCGTTGA 


TCATAATAGA 


GTTGTCGAAG 


TAACAGTTGG 


ACATACACCA 


480 


TACAAAACAA 


GTAAATAAAA 


TAATGGCATA 


TTAAGGCTAG 


AGTGTGAGGA 


GTGATACCGC 


540 


ATTCTAGTCT 


TTTTTATTAA 


ATAATAACGA 


TTATTGCGTC 


TTACATAGTT 


GTTTGAAATT 


600 
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GATTAAGTAT ATAGAGCACT ATTTTGTATT TGTTAATATT TTCACAAAAA TAAAGCCTTG 720 

ATAAATTTTA AAATATAATT AAGCTCAATT TTTAAAATTT TATTTAGCTA CAGATAACAT 780 

TTTTAAAAAA GAAAAGAATC AATAAATAAA ATCAACGAAC AAAAAGTATA GAAATAAATA 840 

GAAATAATCA TTTACTTTTC TGAAAAATTA AATTAATATT TTATTTATAA GCTGTTTTTA 900 

AGATTTCAGG AGGAATGAAA TGTGaGGAAA TTTTCAAGAT ATGCATTTAC AAGTATGGCA 960 

GCATTAACCT TGTTGAGCAC TTTATCACCA GCAGCATTAG CGATTGATTC AAAAAATAAA 1020 

CCAGCTAATT CTGATATTAA ATTTGAGGTG ACTCAAAAGA GTGATGCGGT CAAAGCATTA 1080 

AAAGAATTGC CTAAATCCGA AAATGTAAAA AATATTTATC AAGATTACGC TGTTACTGAT 114 0 

GTAAAAACTG ATAAAAAAGG ATTTACGCAT TATACATTGC AACCGAGTGT TGATGGTGTT 1^00 

CATGCACCTG aCAAAGAAGT GAAAGTACAC GCAGACAAAT CAGGAAAAGT CGTTTTAATC 1260 

20 AATGGGGATA CTGATGCGAA GAAAGTAAAG CCAACGAATA AAGTGACATT AAGTAAAGAT 1320 

GACGCAGCCG ACAAAGCATT TAAAGCAGTT AAGATTGATA AGAATAAAGC GAAAAATCTT 13 80 

aAAGATAAAG TCATTAAAGA rAACAAAGTT GAAATCGATG GTGACAGTAA TAAATACGTT 1440 

TATAATGTTG AGTTAATTAC AGTGACACCA GAAATTTCAC ATTGGAAAGT TAAAATTGAT 1500 

GCTCAAACTG GCGAAATTTT AGAAAAAATG AACTTAGTTA AAGAAGCTGC AGAAACTGGT 1560 

AAAGG 1565 

30 

(2) INFORMATION FOR SEQ ID NO: 528: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528: 

GGATAGGTAA ATCCATTCAA AATTGGTTGT ACTCATTTTA AATCATATTT TTTAATATAG 60 

AAACACATAA TTAAAGGAGT GATGATATTA ATGAATAAAC TTCGAGACAC TACTTTTCTA 120 

TCATATTTAT TTACTATCAT ATTGTGGGGA TCTGCATTTC CAATGATAAA GATTGCGTTA 180 

AATGATTTTA GTGCAGAATC ATTGTCGGCA TTTCGTTTAA TTTTAGCAAC AATAATTCTC 24 0 

TTGCCGTTCG TAATTATAAA GAAATTGCCT ACCCCTGAAC TAAGAGATAT CCCTGTTATT 300 

TTTATTTTAG GATTTTGTGG ATTTGTGATA TATCACACAG CTTTAAATTT TGGTGAAaCT 360 

TTGATTAGTG CAGGTATAtC TGGrATTCyA GTCTCTACAA CGCCTATTTT TTCTAGTGCT 4 20 
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GCATTTATAG GAATATCCAT TATTTCAATA AGTAAAGATG ATTACACAAC TATTAATGTA 54 0 

TTAGGTGTTT TTATTATTTT ACTTGCATCT TTTAGTGAAA GTTTGTATTT TACTTTCCAG 600 

AAAAAATACA TAGAAAAATA CGGCTTCATC GCTTTCACAC TATaTACAAT AATGGCAAGC 660 

TCACCATTTA TGCTTATTTT TATTCCTGAA ATCATCAACG ATATACACGG CGCCACTTTT 720 

ACATCAATAG TATCGGTACT TTATTTAGCT ATATTCCCTA CTATAATTCC ATACGTTTTG 780 

CTTGCTTATA TTGTGAAGTC AGTTGGTGTC TCTGATGCAA CAATGTCTCT TTATTTAACA 84 0 

CCTATCGTTT CTTTATTATT ATCTTATCTG TTATTAGACG AGCTACCAAC AACCCTTGCT 900 

75 ATTATAGGCG GAATTATCAC TCTACTAGGC GTTAGTTTAA GTAACTTCTT TCAAAATACA 960 

TAATTATTCC AAGTCCCGCA CCTCAGAATC CAAAAACATT CGAGTGATAA AATTTTAAAA 1020 

ATCAAAAATA TAAAAATGAT CTAATTTCGT CAAATTTACC AATATAAATA CTAATATTTG 1080 

CAATTCACAA AGGGGTATAG TCTGAGTGTA TTCTAATACG AAAGGACTTG GTGGATATGT 114 0 

ATTACAGTTA TGGAAATTAT GAAGCATTTG CGCGCCCTAA AAAACCTGAA AATGTAGAAA 1200 

ACAAATCCGC TTACTTAATC GGATCTGGTC TAGCTTCACT TGCTGCAGCT TGTTTTTTAA 12 6 O 

TAAGGGATGG TCAAATGGAA GGTTCGAAGA TTCATATTTT AGAAGAGTTA CCTAAAGCAG 1320 

GTGGTAGTCT TGATGGTGAA AATATGCCTT TAAAAGGCTA TGTTGTCCGC GGTGGTCGTG 1380 

AAATGGAGAA CCACTTTGAA TGTTTGTGGG ACTTATTCAG ATCTATCCCT TCATTAGAAA 144 0 

TCGATAACGC GTCTGTATTA GATGAGTTCT ATTGGCTAAA CAAAGAAGAC CCTAACTATT 1500 

CTCGCTGTCG TGTTATTGAG AAACAGGGTC AACGTTTAGT CACAGACGGA GACTTCACTT 1560 

TGACTAAAAC GGCGATTAAA GAAATTTTAG ATTTATGCTT AACGAATGAA GAAGATTTAG 1620 

ATGATGTCAA AATAACAGAT GTATTTTCCG ATGACTTCTT TAATTCAAAC TTTTGGATTT 1680 

ACTGGAAAAC GATGTTTGCA TTTGAACCGT GGCATTCTGC AATGGAAATG CGTCGCTATC 174 0 

40 TAATGCGATT CGTTCATCAT ATTAGTGGTC TCGCAGACTT TTCAGCTTTA AAATTCACTA 1800 

AATATAATCA ATATGAATCT TTAGTATTAC CtATGGTTGA ATATTTAAAA TCGCATGGGG 1860 

TTCAATTTGA ATACGATGTA AAAGTCGAAG ATATTAAAAT AGATGTTACG ACAAGTCAAA 1920 

AAATTGCCCG AGAAATATTA ATTGACCGTA ATGGTAATGC AGAATCTATT AAACTGACTA 1980 

TAAACGATCT TGTCTTTGTG ACAAACGGTA GTATTACAGA AAGCTCTACT TATGGTGATA 204 0 

ATGATACACC AGCGCCACCA ACTGACGAAT TAGGTGGTAG TTGGACACTA TGGAAAAATT 2100 

TAGCGCGACA AAGTCCTGAA TTTGGTAATC CTGATAAGTT TTGCCAAAAT ATTCCTAAAA 2160 

AAAGTTGGTT TGTTTCAGCA ACTTCTACAA CAAACAATAA AGAGATTATC GATACAATAG 2220 
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TCAATGATTC TGCATGGCAA ATGAGTTTTA CAATCAATCG TCAGCAACAG TTTAAAGACC " 234 0 

AACCTGAAAA TGAAATATCT ACATGGATTT ATGCCTTATA TTCAGATGTA AACGGCGATT 2400 

ATATTAAAAA GCCAATTACA GAATGTAGTG GTAATGAAAT ATGCCAAGAA TGGCTGTATC 2460 

ACTTAGGTGT ATCAACTGAC AAAATTGAAG ACTTAGCAAA ACATGCATCT AATACGATTC 2520 

CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC GACAGACCTT 2580 

TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA GAAACAGAGC 264 0 

GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT GTTTATCAAT 2700 

TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT CTTCGCGTCT 27 60 

TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT ACTAAAGATT 2820 

CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 2870 
20 (2) INFORMATION FOR SEQ ID NO: 529: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
2S . (D) TOPOLOGY: linear 



75 



30 
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40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529: 

CnGTTTAGCT GCAAAAAGTA AATAACGACA CTGTATTTGG AATTTTGCAA TTAGAAACAC 6 0 

TTTTGGGTGA CATTAACTCA ATTTTCAGCG AGATTGAAAG CGAATACAAA ATGTCTAGAG 120 

AAGAAATTTT AATTTTACTA ACTTTATGGC AAAAAGGTTC TATGACGCTT AAAGAAATGG 180 

ACAGATTTGT TGAAGTTAAA CCGTATAAGC GTACGAGAAC GTATAATAAT TTAGTTGAAT 240 

TAGAATGGAT TTACAAAGAG CGTCCTGTTG ACGATGAAAG AACAGTTATT ATTCATTTCA 300 

ATGAAAAGTT ACAACAAGAG AAAGTAGAGT TGTTGAATTT CATCAGTGAT GCGATTGCAA 360 

GTAGAGCAAC AQCAATGCAA AATAGTTTAA ACGCAATTAT TGCTGTGTAA GTTTAATAGC 420 . 

ATAAAAAGAG GTTTTCATTA AGTTGAAAAC CTCTTTTTGT TGTTGGCATT AATTTTTCAA 4 80 

^ ATGTTGACTA CTCAATCCTA AATTATAAAT AGTATAGCGC aCAAaTGCTT AAGAAATTTT 540 

TTCTATGGCA CAAATGAATG GAGCATGATT ACGTTGGTTT AAAAATTGAT ATTGCAAAAC 600 

TTGCGCATGC TTTTGATCCA AAGTACTCAA GTAATCAAGC AATGCATGCT TCTCAATTTG 660 

SO 

TCCTTCGCTA TGACCATGAT ATATAACAAG TACAATAATA CCTTCAATTG ACATTAATGA 720 

TAGCAATGAA TTAATAGCTT GGATTGTCGT GTCAGGCTTT GTCACGATAG ATTTATCACC 780 

55 



1459 



TATATGATGT TCAATATTTT CATGTCCATC 
AACTTTATCA CGTGTATTTT CCAAAGCTAA 
^ TTCTGGTACT TGTTCGGCTA AAAATAAAGT 
AACAATACTT TCTGGTGTTA TATGTTGTTT 
TTCTAATTTC ATTGCTTCAC CTTAGACTTG 

10 

GCTAATTCAG CATCGATGCC ATTTAATACT 
ACCATGATAT CTATTGGACC ATCACCGGTA 
ACTTCTAATT GGTCCACAAC TAGGTTTGTG 

75 

CCTTTATCGT ATTGTTTTAC CATCGGTGTA 
ATACCTTGTA CATCCATTTG TGCAcTTCTT 

20 CGCCAGGTAA GCCATTAATG ATGTGTGTAC 
CCACACCATC ATAATAAGTT TTCATATCAT 
GATGGATTGT TTGTAGTCCT AATTCAACCC 

25 AATATTCGAC AACATCGTCT GGTAGACAGT 
CACCCGGTTC TTTAAGTACA GGTTCGAATT 

TATTTGTAAA TGCCTGAAAA TAAGCAATAT 

30 

TTTCTTTAAT TTCTTTAAAT TGTACTGCGA 

CGCTACCTGC AGCAGAACAA AATGTACATC 

GACAGTCAAA CCCGCCATCC AATGCAACTT 

35 

AATGGTAATT CCATGTGTGA TAACGTTTGT 
TATGTCATTT TCCTTTCTAT AAAAAAAGAG 
GTTATAGTGT TTATTATAGT TTGACAAAAA 
CTAAATCAGT CTGGTGGCTA GTAATTGGCA 
TGTGGCCTTT AAATACAATT TATATGAAAC 
45 GTTTAGTGCT AATGATAAAT TCATTTGGCA 
TATTTGATAA ATTAGGTGGA TACAAGACGA 
GTACAACGCT ACTTAATTTC TTTCACGGGT 

SO 

TAGGGTTTGG TGGCGGAATG ATTATTCCTG 
CAAATGGCGG AAGACAAACG TTTAATGCGA 

55 
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TTTTATTAAA GAAACATGAT TGAAATCCTT " 900 

ATCTTGAATG TCGAAACCAT AAACATGTCC 960 

GTCATTGCCG TTACCGCAAG TTGCGTCTAC 1020 

AATAAGTGTT TTTGAAAAAG GGAGTATACG 10 80 

TAACGCAAGC CTTGATAAGA ATTTCTACGT 1140 

TCCCATTTAT TAACACTCCA CATTGGACCT 12 00 

ATTCGGTGAA CGATCATTTC AGGGGGAATC 126 0 

TACTCTTCTT GAGTCATAAA AGTTAATAAA 1320 

CCTTTTAACA AATGAAGTAA ATGAATTTTA 1380 

TGGCAGTAGC CATCATCATG TCATAGTCTT 144 0 

ATACATTGAT ATTATGCTTA CGTA AT TT T G 1500 

GGGCACGATT GATTAAATCA GATGTTGACT 1560 

ATAAGTATGT TCGTTGATTC AAATCTGCTA 1620 

CAGGACGCGT ACCAATAGAT AATCCCACAA 1680 

TTTCTTTTAA TACTTCAACC GGTGCATGTG 174 0 

ATTTTCCTTC GTGCCATTTC TCATGCATCT 1800 

TTGAATCTGC ACGATTACCT GCAAAGTCTC 1860 

CACCATGTGC TACAGTGCCA TCGCGGTTAG 1920 

TAAATATTTT TTGTCCAAAT TTATTTTTTA 1980 

TTTCAAAAGC GTATTGGAAA TGATTGCCCA 204 0 

TTCTAAGTAC AGATTTTAAC ATATTTTAAT 2100 

AGAGAGAGGA ACTATGAAAT ATGAATATAC 2160 

TGGCGTTAAA TATTACTGGT TCCAGTTTTT 222 0 

AAGAACTTGG AAAAAGTTTA ACTCTTGCTG 2280 

TGGTTATTGG AAACTTATTA GGTGGTTCAC 234 0 

TTTTAATTGG AACTTTCACT TGTCTTTGTA 24 00 

GGCCTTGGTA TGCTGTATGG CTTGTAATGT 2 4 60 

CGATATACGC TATGGCTGGA GCAGTGTGGC 2 520 

TATACTTAGC GCAAAATATT GGTGTGGCTG 2 580 
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ATCTTATTAT GTATGTTGTG TTTGCGCTTG TCGCGGTAAC GCAATTTAAT ATTGAAATTA 2700 

ATGCGAAAGT TAAATATCCA ACTCATTTAG ATATTACTGG TAAAAAGAAT AAAGCAAGAT 2760 

TTATTTCATT AGTACTAATT TGTGCAATGT TTGCAATTTG TTGGGTTGCA TATATTCAAT 2820 

GGGAGTCTAC AATCGCTTCA TTTACACAAT CTATTAATAT TTCAATGGCA CAATATAGTG 2880 

TTTTATGGAC AATTAACGGA ATAATGATTT TAGTAGCACA ACCATTAATT AAACCGATTC 2 940 

TCTATCTGTT AAAAGGAAAC TTAAAGAAGC AAATGTTTGT CGGCATCATC ATTTTTATGT 30 00 

TGTCGTTCTT TGTCACGAGT TTTGCCGAAA ACTTTACAAT ATTTGyTGTC GGTATGATTA 3060 

TTTTAACTTT TGGAGaATGT TTGTATGGCC AGCAGTTCCA ACTAT 3105 
(2) INFORMATION FOR SEQ ID NO: 53 0: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH:. 5532 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRAT'IDEDNESS: double 

(D) TOPOLOGY: linear 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 0: 
TAATGATTAA ACCAGATGCA GTACAAAGAA ATCTAATTGG TGAAGTAATT TCAAGAATTG 60 

AAAGAAAAGG ACTAAAACTT GTCGGTGGTA AATTAATGCA AGTACCAATG GAACTTGCTG 120 

AAACACATTA TGGTGAACAC CAAGGTAAAC CATTTTATAA TGATTTAATT TCATTTATTA . 180 

CATCAGCACC AGTGTTCGCA ATGGTAGTTG AAGGTGAAGA TGCAGTTAAT GTATCTAGAC 24 0 

ATATTATTGG CAGCACCAAT CCTTCAGAAG CTTCACCAGG ATCAATTAGA GGTGATTTAG 3.00 

GTTTAACTGT TGGTAGAAAT ATCATTCACG GTTCAGATTC ATTAGAGTCT GCTGAACGTG 360 

AAATTAATCT ATGGTTTAAT GAAAATGAAA TTACTAGCTA TGCTTCACCA CGTGATGCAT 420 

40 GGTTATATGA ATAAAATATA AACTGTAAAC CTTTACGATT TATTTATAAA GGTAGAAAGG 4 80 

GTTTTGTTAT GTGGTTAGTC ATTATGaTTA TACATAACAA GGCCCGTTTT TTATGTTGTA 54 0 

GTAAATTACT TGAAAAATTT TATAGTTTTT kGGTAAcACG TAtTaAAAaG AGAGGAATAT 6 00 

TCTTTATCAA ATGAAACTAA ACAGAGAGAA GGGGTTGTTA AAATGAAgAA TATTATTTCG 660 

ATTATTTkGG GGATTTTAAT GTTCTTAAAA TTAATGGAAT TACTATATGG TGCTATATTT 72 0 

TTAGATAAAC CACTTAATCC TATAACAAAA ATTATTTTTA TACTGACTCT CATTTATATT 7 80 

TTTTATGTAT TAGTAAAAGA ATTGATTATA TTTTTGaAGT CAAAGTATAA CAAAAGCGCT 84 0 

TAACATATGT ATATTTTAAT ATCATAATTT TTTTAAACGG ACTGATTAAC TTTATTAATA 900 
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GATACGATTA TATTAAAACG GCTAATCATT TTTAATTAAT GATTATATGA TGCAACTGTT 1020 

TAGAAATTCA TGATACTTTT CTACAGACGA ATATATTATA ATTAATTTTA GTTCGTTTAA 1080 

TATTAAGATA ATTCTGACAT TTAAAATGAG ATGTCATCCA TTTTCTTAAT TGAGCTTGAA 1140 

AACAAACATT TATGAATGCA CAATGAATAT GATAAGATTA ACAACATATT ATAATGTTAT 1200 

CGTGGAAGTA TGAAAGGAGC GAGTGTGTAT GAGATACCTA ACATCAGGAG AATCACATGG 1260 

ACCTCAATTA ACAGTTATTG TTGAAGGTGT ACCTGCAAAT ATAGAAATTA AGGTTGAGGA 13 20 

TATTAATAAA GAAATGTTTA AGCGTCAAGG CGGTTACGGA CGTGGACGTC GTATGCAAAT 13 80 

TGAGAAAGAT ACAGTAGAAA TAGTATCAGG CGTTAGAAAT GGTTATACAT TAGGTAGTCC 1440 

AATTACTATG GTTGTAACCA ATGATGACTT TACGCATTGG AGAAAAATTA TGGGAGCAGC 1500 

TCCAATAAGT GAAGAAGAAC GTGAAAATAT GAAACGTACT ATTACAAAAC CAAGACCTGG 1560 

20 TCATGCAGAT TTGGTTGGAG GTATGAAATA TAATCATCGT GATTTACGAA ATGTGCTAGA 1620 

GCGATCATCT GCTAGAGAAA CAGCAGCTCG AGTTGCAGTC GGTGCCTTAT GTAAAGTGTT 1680 

ATTACAACAG TTAGATATCG ATATATACAG TCGTGTTGTT GAAATAGGTG GAATTAAAGA 174 0 

TAAAGATTTT TATGATTCAG AAACATTTAA AGCAAATCTT GATCGTAATG ATGTTCGTGT 1800 

AATTGATGAC AGTATCGCAC AAGCAATGCG AGATAAAATT GACGAaGCTA AAAATGAAGG 1860 

AGATTCAATT GGCGGTGTCG TTCAAGTTGT AGTTGAAAAT ATGCCTGTTG GTGTAGGTAG 1920 

TTATGTGCAT TATGATCGTA AGTTAGATGG TAAGATTGCA CAAGGTGTTG TCAGCATAAA 1980 

TGCTTTTAAA GGTGTAAGCT TTGGTGAAGG ATTTAAAGCA GCTGAAAAGC CAGGTAGTGA 204 0 

GATTCAAGAT GAAATTCTAT ATAATAGTGA AATTGGTTAT TATCGTGGAT CTAATCACTT 2100 

AGGTGGTTTA GAAGGCGGTA TGTCAAATGG AATGCCAATT ATCGTTAATG GTGTAATGAA 2160 

ACCAATTCCA ACGTTATATA AACCATTAAA TTCAGTAGAC ATTAATACTA AAGAAGACTT 2220 

40 TAAAGCAACA ATTGAACGTT CTGATAGTTG TGCTGTTCCT GCAGCAAGTA TCGTCTGCGA 2280 

ACATGTCGTA GCATTTGAAA TAGCAAAAGC ATTATTGGAA . GAATTCCAAT CAAATCATAT 234 0 

TGAGCAACTT AAACAACAAA TTATTGAGCG CAGACAATTA AATATTGAGT TTTAACAACA 2400 

AGAACAATTG AGGTGTAATC ATGAAATTAC AAACAACATA CCCTTCAAAT AATTATCCAA 2460 

TATATGTTGA ACACGGTGCA ATTGACCATA TTAGCACGTA TATTGATCAG TTTGATCAAA 2520 

GTTTTATATT AATTGACXIAG CATGTAAATC AATATTTTGC TGATAAATTT GATGATATTT 2580 

50 

TATCATATGA AAATGTACAT AAAGTTATTA TTCCAGCTGG TGAAAAGACG AAAACATTTG 264 0 

AGCAATATCA AGAAACATTA GAGTATATTT TATCCCATCA TGTAACTCGT AATACAGCAA 2700 
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ACGAGGCGTG CACTTTATAC AAGTGCCAAC GACTATACTA GCGCATGATT CTAGTGTTGG 2820 

CGGTAAAGTG GGTATTAACT CAAAGCAAGG TAAAAACCTT ATCGGTGCAT TTTATCGTCC 2880 

AACTGCTGTG ATTTATGATT TAGTCTTTTT AAAGACGTTA CCATTTGAGC AAATATTAAG 294 0 

TGGCTATGCA GAAGTTTATA AGCATGCGTT ATTGAATGGT GAATCAGCGA CGCAAGATAT 3000 

CGAACAGCAC TTTAAAGATA GAGAGATATT ACAGTCATTA AATGGTATGG ATAAATATAT 3 060 

TGCTAAAGGT ATTGAAACGA AGCTGGATAT TGTTATTGCA GATGAAAAAG AACAAGGTGT 3120 

ACGTAAATTT TTAAATTTAG GTCATACATT TGGTCATGCT GTTGAATACT ATCATAAAAT 3180 

ACCTCATGGT CATGCAGTGA TGGTTGGCAT TATCTATCAA TTTATAGTTG CGAATGCTTT 3240 

GTTTGATTCT AAGCATGATA TTAATCATTA TATTCAATAT TTAATACAAC TCGGCTATCC 3300 

TTTAGACATG ATAACTGACT TGGATTTTGA AACGTTATAC CAATATATGC TAAGTGATAA 33€0 

AAAGAATGAT AAGCAAGGTG TACAAATGGT CTTGATTAGA CAArTTGGAG ATATCGTTGT 34 2 0 

ACAACATGTT GATCAACTAA CATTACAACA TGCATGTGAA CAATTAAAAA CATATTTTAA 34 80 

GTAGGTGAAT GAAATGGTAA ATGAACAAAT CATTGATATT TCAGGTCCGT TAAAGGGCGA 3 54 0- 

AATAGAAGTG CCGGGCGATA AGTCAATGAC ACACCGTGCA ATCATGTTGG CGTCGCTAGC 3600 

TGAAGGTGTA TCTACTATAT ATAAGCCACT ACTTGGCGAA GATTGTCGTC GTACGATGGA 366 0 

CATTTTCCGA CTGTTAGGTG TAGAAATCAA AGAAGATGAT GAAAAATTAG TTGTGACTTC 3720 

CCCAGGATAT CAATCTTTTA ACACGCCACA TCAAGTATTG TATACAGGTA ATTCTGGTAC 37 8 0 

GACAACACGA TTATTGGCAG GTTTGTTAAG TGGTTTAGGT ATTGAAAGTG TTTTGTCTGG 3 84 0 

CGATGTTTCA ATTGGTAAAA GGCCAATGGA TCGTGTCTTG AGACCATTGA AACTTATGGA 3900 

TGCGAATATT GAAGGTATTG AAGATAATTA TACACCATTA ATTATTAAGC CATCTGTCAT 396 0 

AAAAGGTATA AATTATCAAA TGGAAGTTGC AAGTGCACAA GTAAAAAGTG CCATTTTATT 4020 

TGCAAGTTTG TTTTCTAAGG AACCGACCAT CATTAAAGAA TTAGATGTAA GTCGAAATCA 4080 

TACTGAGACG ATGTTCAAAC ATTTTAATAT TCCAATTGAA GCAGAAGGGT TATCAATTAA 4140 

TACAACCCCT GAAGCAATTC GATACATTAA ACCTGCAGAT TTTCATGTTC CTGGCGATAT 4200 

TTCATCTGCA GCGTTCTTTA TTGTTGCAGC ACTTATCACA CCAGGAAGTG ATGTAACAAT 4260 

TCATAATGTT GGAATCAATC CAACACGTTC AGGTATTATT GATATTGTTG AAAAAATGGG 4320 

CGGTAATATC CAACTTTTCA ATCAAACAAC TGGTGCTGAA CCTACTGCTT CTATTCGTAT 4 3 80 

SO 

TCAATACACA CCAATGCTTC AACCAATAAC AATCGAAGGA GAATTAGTTC GAAAAGCAAT 44 4 0 

TGATGAACTG CCTGTAATAG CATTACTTTG TACACAAGCA GTTGGCACGA GTACAATTAA 4500 
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AAACTTGTTA GGGTTTGAAT TACAACCAAC TAATGATGGA TTGATTATTC ATCCGTCAGA 4620 

ATTTAAAACA AATGCAACAG TTGATAGTTT AACTGATCAT CGAATAGGAA TGATGCTTGC 4 680 

AGTTGCTTCT CTACTTTCAA GCGAGCCTGT CAAAATCAAA CAATTTGATG CTGTAAATGT 474 0 

ATCATTTCCA GGATTTTTAC CAAAACTAAA GCTTTTAGAA AATGAGGGAT AATATAAAAT 4 800 

GGAAGATATC TATAAATTAA TAGACGATAT CAATCTTACAA AAACTAGAAA ATTTAGACTC 4 860 

TCGTGTTAAT GAAGCAATAA CTACTGACAA CGATGACGCA TTATTTATTC TAGGAGAGAC 4 920 

ACTTTACAAT TTTGGATTAA TGCCaCAAGG TTTGGAAGTA TTCCGCGTGT TATATCACAA 4980 

ATATCCAGAC GAAAGTGAAT TGCTGATTTA TTTTATTGAA GGTTTAATGT CTGAAAATCA 504 0 

AACTGACGAA GCGTTAGAAT ATTTAAGTTA TGTTGAACCA TCACCTGAAA AGTTGATGTT 5100 

AGAAGCAGAT TTATATCAAC AAATTAATAT GATGGAAGTT GCTATTGATA AATTACAAGA 516 0 

20 AGCACTTGAA CTAGAGCCAA ATGATCCAAT AATCCATTTT GCATTGGCTG AAATGTTATA 522 0 

TTATGATGGT CAATATTTAC GTGCTACCTC TGAATACGAA ACCGTTTTAG AAACTGGTGA 5280 

ATATCAAGTT AATGGTGTAA ACTTATTCTC TCGTATGGCA GATTGTAGTT TACAAAGTGG 534 0 

kAACTATAGT GATkCcGATt CgCTTATACXS ATGrAATTAA TGAAGATGAA ATGACTTCAG 5400 

AAGATTATCT CAAAAGAGCC ATTTCTnACG ATAAAAATGA CATCACTCAA GAAGCAATTA 54 60 

AAATAATGAC TACATTACTt TCTAAAGATC CTGATTATAT TCAAgGCTAC TTGTATTTAC 5520 

aATCaTTATA TG 5532 
(2) INFORMATION FOR SEQ ID NO: 531: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 

AATTGGGTTA TACTATAGGT AAATTTAAGG AGGTAAGAAA ATGGATAAAA AAGAATTAGC 60 

GAAATTTATA GGCAATAAAA TCAGATACTA TAGAACCaAA TTGaACTTAA CTCAAGATCa 120 

ACTTGGAGAA AAACTCmACa CTAAAAArGC TACTATTTCA AATTATGAGA CAGGGTACAG 18 0 

AACTCCTAAA CAAGATGATT TGTTTGAAAT TGCTCATATT TTAAATATCA GTATCGATGA 24 0 

SO 

TTTGTTTCCT ACAAGAAATA ATAAAAAAAA CGACATCACT TCCATATACA ACAAACTCAC 3 00 

ACCTCCCCGG CAAGAAAACG TACTTAACTA CGCAAATGAG CAATTAGATG AACAGAATAA 3 60 
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AACTGGTGCT GGCATAGGAG AAGAATTATA TGATGACATA TTGCATGAAG AAGTATTTTT 
TAAAGAAGAC GAAACGCCAT CAAATGCTGA TTTTTGTATT TTAGTTAATG GTGATTCAAT 
GGAACCTATG TTAAAACAAG GAACATACX3C TTTTATTAAG AAAGAAGATT CTATTAAAGA 
TGGTACAATT GCACTCGTTG TATTAGATGG AGTAAGTCTT ATCAAGCGTG TAGATATATG 
CGAAGACTAT ATTAATTTGG TATCTCTAAA TCCGAAGTAT GATGATATCA AAGTCGCTTC 
GTTTAGTAAT ATTAAAGTAA TGGGCAAAGT TGTATTGTGA TTAATAGCGC CTATATGGCA 
CTTTAATATA AAAGACGTCT ATTTCAGCAG TGTTTAAAAG GAGTTTATAA TGAAAATAAC 
TAATTGCAAA ATAAAAAAAG AAACTATAGT ATATGAAGTT TTAACTAGTG GTAATCAACC 
ATTCACTTAT GAGTTACCTA AAGATTTATC GTCACATAAT GC 
(2) INFORMATION FOR SEQ ID NO: 532: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LEl^GTK; 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 
TGGnAAATGG AAACCAAAAT ATGaTCCTCG TGcAAGTTGA AGCGGGACGT TATGAAGAAT 
GGGTAAAGAA TGGTTATTTT AAACCGTCAG AAGATAAATC AAAAGAAACA TATACAATTG 
TTATCCCGCC ACCAAATGTA ACTGGTAAAT TACATTTAGG ACATGCATGG GATACGACTT 
TACAAGATAT CATTACACGT ATGAAACGTA TGCAAGGATA CGATACGTTA TACTTACCAG 
GTATGGATCA TGCTGGTATT GCGACACAGG CAAAGGTAGA AGCTAAATTA AATGAACAAG 
GAATAACTAG ATATGATCTT GGTCGTGAAA AGTriTTAGA ACAGGCATGG GATTGGAAAG 
AAGAGTATGC GTCATTTATT CGTGCGCAAT GGGCTAAATT AGGTCTAGGT TTAGATT 
(2) INFORMATION FOR SEQ ID NO: 533: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533: 
GATCCTGAAC CCGCATTTGT TTCCACTAAA ACAGTATGCC CACTTTCTAC TAAAGCGTGC 
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ATTTTCATAC CATCCACCTC CATAATCATC TTAACGCGAA CATTTTGAAA GCGCAATCAA 180 

AAATCCACAA AATTGTAAAG GTTATTACAC TGACTTTTCC GAAAATTGTG GTAAAATATA 240 

ATTAAGAAAG AACAAGGAGG CACTTACTAT GATTACTTAC AAAAATATTT TAATCGCAGT 300 

TGACGGTTCA CATGAAGCGG AATGGGCATT TAACAGAGCA GTTGGTGTTG CTAAACGTAA 360 

CGATGCGAAG TTAACAATTG TGAATGTAAT TGATTCAAGA ACGTATTCTT CTTATGAAGT 420 

TTATGATGCT CAATTTACTG AAAAATCTAA GCATTTTGCA GAAGAATTAT TAAATGGTTA 4 80 

TAAAGAAGTA GCTACTAACG CTGGTGTTAA AGATGTAGAA ACGCGTCTAG AGTTTGGyTC 540 

TCyTAAATCT ATCATTCCTA AAAAGCTTGC ACATGAAATT AATGCAGACT TGATTATGAG 600 

TGGTACATCA GGCTTAAATG CCGTGGAAAg ATTTATTGTT GGTTCTGTAT CAGAATCTAT 660 

CGTTCGTCAT GCGCCATGTG ACGTGTTAGT TGTTCGTACT GAAGAGTTAC CAGCAGACTT 720 

20 CCAACCACAA GTT 733 
(2) INFORMATION FOR SEQ ID NO: 534: 



IS 



2S 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6060 base pairs 

(B) TYPE: nucleic acid • 
<C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 
TGATGATCCT GAAGCGCTAT TGGATAATTA CAACACTGAA GATGTTGATG CACACAATTA 60 
CAATAATATA AATCATGTTA TTTTTGCCTG CGATGCGGGT ATGGGTTCTA GTGCAATGGG 120 

3S 

TGCArGCATG TTACGTAATA AATTTAAAAA GGCGGGCATT AATGATATTA CAGTTACAAA 180 
TACTGCGATT AATCAATTGC CAAAAGATGC TCAATTAGTT ATCACTCAGA AAAAACTAAC 240 
40 TGATCGTGCT ATTAAACAAA CACCAAATGC CATCCATATT TCAGTGGATA ATTTCCTTAA 3 00 

TTCACCAAGA TATGAAGAAC TTTTAAATAA TCTAAAAAAA GATGATCAAG CATAATAATT 360 
AAATAAATTA AAAAATGGAG GATACCGCCA TGTTATTGAG TACACGTGAA AAAGAAATGA 420 
TAGCCCtAtT GATTAAGTAC CACGGtCAAt ATATCACTAT ACACGACATT GCTCAGCAAC 4 80 

TTGCGGTGTC CTCTCGTACT ATTCACCGTG AATTAAAAGG TGTTGAAGCA TATTTAACTT 54 0 

CATTTTCATT AACTTTAGAA CGCGCAAACA AAAAAGGGcT ACGCATTGCT GGCACAGATT 600 
CTGATTTAAA CGATTTGAAG CAATCGATTG CACAACATCA AACCATTGAC TTATCTGTTG 660 
AAGAGCAGAA AGTAATTATT ATATACGCTT TGATACAAGC CAAGGAGCCA GTTAAACAAT 720 

55 



45 



SO 



3NSOCXaO: <£P_0786519A2_I_> 



1466 



10 



IS 



EP0 786 519 A2 

TAGAGCTTGA TTTAAATAAG TACCAACTAT CTTTATCTCG AAAGCGTGGC GAAGGCATTT 840 
ACTTGGTAGG TACTGAATCA AAGAAACGTG AATTTTTAAG TCAATTAATG GTGAATAACT 900 
TAAATAGTAC TAGCGTTTAT TCAGTAATTG AAAATCATTT TGTCTTTCAT TCATTAAATC 960 

AAATCCACAA AGACTTTGTT GACTTAGAGC GCATTTTTAA TGTTGAAAGA CtATTaATGG 1020 

ACTACCTAAG TGCCTTACCC TACCAACTTA CCGAATCAAG TTATTTAACT TTAACTGTCC 1080 

ATATCGTGCT CTCCATTTCA CGTATAAAAA ATGGAGAGtA TGTCGCATTA AACGATGATA 114 0 

TTTATGATTC TGTACAAAAC ACATTTGAAC ACAAAGTaGc AAGCGAACTT GCTGATAAAC 1200 

TTGGTCAAAT ATATGACGTC ACGTTTAATC AGGCAGAAAT TGCTTTCATT ACTATCCATT 1260 

TACGTGGAGC TAAACGAAAA AATCTTAATG ATACATCATT AAATAATCGT TGTGAAGAAA 1320 

ACAAAATTAA AGCGTTTGTT AACAAAGTAG AAATGATTTC CGGTATGACA TTTGCAGATT 1380 

20 TGGATACTTT AGTAGATGGA CTGACGCTAC ACCTTAATCC TGCAATCAAT CGTTTGCAAG 1440 

CTAATATCGA GACCTATAAT CCGTTAACAG ACATGATTAA GTTCAAATAT CCAAGACTAT 1500 

TTGAAAATGT AAGATTAGCT TTAAATGATT GTTGGCCTGA TTTGATTTTT CCAGAGAGTG 1560 

2S AAATTGCTTT TATAGTtTTA CACTTTGGTG GCTCGATTAA AAACCAAGGT AATCGATTTT 162 0 

TAAACATATT AGTCGTTTGC AGCAGTGGTA TGGGAACTAG TCGTCTATTA TCAACTCGTC 1680 

TAGAGCAAGT TTTTAGTGAG ATTGAGCGTA TTACACAAGC ATCAGTCAGC GATTTGAAGT 1740 

CACTAGATTT AAGTCAATAT GATGGCATTA TTTCTACTGT GAATTTAGAC ATCGACTCCC 1800^ 

CCTATTTAAC GGTAAACCCA TTATTACCAG ATAGTGATAT CAGTTATGTC GCACAGTTTT 1860 

TAAATACAAA GTCTACGTTC CAAGAGACGC ATGATAAATC ATCAAACATG ATTGATAAGG 1920 

ATGATGTTCA TGTTGAAACG AAAGATGTTG ATGGCAACAC ATCTTTTOAA AATGAACAAA 1980 

CTTCATACTT AACTTCAGTT TTCGAAAAAC ATTTAAGTGA CGAAAAATCA GAACAATTAT 2040 

TGCATCATAT GCGTTCGGGT TTAACTTTGC TTGATTCAGT GAAAATAGTT AGTACCX3AAG 2100 

TTAAACAGTG GCAAACATAT ATCGCAGATT ATCTATATCA ATGCGATGTA ATAAACGATC 2160 

CAACGTCATT CGCTGAACTA CTAGAGCAAC GATTGATTGA CAATCCAGGA TGGATATTAA 2220 

45 GTCCATATCC TGTTGCAATA CCACACCTAA GAGACAATAT GATTAAACAC CCTATGATTC 2280 

TAATCACAGT TTTAGAAGAA CCGTTAACAT TGCCTAGTAT TCAAAATGAC AATCAAACAA 234 0 

TTAAATATAT GATTTCCATG TTTATTTCTG ACAATGATTT TATGGCATCA CTGGTAAGTG 24 00 

SO 

ACTTGTCCGA ATTTTTAAGT TTGAAATTAG AATCTATTGA TACTTTTATG GAAAATCCAC 24 60 

AGGAACTTGA AACATTATTA AGAAACAAAT TTTTAGAACG AATTAAAAAA CAATTTATTT 2 520 
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TAACAGCCAA AATGAAGCAA TTGAAAAAGC AGGTAAAGCC TTAGTTGATA GTGGTGCTGT 2640 

AACAGATGCT TATATTCAAG CAATGAAAGA TCGTGAGCAA GTCX3TATCAA CATTTATGGG 2700 

AAATGGCTTA GCAATTCCTC ATGGCACAGA TGAAGCTAAA ACAAATGTGA TTCACTCAGG 2760 

TTTAACATTA TTACAAATCC CTGAAGGCGT TGACTGGGAT GGCGAAGTAG TTAAAGTTGT 2820 

CGTGGGAATT GCTGGTAAAG ATGGCGAACA TTTAGACTTG TTATCTAAAA TTGCAATTAC 2 880 

ATTTAGCGAA GAAGAAAATG TGGATCGTAT CGTTCAAGCA AAATCTGCAG AAGAAATTAA 2 94 0 

ACAAGTATTC GAGGAGGCAG ATGCATAATG AAAGCAGTTC ACTTTGGTGC TGGTAACATA 3000 

GGTCGTGGTT TCATTGGTTA TATTCTgCAG ACAACAATGT TAAAGTAACA TTTGCAGACG 3 060 

TCAATGAAGA AATCATTAAT GCTTTAGCTC ATGATCATCA ATACGATGTT ATTTTAGCTG 3120 

ATGAGTCTAA AACAACGACG CGCGTGAAtA ATGCTGATGC AATTAATTCA ATGCAACCTT 3180 

20 CTGAAGCGTT GAAACAAGCA ATTCTAGAAG CTGATATTAT TACAACAGCT GTTGGTGTTA 324 0 

ACATACTACC TATTATTGCT AAATCTTTTG CGCCTTTCTT AAAAGAAAAA ACAAACCATG 3 3 00 

TTAATATTGT TGCTTGTGAG AATGCTATTA TGGCAACTGA TACATTGAAA AAAGCAGTAC 3360 

TTGATATTAC TGGCCCTCTT GGTAACnaTA TTCATTTTGC TAACTCAGCA GTTOATAGAA 3420 

TTGTACCATT ACAAAAGAAT GAAAATATAT TAGACGTTAT GGTTGAGCCA TTTTACGAAT 3480 

GGGTTGTTGA AAAAGATGCA TGGTATGGTC CAGAACTAAA CCATATTAAA TATGTTGATG 354 0 

ATTTAACACC ATATATTGAG CGTAAATTAT TAACTGTGAA TACAGGACAT GCATATTTAG 3600 

CGTATgCTGG tAAATTTGCA GGTAAAGCTA CAGTTTTAGA TGCAGTTGAA GATAGTTCAA 3660 

TTGAAGCTGG CTTACGCCGT GTTTTAGCTG AAACTAGTCA ATATATTACT AATGAATTTG 3720 

ATTTTACTGA AGCGGAACAA GCTGGTTATG TTGAAAAAAT AATAGATCGT TTCAACAATT 3780 

CTTATTTATC TGATGAAGTA ACACGTGTCG GACGAGGTAC ATTACGTAAA ATTGGCCCTA 3840 

40 AAGATAGAAT TATAAAACCA TTAACATATC TTTATAATAA AGATTTAGAA CGCACTGGTT 3900 

TATTAAATAC AGCTGCATTG TTATTGAAGT ATGATGATAC AGCAGACCAA GAAACTGTTG 3960 

AGAAAAATAA TTACATTAAA GAACACGGTT TAAAAGCGTT TTTAAGTGAA TATGCTAAAG 4020 

TTGACGATGG CTTAGCCGAT GAAATAATTG AAGCGTACAA TTCACTTTCA TAATTTATTG 4 080 

AGCTTTGTTT GAAACAAGAA GTTTCCAACG TTATTCGTTA ACAATCAGTA ATAATGTAGT 4140 

AGTTCCCTTG AATTAACAAT ATTAAATTTC TGAACATAAA AAATACTCCC TTCAACATAG 4200 

ACACTTAACT TGTGTTATGT ATGAAAGGAG TATTTTTGCG TTAATAATTT GTTTTATTTT 4260 

CGAGCCACAG CCACCTATTC AATGGCTATT GGTCATTACT AAAACAAATT CATATTAACT 4 320 
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(A) LENGTH: 977 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 5: 



10 


AACAAAGCCT 


TCCAATTATC 


TGCGTCGGTA 


GAACAAGTAT 


TAGCAACTTT 


ATCACCTACG 


60 




CTAAACAGTC 


CTTACGATTT 


ATACGGCACG 


ACAAAAATGC 


TAGATATTAC 


ATTCGATTCA 


120 




TTTGAACATG 


ATGGTACAAC 


GTACCCTGTC 


GACTATGCTA 


CGTTTGAAAA 


TGATTATGAA 


180 


IS 


GATAATAAAG 


ATCCTGAGTT 


TAGACGTAAA 


AGTTTCAAAT 


CGTTTAGCGA 


TGGGATTCGA 


240 




AAATATCAGC 


ATACTACCGC 


GGCTACATAT 


AATATGCAAG 


TACAACAAGA 


AAAAATTGAA 


300 


20 


GCTGATTTAC 


GTGGATTTGA 


ATCAGTCATC 


GATTATTTAT 


TACATAGTCA 


AGAAGTAACG 


360 


CGTGATATGT 


TTGACCGTCA 


AATCGATATG 


ATTATGCGTG 


ACTTGGCACC 


AGTTATGCAG 


420 




AAATATGCTA 


AACTTTTACA 


ACGTATTCAC 


GGATTAGATA 


ACATGCGTTT 


TGAAGACTTG 


480 


25 


AAGATTTCTG 


TAGACCCTGA 


TTATGAACCA 


GAGATTTCAA 


TTGAAGACTC 


AAAAAATTAT 


540 




ATTTTCGGTG 


CGTTAAGTC5T 






AL-A 1 GTTACG 


TGAAGCATAC 


600 




GATCAGCGAT 


GGaTTGATTT 


TGCACAAAAT 


AAAGGTAAAG 


ATACAGGCGC 


ATTTTGTGCA 


660 


30 


AGTCCATACT 


TTACACATTC 


ATATGTGTTT 


ATTTCTTGGA 


CTGGTAAAAT 


GGCTGAAGCA 


720 




TTTGTCTTAG 


CACATGAATT 


AGGTCATGCA 


GGTCATTTTA 


CATTAGCTCA 


AAAACATCAA 


780 




CCATATCTTG 


AATCAGAAGC 


ATCAATGTAC 


TTTGTTGAAG 


CCCCTTCTAC 


AATGAATGAA 


B40 


35 


ATGTTGATGG 


CCAATTATTT 


ATTTAACACA 


AGTGATAATC 


CAAGATTTAA 


GCGTtGGGTT 


900 




ATTGGCTCAA 


TTTTATCTAG 


AACATATTAT 


CATAATATGG 


tACCCmTTTA 


TTAGAAGCnG 


960 


40 


CTTATCCACG 


GGGAGTG 










977 



(2) INFORMATION FOR SEQ ID NO: 536: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

so ^^i> SEQUENCE DESCRIPTION: SEQ ID NO: 536: 

AGACAGTGAT yGaATTTCAT TTACAGTACA CAAATCATCG AAAAATTGGT AACATTCTTC 60 
TCTATTTTCT AACGTTAgwA TTgCATCAAA CAATTC\TCT AACGCTGCAC CTCGTAATTT 120 
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AATCTATTAA AGTGTAGCGa TTTATATTTT ATTAAATCTG AATCGTTACT TTATTTAATT 24 0 

rrATGCTAAT CCAGCGCGTT CGAAAATAGT GTCAACTTGa TTCAAATGAT GTTTAGGATC 300 

5 

GAAACATTCA TCCAATTCTT CTTTTGTTAA AACACTTGTA ATAGACTCAT CTTGTTCGAT 3 60 

TAATTCACGG AACGGTGTTT TCGTTTCCCA AGATATCATC GCTTTTGGTT GTA C TTTGTC 4 20 

GTATGCTTCT TCACGAACCA TACCTTTATT AATTAATGCT AATAAGACAC GTTGTGAGAA 4 80 

10 

AATCAGACCA AATGTTTTAT CTATGTTATT ACGCATATTA TCTTCAAATA CAGTTAAACG 54 0 

GTCCACAATA TTTGTGAACG ATTCAATGCA TAATCTAGTG CtATTGTAAC ATCTGGTAAC 600 

,5 ATAATACGCT CAGCAGAAGA ATGAGAAATA TsT C TTTCAT GCCATAATGG CACATTCTCA 660 

TAAGCTGTAG TAATATAACC ACGAATGACT CTTGAAATAC CTGTGATATT TTCAGAACCA 72 0 

ATTGGATTTC GTTTATGAGG CATTGCAGAT GAACCTTTTT GGCCTTTTGC AAATGCTTCT 780 

20 TCAACTTCTC TCGTTTCX^GT TTTTTGAAGG TTACGTATTT CAACGGCAAA TTTTTCTAGT 84 0 

GATGTCGCGA TTAATGCTAA TGTCGCAATA TAGTATGCAT GTCGATCGCG TTGCAATGTT 900 

TGCGTTGATA CAGGCGCTGT GCCAATACCT AAATGTTTAC ACACATAACT TTCTATTTCA 960 

GGAGGAATGT TAGCAAAAGT ACCTACTGCA CCACTCATTT TCCCTACTTC AATTTCTTCT 1020 

CTTACTTGTT TGAAACGTTG TAAGTTACGT TGCATTTCCG TGTACCACAA TGCCATTTTG 1080 

ACACCAAATG TAGTTGGTTC TGCATGCACT CCATGTGTAC GTCCCATCAT CAATGTATAT 114 0 

30 

TTATAATTTT TTGCTTTTTC AGCTAAAACG TCXIATAAATC- TTTCTAAATC TTTTTCAATA 12 00 

ATGTCATTTG CTTGTTTAAT AACGAAACTT AAAGCTGTAT CTACAACATC AGTAGAAGTT 1260 

AAACCATAAT GTACcACTTA CGTTCTTCAC CTAGCGTTTC AGAAACTTGT CTAGTAAAGG 1320 

35 

CTACAACATC ATGGCGCGTT TCTTGTTCAA TTTCTTGTGC ACGTTCGACA TTTACCTTTG 13 80 

CGTTTTGACG AATTTTTTGT ACGTCAGCTT TCGGTATATG TCCTAATTCA CTCCATGCTT 1440 
^ (2) INFORMATION FOR SEQ ID NO: 537: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
^5 (D) TOPOLOGY: linear 



so 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537: 
GATAAATCTA TnCAGTTTCC GTCCAAATAT CtGCaCCTAA AGCTTTTAaG TGTTCTACAA 60 
TATCTgTATA ACCTCTATAA ATATGTTTAA CATTGTA/AT TGTAGTTACA CCCTCAGCAA 120 
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CATGTAATGT TGATGGTTTT ATCGTTGCTG TGCCTTCGTC AACTTCAATA TTTGCACCCA 
TGCGCTTTAA TTCTTCAACA TGTTTAAAAC GCTCCGGATA AATCGTATCA GTTACAAATG 
AAGGACCATT TGCCATAAAT AATAATGGTG TAATAGGCTG TTGCAAATCA GTAGCAAAAC 
CTGGATATAC TAGTGTTTTA ATATCAACAA ATTGATATGG CGCATTATTA TTGATGCGAA 
TTCTTTCGTC TCTTACATCA ACATTCACAC* CTAATTCACT AAATTTAGCA GTTAATGTTT 
CTACATGTTT CGGAACAATA TTATTTAATA TAACATTTTC TCCACATGCT GCAGCGATAC 
ACATATATGT GCCTGCTTCA ATTCTATCAG GTATAACTTG ATACTCAGAA CCATGTAATT 
CTTTGACGCC ATTGATTTTA ATTGTTGATG TACCCGCTCC CTTAATATTA GCTCCCATAC 
TTGTTAAGAA GTTAGCAACA TCAACTACTT CCGGTTCTTT AGCAGCATTT TCAATTACAG 
TTTGTCCTGT TGCATAAACT GCAGCTAGCA TAATGtnAAT TGTTGCACCT ACGCnAACCA 
TATC 

(2) INFORMATION FOR SEQ ID NO: 538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538: 
CAATCTCCAC CAATGACACG TCGTCATTCG TGACCTCATA CCArACAAAA AACAGTCTCG 
CAATCAAGAC TGTTTTCCAC TCAATATATT CATCCATTAG CGTAATAGAT TATTTGACTT 
CTGTAGCTAC AAAGaTTTTA CGTTTTTCCC AAACGCCTGT cTTTTCATTG TAATCATCAC 
AAGTAATTAA TGTTAATTGT TTATCTTTAC CTTTTTGTTC ATCTAGAAcT CCTACATCTG 
TAGGCTTAAC ATCTCTTATA CTTGTCATTT TATACTTACG TGTTTCATTA CCAACTTTAA 
AGTACACCAT ACTACCTTTT TTGGCTGCTT TAAGATTTGT AAATTGATAG TTCGGACGGT 
CAATGAAAGT GTGTCCTGCA ATTGAAATAT TTTGATCATC TAGTGATTCA TTTTCTTCTG 
CAAAGCTTAC ACCTCTATTT AATTGTTCAG GTGTTGCTGG TCCTGGATAT ACTGGTTCTT 
TAATATCAGC ATCTGGAATT TCAATATAGC CTGCCACTTT CGATTTATCT TTCGGAATTT 
GAGGTTTAGC TTGCTGCTTT TTATCTTTAC TCGCCTGTTC TTTTACATTT TTATCATATT 
GTTCAATCTT TTCATCTTTA TCTTTATCGT GAAGATAATT ATCGATATGT GGTTTAGCAA 
ACAAATATGC TGCCACTAGG aTAAGTACCA CACCAGCGAT TGTCATTAAT CGATTTGTCC 
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50 



AGTATACCAT TAATTTCAAA ATGACTCATA GCAATTCATT TTATACTATA AAATTTACAT 84 0 

GTATACTTTT ACGTTAGATT TCATTACACA TATTTGCATT CAAATAACGA AACGCTTTTA 900 

ATAATTACTA AGGGGGAATT GATATGATTA GATACGCTAA AAAAGAGGAT TTAAACGCTA 960 

TATTAGCGAT ATACAATGAT GCCATTATCA ATACTACAGC TGTTTATACT TATGAACCAC 1020 

AAACCATAGA CGAACGTGTC GCATGGTTTG AAACGAAACA ACGTAAGCAT GAGCCTATCT 1080 

TTGTATTTGA GGAAAATGGA AGTGTCTTAG GGTTTGCGAC GTTCGGTTCA TTTAGACCTT 114 0 

GGCCAGCATA CCTATATACA ATCX3AACATT CTATTTATGT CGATGCTTCA GCTAGAGGAA 12 00 

AAGGTATTGC TAGTCAATTA CTACACCATT TAATTGTGGA AGCAAAAGCT AAAGGTTATC 1260 

GTGCGCTAGT TGCAGGCATT GATGCTTCCA ACAAAGCGAG TATTCAGTTG CATCAAAAGT 1320 

TTGCTTTTAA GCATGCCGGC ACACTGACCA ATGTAGGTTT TAAATTTAAT AGATGGTTAG 13 80 

ATTTAGCATT TTACGAATTA GATTTACAAG ACTAGTAATG TTTGAATCAC ATAATATAAA 144 0 

CAAGACAACC ATGTTAATTC CCTTAACATA ACAAGCCAAC ATATAAAATT TTAAACTTCT 1500 

CAGGGGAGTG GGACAGAAAT GATAAAGAGC CACTAATGAT TTATTATGTA GTGGTTCTTA 156 0 

CACATTAGCC ACAGCTAATG TGTAcTTAAA AATAGGAATA CATGAGTAAA ACTCATGCAT 162 0 

AAGAAATACT AATTTCTATA GAAAAAGTAT TTCTTTATCG TCGTCCCACC CCAACTCGCA 168 0 

CATTATTGTA AGCTGACTTT TCGTCAgCTT cTGTGTTGGG GCCCAAAAAG CTTGTTACAA 174 0 

GCGCATTTTC GTTCAGTCAA CTACTGCCAA TATAACTTTG TAGAGCATAT TACATTGATT 1800 

TACATTGTCC CTTTTATTTA TTCTTTTCAA ATACTATCCC CATAGCTTTG ATTTAACGCT 18^0 .. 

TTTTCTCAAT AACAAAACGA ATATAGTAGA ACATGAAAAC GATAATCATG CTGAGCGATA 1920 

AAGATTTAAA TAATAGATTG ACCCACGTTC CCTCAGTCGT ATATCCATAT GTAATCGTTG 1980 

TGTTAATGAT GAATGCTATA AAGATGATTG ATAGTCTTAG CATATCATCA CTCCTTTTAA 204 0 

GTTATTTTAG ATATACGGGG GCGCTTTTGC AATCACTATT TTGATTAGTA TGCATTTTCC 2100 

ATAAATCTTT CAACTTCTTC AGAGATAATT AAGAAGCATC TATCTGGTAC TAATGATCCA 2160 

GACAGATGCT TCTTTTTTAT CAATATTTTA TTGTTATCTC ATTAATTATT TTTAACCATA 2220 

TCTTCAGCTG TGCCAAAGAT TTTACGTTTA ATTGCTTCGC CAGTTGGTGT GCCTGCTAGT 2280 

CCACCCAATC CAGTTTCACG TAATGATGCA GGAAGGTTAC GACCAACCTT ATCCATTGCT 234 0 

TCAATAACTT CATCAACAGG GATTCTACTT TCAATACCTG CTAATGCTAA ATCTGCTGAA 24 00 

ATTAAAGCGT TACCCGAACC AATTGCATTT CTCATAACAC AAGGAATTTC AACAAGTCCG 24 60 

GCTACTGGAT CACAAACTAA ACCTAATAAA TTACTTATCG CTAATGCCAT AGCGTGCCCG 2520 
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GAACCAACTT 


CAGnTTGGCA 


GCCACCTGTT 


GCACCAGCTA 


CACTTGCATT 


\J X X X v>v« 




ACACGCCCAA 


ACAATGCTGA 


AGTGAATAAG 


AAATCAATCA 


TTTGCTCTTC 


TGTTAAATCA 

X \ J X X w^^^^^ X V»^l 






CTAATTTAAA 


AAGTGCACCG 


GGAATGGTAC 


CCGAGGAACC 


XVJw X W X A 




GTtGCACAAA 


TAATACCCAT 


CGCAGCATTG 


ACTTCATTTG 


TTGCAATGGC 


AcCtTTGcTG 




CGTCAATCAT 


TTCATATCCA 


GACAAAGCAT 


GATGTGTTTC 


ATTATAATCA 

• • X X «^ X X Va^l 




^ o o u 


CAGCATCATG 


ACCAGTGTAG 


CCCGTTACAC 


TTTCAACCCC 


AT CACCTGTC 


WX^arVvV^X X X 




TTACTGCGTC 


TCGCATGACA 


TCTAAATTTT 


GTTTCATTTG 


CG CTCGCAOT 


TCATCACGTG 

X ^^%X X w 


3000 


ATTTACCG CT 


TAATT C CATT 


TCTTCTTTAA 


CCATGATATC 


CGCAAATGAC 


^%X Ax X^^X XXX 


3 060 


CTACGGCATA 


ATCTATAGTC 


T CTCTAATTG 


AATCAAACAT 


G*i"l*l*AXT C CC 

wX X AX^X X wWV» 


X ^ XxvAX X X 


J x^ V 


ATATAGGAAA 


CGTTTACGTC 


ACTGTATTTC 


X X X A ^^^^ X X ^7 


1 "1" f & &*T*ff'P 


X Otn X X ^ X V3 AV9 


1 ft n 
J X o u 


ATTG CTTT AT 


TTAATGGTAT 


TACAACCAAG 


PATT**T*A*Ff'**T*T 

Vn'^ XXX <■* X W X X 


^■ai'^ X w X X W A A 


AAT A A ATTO A 


7 "5 A n 

J 4& *s u 


TCTTTACAGT 


CTAATTTCAT 


ATCGTTGATA 


TCATTGATGA 


AATGATTTAC: 


X XVjXvJ^X X Xrt 


J o u u 


GTCATATTTC 


CGTCAACAAC 


TAAAATTGGT 


AATCCATGAT 


TTAAATCTAC 


X X Vb. X aw X Vv^wiA 


J 3 o U 


TTTATATGAA 


TACCTTTAAT 


TTTAATTGTA 


CCACCACCGA 

W Svi4^Vr WS^f^ 


TTGAAATACC 


GATAATTTCA 

w A X XXX 


.3 *t ^ V 


ATGTAGCGAC 


CATCATTACG 


AGATGATTTG 


ATATAAGCAC 


r^AJ XXX ^^^3X% X ^7 


TTGAPCAATA 
X X vVAv«^«^v\ x.n 


J 1 o u 


CTAT CGC CTT 


CTTCTTCGAT 


GATATCTATT 


TT AAT AC CAT 




AAX X X\>Xn/\X 




GAAGATTTAA 


TTCGGTTATC 


AAATGTTGAA 


TATCCCATTG 


GTCCACCCAC 


AATAGCGACA 


3600 


TCTGTACCAT 


GTCCTTGGTG 


TGTTTGAGCA 


AATGATTCAT 


AATAATGTAT 


TTCAATATTT 


3660 


TTATATCTCC 


CAATATTGCG 


CGTGCTGAAT 


TCCCCTTTAC 


TGCACCAGCC 


GTATGAGAAC 


3720 


TTGAAGGGCC 


CAT 










3733 



(2) INFORMATION FOR SEQ ID NO: 539: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) StRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 
TGGCTGTCTT CTCTATGAGT GTAGTAAGTA AGTTAACGGA TTTAACGCCA AGGCAAATAC 60 
GTTACTATGA AACACATGAA CTCATCAAAC CTGAAAGAAC AGAAGGTCAA AAACGTCTGT 120 
TCTCACTCAA TGATTTGGAA AGATTACTAG AAATTAAATC ATTATTAGAA AAAGGATTTA 180 

55 
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AAGAGATAAG AAAAAAGATG ATTGTAGATG CCACGCAAAA GCCTATTGGA GArACTTTGC 300 

CAATAAATCG TGGTGATTTA TCCCGATTTA TTAAATAAAA TTTGGAGGAT TTTAAAATGC 360 

CAAAACGTAC TTTCACTAAA GACGACATTC GTAAATTTGC AGAaGAGGAA AaTGTAaGaT 420 

ATTTAAGATT ACAATTCACT GATATTTTAG GAACAATTAA AAATGTTGAA GTGCCTGTAA 4 80 

GCCAATTAGA AAAAGTACTT GATAACGAAA TGATGTTTGA CGGTA 525 



10 

(2) INFORMATION FOR SEQ ID NO: 540: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 08 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540: 

TTGATTTGCT ACAAAGTATC TnCTCATTTC TGTATCCTGA AAAATCTTTA GTGTAATAAT 60 

GTTGTTCAGT TTTAATATTT TCAGTCATAG TTGACTACCT CCGTATATTT TGATTTAATT 120 

AAGTTGTATA TTTTGATGAA CACTTATTGT TACTTGTTGG CGCAAGTAGC AGTTTTTTCA 180 

TTCTTCATAA AAGTATTCCT TATAGAATAT GAATGTTGCG ATACTTGCGA ATCCTGCAAT 240 

TGaCCATGCT GtAGTGAAGT ATAGAAACGG CATAAGTACA ATCGCTAAGA CTGTGAAGCA 300 

30 

TAGTACTGCT ACTAGGTAGC TTTTATAAAT GTTACTCATT TTCTTTTTTC AACTCCTCCA 3 60 

TTATTCTCTG GTCTGATAAG TCGTGATAAG GGAATTTTTT CCtAGCTAAT TGGACTGGTA 420 

TTCTGCCTCG TATCGCAATG TATCCTTCAT CTTCAAGCTC TTTATTCAGT TCTCTTATTA 4 80 

35 

TTTGTCCTGC TTTGGATTTA GAAACAGATA AAATTACCgC AAGTTCTTTA GCTTGCAAAC 540 

TATTTTTCAT CATATCTTTT CCTCCTTTAA AATAACTGTT GATTCTCTGG GTTATCTGCT 600 

TCGTAATTAT CTGCAATAAT ACTTTTAGCG AAAAAGTCCA AACTGACCTT ATATAGGTTG 660 

TTCATAGATr TCTTTACGTT AACCCCTTCC TCAAGTACAT AAGGCACCCT AAAATCATTT 720 

ATAAACAGTC CGTITTCGTC TAAAGTAACG GTTGGTAATT CAGGTTTGTT CCGTCTATAA 780 

-iS ACTTCTCCTA GTGTAGGTTT TTGCTTTTCA GCTTGTTTAG TGAAGTCGGA AAATGCCTTA 840 

AGTAGTTTTA TTCCTGAATC AGGATCACTG TGTCGCTCAA TCGTTTCTGC TGTAGACTCT 900 

TTACTAAAAT CATTTCTATT GATTACAGGC TTTCTCGTAT TTCGTTCAAT CTTCCAAACC 960 

TTCCACGTCA CAACTGCCAT TGTGGTGAGG AGGGTTGTTT TGTATAGTGC GTTCATTTGT 1020 

AATTCCTCCT ATTAAGTTGT TTGTTCAATT GTGTGTGTTA TTCTTCTTCG TCTAAATCAA 1080 

55 



1475 



EP 0 786 519 A2 



10 



15 



CGACTTCGTA AGTGTGCTCA ATCTCGCCTG CATATGTCAC AGTAAGAGTA TCTTTGTCTG 

TGTATGTTTG ACTTTTGTTy TCTtTAACTG CATAAAGTGT TAATACTATA TTGTTTAGCT 

TTyCTTTTTG TTCTGGTGTC ATTTACGCTC CCCCTAmATT AGCyTCATAA CCGAATTCAG 

TCATGATTTC ATGTATTTTC AATCTGCCTT TTTGTGTCCA TCTAGTTTGT AAAACTGTGT 

CTTCTCTGCC ATCAGAACGC ACAATTGT 

(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 4 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



1200 
1260 
1320 
1380 
1408 



20 



25 



30 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541: 
GTTCGTCAAT GATTTTTATC CGATGCTATG AGCATTaTCA AATACAAAAT GCTCTCTTAA 
AAGCAGTTAT TGACTGAAAA TCTACTTCTA AGAGAGCACT TTATTTAATT ACTTAAGAAA 
TCTTGAAATT TCAATATACG ATGTTTATGA TAAGTCGCTT ATTTCATCTT TAGGCTTGTT 
ATTAGTAAGT AGTTTAATAC CACTGATTAA CCATAAAGCA AATGTAATTA TGTTACCACT 
TATTACAGCT CCAATAATCA ACAATATACC ACTCATTTTT TTGTTTTTAG ATGCTTTAAA 
CATACCGATT GCACCTAAAA TAATTGAAAT GATTCCAAAT ATGAATAGGG ATAAGAATAA 
TACAGTGAAA ATTGCTGCTG CTGTTTCTGc ATCAACTGGG nCAACCTCAC CATTAACTGT 

tgttggacac at 

(2) information for seq id no: 542: 

(i) sequence characteristics: 

(A) LENGTH: 2426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
432 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 

ATAATCATGA AGTnGCTAAA nCGCCAAATA ATGATGGTTC TGGACATGTT GTGTTAAATA 60 

AATTCCTTTC AAATGAAGAG AATCAAAGCC ATAGTAATCG ACTCACTGAT AAATTACArG 120 

GAAGCGATAA AATTAATCAT GCTATGATTG AAAAATTAGC TAAAAGTAAT GCCTCAACGC 180 

AACATTACAC ATATCATAAA CTGAATACGT TACAATCTTT AGATCAACGT ATTGCAAATA 24 0 
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GTATAAAAAG TCAACGAAAT ATTATTTTGG AAGAACTTGC ACGTACTGAT GATAAAAAGT 
ATGCTACACA AAGCATTTTA GAAAGTATAT TTAATAAAGA CGAGGCAGTT AAAATTCTAA 
AAGATATACG TGTTGATGGT AAAACAGATC AACAAAITGC AGATCAAATT ACTCGTCATA 
TTGATCAATT ATCTCTGACA ACGAGTGATG ATTTATTAAC GTCATTGATT GATCAATCAC 
AAGATAAGTC GCTATTGATT TCTCAAATTT TACAAACGAA ATTAGGAAAA GCTGAAGCAG 
ATAAATTGGC TAAAGATTGG ACGAATAAAG GATTATCAAA TCGCCAAATC GTTGACCAAT 
TGAAGAAACA TTTTGCATCA ACTGGCGACA CGTCTTCAGA TGATATATTA AAAGCAATTT 
TGAATAATGC CAAAGATAAA AAACAAGCAA TTGAAACGAT TTTAGCAACA CGTATAGAAA 
GACAAAAGGC AAAATTACTG GCAGATTTAA TTACTAAAAT AGrAACAGAT CAAAATAAAA 
TTTTTAATTT AGTTAAATCG GCATTGAATG GTAAAGCGGA TGATTTATTG AATTTACAAA 
AGAGACTCAA TCAAACGAAA AAAGATATAG ATTATATTTT ATCACCAATA GTAAATCGTC 
CAAGTTTACT AGATCGATTG AATAAAAATG GGAAAACGAC AGATTTAAAT AAGTTAGCAA 
ATTTAATGAA TCAAGGATCA GATTTATTAG ACAGTATTCC AGATATACCC ACACCAAAGC 
CAGAAAaCGt TAACACTTGG TAAAGGTAAT GGATTGTTAA GTGGATTATT AAATGCTGAT 
GGTAATGTAT CTTTGCCTAA AGCGGGGGAA ACGATAAAAG AACATTGGTT GCCXyVTATCT 
GTAATTGTTG GTGCAATGGG TGTACTAATG ATTTGGTTAT CACGACGCAA TAAGTTGAAA 
AATAAAGCAT AATTATATTG GGGGAAGAGC ATCTATATAT TTTTTTAAGT ATATAAGACG 
TCTTATTTCC CCTTAATTTA TTGTGAAGTA TATGCAAAAT GCAATGAATA GATTGTCCAT 
CATTTTAACG TTATAATGAA TTTAACGACT TAGAACTACA CAAGTAAAGG AGAATGAAGA 
TGTCTCGAAA AACGGCGCTA TTAGTTTTGG ATATGCAAGA AGGTATAGCG AGTAGTGTAC 
CTAGAATAAA AAATATTATT AAAGCGAATC AGAGAGCAAT TGAAGCAGCA AGACAACATC 
GAATACCAGT CATTTTCATA CGTTTAGTGT TAGATAAGCA TTTTAATGAT GTCTCXTCGA 
GTAATAAAGT GTTTTCAACA ATTAAAGCTC AAGGATATGC GATTACTGAA GCAGATGCAT 
CTACACGAAT ACTTGAAGAT TTAGCACCAC TAGAAGATGA GCCGATTATT TCTAAGCXy^C 
GCTTTAGCGC ATTTACAGGT AGTTACTTGG AAGTTTATTT ACXTTGCAAAT GATATTAATC 
ATTTAGTATT AACGGGTGTC TCTACAAGTG GAGCTGTATT GAGCACGGCA TTAGAAAGTG 
TAGATAAAGA CTATTATATT ACTGTTTTAG AAGATGCTGT TGGTGATAGA TCAGATGATA 
AACATGACTT TATTATTGAA CAAATTTTAT CACGCTCATG TGACATTGAA TCCGTAGAGT 
CATGGAAAAG TAGTTTATAG TTAATATAAC GTCAATTAAA GCTCGGCAGT AATGTTT G AG 
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GAGGAACATT TGAACATAAA ATAATATATT TATATAAAAC GACCgAGGCG TTCGAACTGA 
ATGtCCTCGG GTTTAATTGA ATAGAAATCG GACTTATGAA CGAAATATGT TTAAGTCGAA 
CTCCTTGTTT ATACTTATAA ATTTTACGGG TTTAATATAA TACTTATTTA CCTGTAATAT 
ATGCATAATT nCTTCAGTCG GTCAGCCTGT CGTTGCATAG TTCCTATGCA GCAAATGCAT 
ATCCTAATCC TTTAACATTG GCATTnCTGC AAATGAACGC ATAGAATCCA TTCACTGTTA 
ACTTTTTnCA ACAAATGTCT nACATG 
(2) INFORMATION FOR SEQ ID NO: 543: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1874 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2160 
2220 
2280 
2340 
2400 
2426 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543: 

GAGTTGGGGA ATGTGCTCAA AATATGCGGA CTTTATGCAT TyCGGAATTG sCCaATTGCA 60. 

GCTTTAAGCT ATGGTCAAAA AAAGAGGGTC ACTATAGCAT CTGTTTTAGT CTTAAATCCG 120 

GAAATAATCA TATTGGATGA ACCGACTGCT GGTCAAGATT TCTATCATTA TAATGAGATA 18 0 

ATGTCATTTT TAATTGAACT AAACAGACAG GGGAAGACGA TTATTATGAT TACGCATGAT 240 

ATGCATTTAT TGTCTGAGTA TAGTTCAAGA ACAGTTGTAT TATCAAAAGG TCAAGTCGTT 300 

GCTGATACCA CGCCAgTATT GGTTTTAAAT GATAAAAAAA TCTGTGAGAT TGCATCATTG 360 

AGACAAACAT CGCTATTTGA AATGGCCGAA TATATAGGGA TTAGCGAGCC ACAGAAATTA 42 0 

GTACAATTAT TTATTAACCA TGATAGGAAG GTGAGACGcC AATGAATCAA TATAATACTA 480 

TAGGTTTTCA CCCGGGAAAT AGTCGTATTC ATCAATTAAA TGCGACTOTT AAACTTTTAT 540 

TCTTATTAGT TGTTTCTATT TCTGCAATGG TGACTTATGA CACAAGATAT TTAATTTTAA 600 

TTAGTGCTTC ATCTATTTTA TTGGTCAAAT ATGCTCATAT TGAATGGAAA CAAGTTCGCT 660 

TTGTTGTTAA ATTCATTCTG TTTTTCACAA TAtTAAATAT TATTGCCGTG TACATATTTG 720 

45 ACCCTGAATA TGGTGTGAAG ATTTATAATC AGCGTACAGA GTTAGTCAAT GGTATTGGTC 780 

GATTTACGCT AACATCACAG GAATTATTCT ATCTTTTTAA TCTAATATTA AAATATATTA 84 0 

GTACAGTTCC TTTAGCGTTA ATATTTTTAT TCACAACGAA TCCGAGTCAT TTTGCTGCAA 900 

GTTTAAATCA GCTAGGTGTG AATTATAAAA TCAGTTACGC AGTCTCACTA GCATTAAGGT 960 

ATATTCCAGA TATTCAAGAA ACATATTttA ATATTTCACA AGCGCAACAA GCAAGAGGAT 1020 



55 
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TACCTTTAAT ATTTTCTAGT 
GACAATTCGG ACAGTATAAA 
5 ATTATGTTGT TTTGTGTTTG 

TAAATAATAG TCGATATTTC 
GAAATAGGAG GGAGACATTT 

JO 

TCCTCGCATT TAATTGAACT 
CCCAAAGGTT TACATAATTC 
CATTGGCAAC AATATCAATT 
ATTGGTATTT GTGGATATGA 
TATAAATTAC ATCCAAGTTT 
20 ACAAATTTTG TGAATTATAT 

AACCAACGTT CAATAAATTT 
GACATTATCA ATCAAGGAGA 

25 

AAATAAAATT AAAA 1874 
(2) INFORMATION FOR SEQ ID NO: 544: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5280 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544: 



40 



TCAACATTTC 


TAACACCAAT 


GTGAAAATGA 


TCTATGTGAT 


TTGCAATGGC 


TTGATTTGTA 


60 


ATATGTGTGC 


CTAAATGACC 


TGTAGCACCT 


GTTAACATAA 


TATTCATTCA 


CTTCATCTCC 


120 


TAATCTTTAT 


ATACATAACA 


TAATACTTAT 


TTGATGGTTT 


TCAAAACATT 


TGATTTTATA 


180 


AAAAATTCTA 


ATCTGTATTT 


ATTGTCGACG 


TGTATAGTAA 


ATACGTAAAT 


ATTATTAATG 


240 


TTGAAAATGC 


CGTAATGACG 


CGTTTTAGTT 


GATGTGTATC 


ACTAATATCA 


TTGAAAATTT 


300 


TAATCaGGTA 


CTACGACAAT 


ATGATGTCTG 


TTTTGTGTCT 


GAAAGTTTTA 


CAGTTTTTAA 


360 


AATAAAAATG 


GTATAAAGTG 


TGATTTGTAT 


AAAAAAGAGT 


CTCGACGGAT 


AAGAATTGAT 


420 


TAATAACAGT 


TAGCATTTTA 


TTAATTACCT 


TAACAATGAT 


TCAAGTTTAG 


TTAAATGAGG 


480 


TTTAATTTGA AAGGOGATAG 


CGCCTCAATA 


TAATGTAGGT 


AGATTGTTCA 


TATTACGTAA 


540 



ATCGAAAGAA 


TTGACACTAT 


TAGTACTGCT 


ATGGAGTTAA 


1140 


AGGAGAACCT 


GGTACGTCAA 


AAAACAATTA 


AAAAAAGATG 


1200 


ACGTTAATAC 


TTCTGATGTT 


AGTAGTTACA 


TTATTCTTTT 


1260 


AACCCGTGGC 


ATTAGTATTC 


ATATAAATAG 


TCTTTAAATA 


1320 


AATGATAAAT 


ACTGAAAGAT 


TAAATTTAAT 


GATTCCAAGT 


1380 


TTATAATATT 


TGTAGTCATC 


CACAAGCAAA 


TATATACACT 


1440 


CAAATTAGAC 


ACACAACGGT 


GGATTGAAAA 


ATGGCGAAAC 


1500 


TGGTTACTTT 


GTATTGGTAA 


AAAAAATAGA 


TTGTAGTGTT 


1560 


ATATCGACAA 


TTAAAGCAAG 


AaACAGTATT 


AAATTTATTT 


1620 


TGAAGGACAA 


GGGTACGCAT 


GTGAGGCTAT 


TACAGCAATC 


1680 


CGATCAAGAA 


ACAGTAAAAG 


TTATCAGGAC AAATAAGTGT 


1740 


AGCAGAAAGG 


CTTAAATTCA 


AGCGAGACGA 


TACTATGGAC 


1800 


TATTGTGTTT 


TAAnAATAAA 


ATACTATGAC 


ATTATCTAAA 


1860 
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AGCATTGGGA 


ATGTTAGCAA 


CAGGTGTAAT 


TACaTCGAAT 


GTACAATCAG 


660 




aGCAGrAGTT 


AAaCAACAAa 


GTGAaTCAGA 


GTTAAAACAC 


TATTATAATA 


720 


AmCCAATTTT 


AGAGCGTAAA 


AATGTGACTG 


GATTTAAATA 


TACTGATGAG 


GGTAAACACT 


780 


ATTTAGAAGT 


CACAGTAGGG 


CAACAGCATT 


CTCGAATCAC 


TTTACTTGGA 


TCTGATAAAG 


840 


ATAAATTTAA 


AGACGGAGAA 


AACTCAAATA 


TAGATGTGTT 


TATCCTTAGA 


GAAGGTGACA 


900 


GTAGACAAGC 


AACAAATTAC 


TCAATTGGTG 


GCGTTACAAA 


ATCAAATAGT 


GTGCAGTATA 


960 


TTGATTATAT 


CAATACGCCA 


ATTTTAGAAA 


TCAAGAAAGA 


TAATGAAGAT 


GTACTTAAAG 


1020 


ATTTTTACTA 


CATTTCAAAA 


GAAGACATCT 


CATTAAAAGA 


ACTTGATTAT 


AGATTAAGAG 


1080 


AACGTGCGAT 


TAAACAACAC 


GGCTTGTATT 


CAAATGGTCT 


TAAACAAGGT 


CAAATTACAA 


1140 


TTACAATGAA 


TGATGGCACA 


ACACATACAA 


TCGATTTAAG 


TCAAAAACTT 


GAAAAAGAAC 


1200 


GTATGGGTGA 


GTCAATCGAC 


GGCACTAAGA 


TTAATAAAAT 


TCTAGTAGAA 


ATGAAATAAT 


1260 


ACTTTCTAAC 


AACAAAGCGC 


TATGTTGAAT 


AGTGCTTGTT 


ATGGAAATAT 


ATGGAAGTTA 


1320 


AGCGACGTAC 


TGTTGCTTAG 


CTTCTTTTTT 


TGAGGGGAAA 


AGTTACAAAA 


CTCACACAAA 


1380 


CAGTCGCACC 


ACGCATTATC 


TTTTGCTTAA 


ATAGCTTAAT 


CATATTTTAT 


GAATAGTTAA 


1440 


AAACAGGTTA 


ATGTGAATAT 


CCGAATACAG 


CTCCTATAAT 


ATGGGTGTAT 


GATTCAAATT 


1500 


ACGTAATAAA 


ACAATCTAAT 


TATAATAGAT 


TGGAGCATAC 


AACTATGAAA 


ATGAAAAATA 


1560 


TTGCAAAAAT 


AAGTTTGTTA 


TTAGGAATAT 


TAGCAACAGG 


TGTAAACACT 


ACAACGGAAA 


1620 


AACCAGTTCA 


TGCCGAAAAG 


AAACCTATTG 


TAATAAGTGa 


AAATAGCAAA 


AAATTAAAAG 


1680 


CTTATTATAA 


TGAACCTAGT 


ATTGAATATA 


AAAATGTGAC 


AGGTTATATC 


AGTTTCATTC 


1740 


AACCAAGTAT 


TAAATTTATG 


AATATCATAG 


ATGGTAATTC 


TGTTAATAAT 


ATTGCTTTAA 


1800 


TTGGCAAAGA 


TAAGCAACAT 


TATCATACGG 


GTGTACATCG 


TAATCTTAAT 


ATATTTTACG 


1860 


TTAATGAGGA 


TAAGAGATTT 


GAAGGTGCAA 


AGTACTCTAT 


TGGGGGTATC 


ACGAGTGCAA 


1920 


ACGrTaAAGC 


TGTCGACCTA 


ATAGCAGAAG 


CAAGAGTTAT 


TAAAGAAGAT 


CATACTGGTG 


1980 


AATATGATTA 


TGACTTTTTC 


CCATTTAAAA 


TAGATAAAGA 


AGCGATGTCA 


TTGAAAGAGA 


2040 


TTGATTTTAA 


ATTAAGAAAA 


TACCTTATTG 


ATAATTATGG 


TCTTTACX3GT 


GAAATGAGTA 


2100 


CAGGAAAAAT 


TACAGTCAAA 


AAGAAATACT 


ATGGAAAGTA 


TACATTTGAA 


TTGGATAAAA 


2160 


AGTTACAAGA 


AGACCGTATG 


TCCGATGTTA 


TCAATGTCAC 


AGATATTGAT 


AGAATTGAAA 


2220 


TCAAAGTTAT 


AAAAGCATAA 


CACATATACT 


TGATGACGAA 


ATAAGTTGAA 


ATTGAAATAG 


2280 


AGAGGTTAAG 


TGACGATCAA 


ACGTTGCTTA 


ACTTCTTTTT 


AATGCTTAAA 


AATTATTTCA 


2340 
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TTAATAATAC TTCAATAATT GTTAAAAGGG GTTTAATGTG ATTATCTTAG AACGCCATCT 24 60 

ATAATGATGT TGTATGATTC AAATTACGTA AAAAGACAAT CGAATATAAT ATAGATTGGA 2520 

GCATACAATT ATGAAAATGA GAACAATTGC TAAAACCAGT TTAGCACTAG GGCTTTTAAC 2580 

AACAGGCGCA ATTACAGTAA CGACGCAATC GGTCAAAGCA GAAAAAATAC AATCAACTAA 264 0 

AGTTGACAAA GTACCAACGC TTAAAGCAGA GCGaTTAGCA ATGATAAACA TAACAGCAGG 2700 

TGCAAATTCA GCGACAACAC AAGCAGCTAA CACAAGACAA GAACGCACGC CTAAACTCGA 2760 

AAAGGCACCA AATACTAATG AGGAAAAAAC CTCAGCTTCC AAAATAGAAA AAATATCACA 2820 

ACCTAAACAA GAAGAGCAGA AAACGCTTAA TATATCAGCA ACGCCAGCGC CTAAACAAGA 2 8 80 

ACAATCACAA ACGACAACCG AATCCACAAC GCCGAAAACT AAAGTGACAA CACCTCCATC 294 0 

AACAAACACG CCACAACCAA TGCAATCTAC TAAATCAGAC ACACCACAAT CTCCAACCAT 3 000 

AAAACAAGCA CAAACAGATA TGACTCCTAA ATATGAAGAT TTAAGAGCGT ATTATACAAA 3060 

ACCGAGTTTT GAATTTGAAA AGCAGTTTGG ATTTATGCTC AAACCATGGA CGACGGTTAG 3120 

GTTTATGAAT GTTATTCCAA ATAGGTTCAT CTATAAAATA GCTTTAGTTG GAAAAGATGA 3180 

GAAAAAATAT AAAGATGGAC CTTACGATAA TATCGATGTA TTTATCGTTT TAGAAGACAA 324 0 

TAAATATCAA TTGAAAAAAT ATTCTGTCGG TGGCATCACG AAGACTAATA GTAAAAAAGT 3300 

TAATCACAAA GTAGAATTAA GCATTACTAA AAAAGATAAT CAAGGTATGA TTTCACGCGA 3360 

TGTTTCAGAA TACATGATTA CTAAGGAAGA GATTTCCTTG AAAGAGCTTG ATTTTAAATT 34 20 

GAGAAAACAA CTTATTGAAA AACATAATCT TTACGGTAAC ATGGGTTCAG GAACAATCGT 34 80 

35 TATTAAAATG AAAAACGGTG GGAAATATAC GTTTGAATTA CACAAAAAAC TGCAAGAGCA 354 0 

TCGTATGGCA GACGTCATAG ATGGCACTAA TATTGATAAC ATTGAAGTGA ATATAAAATA 3600 

ATCATGACAT TCTCTAAATA GAAGCTGTCA TCGGAAAAAC AAGAAGTTAA GTGACAACGG 3660 

TTTACATGTT GCTTAGCTTC TTTTATTATG CGTAATGATG TAAAAAGACG AATATTCATT 3720 

TGTTTGTAAA AGTGGCATTT CTATGTCTTA AAAGTGACGA AACTTCAAAT GTGCCAAGTG 3780 

TTGAATCACA TCAAAATCAT TTTTATTTAA CGAACATTAT GGATTTCTTA ATTTACTTAA 3840 

CGATGATTCA AATATAGTTA AACAAGGTTT AATGTGAATG GAGCAATACX5 CCATCTATAA 3 900 

TAAAGCTGTA TGATTCAATG AATGTAATCG AACAAATCTA ATAATTACGA ATGGAGCATA 3960 

so CAACTATGAA AATAACAACG ATTGCTAAAA CAAGTTTAGC ACTAGGCCTT TTAACAACAG 4020 

GTGTAATCAC AACXSACAACG CAAGCAGCAA ACGCGACAAC ACTATCTTCC ACTAAAGTGG 4 080 

AAGCACCACA ATCAACACCG CCCTCAACTA AAATAGAAGC ACCGCAATCA AAACCAAACG 414 0 
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CGCCTTCAAC TAAAGTGACA ACACCTCCAT CAACAAACAC GCCACAACCA ATGCAATCTA 4260 

CTAAATCAGA CACACCACAA TCGCCAACCA CAAAACAAGT ACCAACAGAA ATAAATCCTA 4320 

AATTTAAAGA TTTAAGAGCG TATTATACGA AACCAAGTTT AGAATTTAAA AATGAGATTG 43 80 

GTATTATTTT AAAAAAATGG ACGACAATAA GATTTATGAA TGTTGTCCCA GATTATTTCA 4440 

TATATAAAAT TGCTTTAGTT GGTAAAGATG ATAAAAAATA TGGTGAAGGA GTACATAGGA 4 500 

ATGTCGATGT ATTTGTCGTT TTAGAAGAAA ATAATTACAA TCTGGAAAAA TATTCTGTCG 4 560 

GTGGTATCAC AAAGAGTAAT AGTAAAAAAG TTGATCACAA AGCAGGAGTA AGAATTACTA 4 620 

AGGAAGATAA TAAAGGTACA ATCTCTCATG ATGTTTCAGA ATTCAAGATT ACTAAAGAAC 4680 

AGATTTCCTT GAAAGAACTT GATTTTAAAT TGAGAAAACA ACTTATTGAA AAAAATAATC 474 0 

TGTACGGTAA CGTTGGTTCA GGTAAAATTG TTATTAAAAT GAAAAACGGT GGAAAGTACA 4800 

20 CGTTTGAATT GCACAAAAAA TTACAAGAAA ATCGCATGGC AGATGTCATA GATGGCACTA 4 86 0 

ATATTGATAA CATTGAAGTG AATATAAAAT AATCATGACA TTCTCTAAAT AGAAGCTGTC 4 920 

ATCGGAAAAA CAAGAAGTTA AGTGACAACG GCCTACATGT TGCTTAGCTT CTTTTGTTAT 4 980 

GTTCGATGAT TTGAGAACCC GAATTTTCGA TGGGTCCAAA TATGACGTGG AAGAGACCTG 504 0 

AATTTATCTG TAAATCCCTA TCTATCGGGT GTGAAGCACA ACGGGATCAG TTTTATTTAA 5100 

CGAACATTAT AGATTCCTTA ATTTACTTAA TAATGATTCA ATGATTATTA AACATGGTTT 5X60 

AATGTGAAAG GTCAAATACG CTAACTATAA TAAAGCTGTA TGATTCAATA GACGTAAGCG 5220 

AACAAATCTA ATAATTACGA ATGGAGCATA CAACTATGAA AATGACAGCA ATTGCGAAAG 5280 
35 (2) INFORMATION FOR SEQ ID NO: 545: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 886 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



2S 



30 



4S 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545: 

AGTAAAATTG CCGGTATGAT GGACACAAAC GGTGACCTTG GTCAAGGTGA ATTAGCGATT 60 

AATCCACCtA AATCAGATTT gAACGAATTA CCTTGGGCTA CACGTAAAAA TAAACAGCCA 120 

GCTTCATCCG AAAAAgGTTC AAGTGGTCAT CATGGTAATG CAGCAATGCC TCAAACCAmA 180 

TTAGATTATC AAATATCTAT TGATAAGGTC GTTGAACAGG CGCAAAAAGC TGGTATTAAA 240 

AAGCCGTTTT CAATCGTATA TCCAAGTGAT AAAAATGGTA CCTTTATTGT ATCTAATACT 300 
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GATCAATATA GCGGTAAAAA GCTAGGTACG ATTAAATATG ATGACTACGG TATTATTGCT 420 

AAATGGTTTA CATGGGGCAT TCCGCTTCAC GAAGGTCATT TATTCGGCAT TTTAAATAAA 480 

ATCATTAATT TATTTGTATG TATCGCTTTA TTAGTAGCCA TTGGCATGGG GTTTGTCTCT 54 0 

TGGATAAAGC GTACAAAAAA TACTGCAGTA AAAGTACCAC ATCGCGTAAA AAAACCAGCA 600 

TCTATATCAC TCATAATATG TTTAATTGTA TTAGGATTAT TAATGCCATT ATTTGGATTA 660 

TCACTTATCC TTGTATTTAT AATTGAATTA ATATTATATA TTAAAGATCG TCGTGCTAAA 720 

CAATAATGCA CTTAAAGTTT TGAACTGACG AAATTTACAA AATGgATTCT CGTCTCTCTA 780 

ATTACtTAAA ACGGGgTtCy AaTAATAAAT CgTACTGaTG GgAAAGTTTT TACTTTTTAt 840 

CTGtCCGAtT TTTTnGAAwT TGAAGATAAA AAAGCATCTA AAACGC 886 
(2) INFORMATION FOR SEQ ID NO: 546: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; double 

(D) TOPOLOGY: linear 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546: 

. GGCATTTGTG TCCTTATATA AGGAACTGTG tTAAATACAT TACTGTTGTT AAGTTGTTTT 60 

30 

TGTAATTCAA AGAGCAGAAC AGAGTAACAT CATCAGTTGT AGTAAACGAT AATCCGGTAA 120 

AACAACTAAA TGAAATAATG AAAGTCATTT AACCTGAACA TTAAAATATA TTTGTTTTTC 180 

35 ATTAAGAATA ATTCAAGTAT ATTTAAATCG AGGTTAATTA TCGTATGAAA CGATGCACGT 240 

TATAATAAAA ATGTATGATT CAAATTACGT AATGAAAACA ATCCAATATA TTAAGATTGG 300 

AGCAAATAAA TATGAAATTT ACAGCATTAG CAAAAGCGAC ATTAGCTTTA GGAATTTTAA 360 

CAACAGGAAC TTTAACAACA GAAGTTCATT CAGGTCATGC AAAACAAAAT CAAAAGTCAG 420 

TAAATAAACA TGACAAGGAA GCATTATACC GATACTACAC TGGAAAGACT ATGGAAATGA 4 80 

AAAATATTAG TGCTTTGAAA CATGGTAAAA ACAACTTACG TTTTAAGTTT AGAGGTATTA 54 0 

AGATTCAAGT TTTACTGCCT GGAAATGATA AAAGTAAATT TCAACAGCGT AGTTATGAGG 600 

GGTTAGATGT TTTCTTTGTT CAAGAAAAAA GAGATAAGCA CGATATATTT TATACTGTTG 660 

GTGGTGTAAT ACAGAATAAT AAAACATCTG GAGTTGTCAG TGCACCAATA TTAAATATTT 720 

CAAAAGAAAA GGGTGAAGAT GCTTTTGTGA AAGGTTACCC TTATTACATT AAAAAAGAAA 780 

AAATAACACT AAAAGAACTG GATTATAAGT TGAGAAAGCA TCTAATTGAA AAATACGGAC 84 0 
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ATAACCTTGA 


TTTAAGATCT 


AAATTAAAAT 


TTAAATATAT 


GGGGGAAGTC 


ATAGAAAGCA 


960 


AACAAATTAA 


AGATATTGAA 


GTTAACTTAA 


AGTAAATCAT 


TACGAATAAT 


TAAAAGTAAT 


Z020 


TGAAGCGGCT 


TAACGGTGAA 


ATGTAAATTG 


GTGCGCATAG 


CTTATACAAA 


AAGGATGCAT 


1080 


CAATCGATAT 


CGTCGTTAAG 


CCGTTTTGGT 


TTGTGTGTCA 


TGAATCCTAT 


CCCAATCTCC 


1140 


ATAAAGGTAA 


AATTTCCACC 


ACCAACATCA 


AAATTCTCCA 


CATCGCAACA 


TAACCAAATG 


1200 


TTATAATAAA 


TCTATTACAC 


AAAGAGATAA 


ATTACTTATT 


CAAAGGCGGA 


GGAATCACAT 


1260 


GTCTATTACT 


GAAAAACAAC 


GTCAGCAACA 


AGCTGAATTA 


CATAAAAAAT 


TATGGTCGAT 


1320 


TGCGAATGAT 


TTAAGAGGGA 


ATATGGATGC 


GAGTGAATTC 


CGTAATTACA 


TTTTAGGCTT 


1380 


GATTTTCTAT 


CGCTTCTTAT 


CTGAAAAAGC 


GGAACAAGAA 


TATGCAGATG 


CCTTGTCAGG 


1440 


TGAAGACATC 


ACGTATCAAG 


AAGCATGGGC 


AGACGAAGAA 


TACCGTGAAG 


ACTTAAAAGC 


1500 


AGAATTAATT 


GACCAAGTCG 


GTTACTTCAT 


TGAGCCAGAA 


GATTTATTCA 


GTGCGATGAT 


1560 


TCGTGAAATT 


GAAACGCAAG 


ATTTCGATAT 


CGAACACCTG 


GCGACGGCAA 


TTCGTAAAGT 


1620 


TGAAACATCA 


ACATTAGGTG 


AAGAAAGTGA 


AAATGACTTT 


ATCGGTCTGT 


TCAGCGATAT 


1680 


GGATTTGAGT 


TCAACGCGAC 


TAGGTAACAA 


TGTCAAAGAA 


CGTACTGCTT 


TAATCTCTAA 


1740 


AGTCATGGTT 


AATCTTGACG 


ACTTACCATT 


CGTTCACAGT 


GACATGGAAA 


TTGATATGTT 


1800 


AGGTGATGCA 


TATGAATTCC 


TAATTGGGCG 


CTTTGCGGCG 


ACAgCGGGTA 


AAAAAGCAGG 


1860 


CGAGTTCTAT 


ACACCACAAC 


AAGTATCTAA 


GATACTGQCG 


AAGATTGTCA 


CAGACGGTAA 


1920 


AGATAAATTA 


CGTCACGTGT 


ATGACCCAAC 


ATGTGGTTCA 


GGTTCACTGT 


TGTTACGTGT 


1980 


TGGTAAAGAA 


ACACAAGTGT 


ATCGTTATTT 


CGGTCAAGAA 


CGTAACAATA 


CTACATACAA 


2040 


CTTAGCACGC 


ATGAATATGT 


TATTACATGA 


TGTGCGTTAT 


GAGAACTTCG 


ATATCCGTAA 


2100 


TGATGACACA 


TTGGAAAACC 


CAGCCTTTTT 


AGGCAATACA 


TTTGATGCGG 


TTATTGCGAA 


2160 


CCCACCGTAT 


AGTGCGAAAT 


GGACTGCAGA 


TTCAAAGTTT 


GAAAATGACG 


AACGATTCAG 


2220 


TGGTTACGGC 


AAACTTGCGC 


CTAAGTCTAA 


AGCAGACTTT 


GCCTTTATTC 


AACACATGGT 


2280 


ACATTACCTA 


GACGATGAAG 


GTACCATGGC 


CGTTGTACTC 


CCACATGGTG 


TATTATTCCG 


2340 


AGGTGCTGCA 


GAAGGTGTCA 


TTCX3TCGTTA 


TTTAATTGAA 


GAAAAGAACT 


ACTTAGAAGC 


2400 


TGTGATTGGT 


TTGCCAGCGA 


ATATTTTCTA 


TGGGACAAGT 


ATTCCAACAT 


GTATTTTAGT 


2460 


ATTTAAAAAA 


TGTCGCCAAC 


AAGACGACAA 


CGTACTATTT 


ATCX3ATGCAT 


CCAATGATTT 


2520 


TGAAAAAGGA 


AAAAATCAAA 


ATCATTTAAG 


CGATGCCCAA 


GTCGAACGTA 


TTATAGACAC 


2580 


ATATAAGCGT 


AAGGAAACAA 


TTGATAAATA 


TAGCTACAGC 


GCGACACTAC 


AAGAGATTGC 


2640 
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GATTGATTTA GATCAAGTCC AACAAGATTT GAAAAATATC GATAAAGAAA TCGCAGAAAT 2760 

TGAGCAAGAA ATCAATGCAT ACCTGAAAGA ACTTGGGGTG TTGAAAGATG AGTAATACAC 2820 

AAAAGAAAAA TGTGCCAGAA TTGAGGTTCC CAGGGTTTGA AGGCGAATGG GAAGAGAAGC 2880 

AGTTAGGGGA TCTTACAGAT AGAGTAATTA GGAAAAATAA AAACTTAGAA TCGAAAAAGC 2 940 

CTTTAACAAT ATCCGGACAG TTAGGTTTAA TTGATCAAAC AGAATATTTT AGTAAATCAG 3000 

TTTCGTCGAA AAATCTAGAA AATTATACAC TAATAAAGAA TGGAGAATTC GCGTATAACA 3 060 

AAAGTTATTC TAATGGATAC CCATTAGGGG CTATTAAAAG ATTAACTAGA TATGATAGTG 3120 

GTGTATTGTC CTCTTTGTAT ATTTGTTTTT CTATTAAAAG TGAAATGTCT AAAGACTTCA 3180 

TGGAAGCATA TTTTGATTCG ACACACTGGT ATAGAGAAGT TTCTGGAATT GCAGTTGAGG 3240 

GTGCAAGAAA TCACGGATTA TTAAATGTTT CTGTGAATGA TTTTTTTACT ATTCTAATTA 3 300 

2^ AATATCCAAG TTTAGAAGAA CAGCAAAAAA TAGGCAAGTT CTTCAGCAAA CTCGACCGAC 3 360 

AAATTGAATT AGAAGAACAA AAGCTTGAAT TACTTCAACA ACAGAAAAAA GGCTATATGC 34 20 

AGAAAATTTT CTCACAGGAA CTGCGATTCA AAGATGAGAA TGGTGAAGAT TATCCAGATT 34 80 

GGGAAAATAG CAAAATAGAA AAATATTTAA AAGAGAGAAA CGAACGTTCT GACAAAGGGC 3 54 0 

AAATGCTTTC AGTAACTATA AATAGTGGCA TTATAAAATT TAGTGAATTG GATAGAAAAG 3600 

ATAATTCAAG TAAAGATAAA AGTAATTATA AAGTAGTTAG GAAAAATGAT ATTGCATATA 3660 

ATTCTATGAG AATGTGGCAA GGGGCTAGTG GTAAATCAAA TTATAATGGG ATTGTTAGCC 3720 

CTGCATATAC TGTGCTTTAT CCAACACAAA ATACTAGCTC ATTATTTATT GGATATAAGT 3 780 

35 TTAAAACACA TAGAATGATT CATAAATTTA AAATTAATTC ACAAGGATTA ACATCAGATA 3 840 

CATGGAACTT AAAATATAAA CAATTAmAAA ATATAAATAT AGATATACCT GTATTGGAGG 3 900 

AACAAGAAAA GATAGGTGAT TTCTTTAAAA AAATGGATAT ATTGATAAGT AAACAGAAAA 3 960 

TGAAAATTGA AATATTAGAA AAAGAGAAAC AATCCTTTTT ACAAAAAATG TTCTTATAAC 4020 

TTTGATAAAT ACATAGATTG CATAAGAATA AAATTTGTAT AATTTAACAT AAAAGTTGTA 4 080 

AAAGTAAAGT GAATTAAAAA CGAACATTAA ATTTAGGCAC TGTGAAAGCG CAGTGTCTTT 414 0 

TTTGTGTCGA AATTGTGTAC AGAATAAGTA GTTAAATAAA GATTAAGTTG AGATAAAGTG 4 200 

TTATTCGTAA ATAAAAGAGA GTAGATCGAT AGGAATTGAA TGATATTAGT TAACTATTTA 4 2 60 

TTAAATTACTT TAATAATGAT TAATTTTTAG TTAAAGTAAG TTTAATGTGA AGCACGACCA 4 3 20 

TTGCTCATTA TAATGA 4 3 36 
(2) INFORMATION FOR SEQ ID NO: 547: 

55 



30 



40 



45 



so 



1485 



EP0 786 519 A2 



(A) LENGTH: 4 87 base pairs 

(B) TyPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLCX3Y: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 7: 
^0 TAAGCTATCT GAGATAATTG CTGATAACAT TAAACCGGCA ATTTCAGGTT TAATTTCAAA 60 

GCCACGTTCT CTAAACATTT TGTATAAAAT TGTAGCTGTA CAACCAACTG GTTCAGCACG 120 

ATAACATAAA GGACCAGCAG TTTCGAAATT TGCAATTCTG TGATGATCAA TTACATGCTT 180 

AATTGTAGCA GAGGCAATCG TATCAGAACT TTGTTGGctAT TCGTTATGAT CAACTAAGAT 240 

AACATCTTGA CCATCTAAAT CATCTGTTAA TAATTCCGGA GCAGGTACAT TAAATGTATC 300 

TAACGCGAAT TGAGTTTCTG CACTCACATC ACCTAAACGG TATGCTTTGG CTCCTGAATT 360 

ACCTCGAAGT TGTTCaAATT CTGCCaTAAT AATCGCAGAT GAAATTGCAT CAGTGkCTGG 420 

aTTCTTATGT CCGAAAATAT ATGTTTTAGC CAnTGTCAAA TATCTCCCTT GTAAATTGTA 4 80 

TTCTTTA 487 
(2) INFORMATION FOR SEQ ID NO: 54 8: 



IS 



20 



2S 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 base pairs 
^ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548: 

TTGGTGGTGC AGCAGTTATA GCAATTGTTT TAGCATTCAT TGGTAAGTTC ACTGCATTAA 60 

TTTCTTCTAT ACCTACmCCA GTTATGGGAG GAGTATCTAT ATTACTTTTC GGTATTATTG 120 

CAGCAAGTGG CTTAAGAATG TTAGTTGAAA GCAAAGTAGA TTTTGCGAAC AATCGAAATT 180 

TAGTTATAGC TTCTGTAATT TTAGTTGTAG GTATCGGTAA TTTAGTATTT AACTTAAAAG 240 

45 AAATTGGTAT CAACCTTCAA ATTGAGGGGA TGGCATTAGC TGCA CT TTCA GGAATTATTT 300 

TGAACTTAAT CTTACCTAAA GAGAAAAAAC AAAACAATTA AGATTTACAA ATTAAGGAGG 36 0 

GCGCTTTTAT GAATCATTTA TTATCAATGG AACATTTATC TACAGATCAA ATATACAAAC 420 

SO TTATCCAAAA GGCAAGTCAA TTTAAATCTG GTGAACGTCA ACTACCAAAC TTTGAAGGGA 4 80 

AaTATGTCGC AAATTTATTC TTTGAAAATT CTACTCGrAC AAAATGTAGT TTTGAAATGG 54 0 

CAGAACTTAA GCTAGGGTTA AAAACGATTA GCTTTGAAAC ATCAACATCA TCTGTTTCAA 600 
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TCATTAGACA TCCGTTTAAT AACTACTATG AAAAATTAGC GAATATTAAC ATCCCAATTG 720 

CGAATGCTGG TGATGGTAGT GGACAACATC CAACACAAAG TTTACTTGAT TTAATGACGA 780 

TATATGAAGA ATATGGATAT TTTGAAGGCT TGAATGTATT GATTTGTGGA GAcATTaAAA 840 

ATTCACGTGT CGCACtAGTA ATTACCAnAG T 871 
(2) INFORMATION FOR SEQ ID NO: 549: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
,5 (D> TOPOIXX3Y: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549: 
20 TCAATCTAAT ATATACTTCA TGACTTCCCG CCAAAGGCGC AATCGTAGGA TTAGTCTGTT 60 
TATCGATAAG ATCTATTAAT ATTGTTTCTA CTTTAGATTC ACCTATTCCC GCAAATCTTA 120 

ATAGTTCAGA ATGTATAATT CGATTATGGT TTATAAAATG TGACAACAAT TCATTTTTCA 130 

2S 

CCATrGGTTG CATTTCTTTC GGTGGACCTG GTAATAAAAT AATTTGTTTG TTTTCAAAAT 240 

TCACCATCAT TCCTGGAGCC ATGCCATGAT GATTTGTTAA TACAGTTGAA CCTTCAATTA 300 

CTAAAGCCTG TTGTcTATTA TTAGGTGTCA TTTCTtGTCC TTGTTCCTCA aAAwAGCTTT 360 

30 

CAATATATTG AAAGAAGGCT CATCAATAAC TAAATCTnTA 4 00 

(2) INFORMATION FOR SEQ ID NO: 550: 

^5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 
ATACCrCCAG CTAGAATACC AGCGTATTTC ATAAAATACT TCCTCCATTC AACTATATCT 60 

45 

ATATTTAATT ATTTAAATTT CGTTGCATTT TCCAATTGAA AACTCATTTT AAAATCAAAA 120 
CTCTAAATGT CTGTGTATTA CTTAAAATTA TACATATTTT GCTTATATTT TAGCATATTT 180 
TGTTTAAACC TATATTACAT TATATCAGAC GTTTTCATAC ACAAATAATA ACATACAAGC 240 

SO 

AAACATTTCG TTTATTATTT ATATCACTTA ACTAATTAAT TTATAATTTT TTATTGTTTT 3 00 

TAAGTTATCA CTTAAAAATC GTTTGGCaAA TTCGTTGTGA CGCTTGTCCA TCTTCTAATG 3 60 
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X X X X ^ xxv\ 


X wViBn X Vai^U\X V9 


AGTGC-TGX X X 


GAX X X XLmAAL. 


AATTGGACCT 


GGCAACTCTT 


480 




1 X X XAXxvvXV- 


CATGT AA A A A 
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TTTATCTAAG 
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TCATATgCaT 


540 
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AA X AV.^L. ^ V^A 


A (^Trri A A r» A T 

A*j X L.GAA w\ X 


n A ar* A 1*0 AO 
GAGAGA X\3AG 


XAGTCGGTAA 


600 






vjv^ X wA X 1 AAv7 


TATS R AT/^/T* 

X A X AAA X Cv^G 


AAAXGXUX XL. 


a*pa a ^ otv^ a a 
A^ AAC G XajAA 


ACGTCTTTCA 


660 


10 


A a 21 iiTr*a*rf^ 


Al u X 1 L.A1 L.A 


AX AUGXu X CA 


w\ACXAAAXA 


ATG GATGGGT 


K IV IV IV IV IV ^^fV IV 

AAkAAAATAA 


720 




V>-rvx Ax 1 i ^- 


A Trr* A r^i'vzr^ 

A 1 W W^tjV^^V,* 1 


X GAL.GV..AAAG 


L.X XL X AX AX 


a a a/^TTa a^a 
AAAGX TAAGA 


f fV^^«v IV A ^PT*/^ TV ^ 

X X AAATTGAT 


780 






1 U.UAAX V.v»V» X 


X uAX CG X vJAw 


a ao*T*TV7o 
GLLAAG X XXiG 


GG GGTACaTA 


ATCAg X"! X 'X"X' 


84 0 


IS 




AATAX 1 i AAT 


CTTGTCTTAA 


•T^ TV /^/^ IV < 1 M pTV TV 

TAG CATT AAT 


ATATTCAGTA 


TCATTGCGTT 


900 




xAxvjTvSATAA 


TTTATCATTT 


CTTGGATAAC 


CTGTTTCCAA 


AATgx~xATCT 


CGACTAACAT 


960 




G AAATG CATT 


X X GAAATAT C 


^ IV ^P^^^/^/^ TV TV 

GATGTCGAAT 


TV rtv^^ TV » I«Cf » TV /"^/^ 

ATGGA X X AGG 


TGACACTAGA 


TAATCCCACC 


1020 


20 


GTTGGCTTTC 


"i''J."X~i~l~rAAAG 


CCATCTTGGT 


AATTTTGAGT 


ATTTGTTCCT 


AGCATTTTAA 


1080 




C-GTTAQ71 AA X 


ATu WiAAC CA 


ATCTTTTTTA 


ATGGCGTG C C 


ATGCCATGTT 


TGTAAGTACG 


1140 


2S 


TCGTTCGCGG 


TGATTXATAT 


Ts TV /^/^ TV TV f«/ Mll^ 

AACCAATCTG 


i>^^n/^mTv »t7v^O^ 

GTGTACGTGT 


gTTAATCATC 


CvfCGCTTTCG 


1200" 


CTCTTGGCAT 


CGCTAAAAAC 


CATTTCATTG 


AAAACTTTGT 


TV TV ^^1V TV ^Hi^^i 'II n 

AACATATGGT 


ACATTGTGCT 


1260 




GTTGGAATAT 


GTGTTCATAT 


CCmTTTCA 


CZACCCCATAT 


rri TV TV'IWIW 

TAATTGGGCA 


TCGCTATGTT 


1320 


30 


CAGTTAAGTA 


TTCATATAAT 


GCTTTGGGGT 


TGTCGCTGTA 


TTGTTTACCA 


TGAAAGCTTT 


1380 




CAAAATAAAT 


TAGATTCTTG 


TTTGGCAATT 


TTGATAGTAA 


TTTAAAAGTC 


GTATATATAC 


1440 




TATGTTCTAT 


CAATTTTTTA 


ATTGTATTTT 


TAATCATGTC 


GTACCTCCGA 


CGTGTTTTTG 


1500 


35 


TAATTATATT 


AATATGTATG 


AGC 








1523 



(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOUXSY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 

CAGTAAGAGA TTTTCTTAAT TGAAAATAAT CTTACTGCTT TTTTAAATTT AATTTCGAGA 60 

TTCAATATTA GTTTATCTCA TTGTGGCATT AATTGATAAA ATTGTTTTAA TGTTATAAAT 120 

CAAAGTCtTC TTCAACAATT TCAATGTCTG CATCAGATCT ATGATATGTG AAAAAGCTAA 180 

TTCTTATGCG GTCTAAATGC TCTAAATGGT GTCGATACTC TTCGATGGCT GCAACAATTT 24 0 
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ATGTGGACAA 


TAAATCTTTT 


TTCTGCGGAT 


TATCTAACTC 


ATAATCAACA 


TGTGTCACAT 


260 




TATAACGTGC 


TTTTTTAGAA 


AGGCTAGCTA 


AAATTTGCTC 


GTGAAAAGCA 


GTTAATGAAT 


420 


5 


CTAAATCTAA 


TTTGATTTGT 


AATAGGAAAT 


TGTTATTAAG 


TAAATATAAG 


TCGTTTTGAT 


480 




AACGCGACAA 


TTTGTTTAAT 


ACTTCATAAG 


CTTGTCTAGT 


CGTCTGAACT 


ACTTCTCTGA 


540 


10 


AAAGTATTTT 


CTTTCTATTC 


TGTTGGTGAA 


TATGTTTTTT 


TGTAATAGGA 


CGTTCTTCGC 


600 


TATAGTAATC 


ATAAATTTTC 


TCTAACrriT 


CGACACGTTG 


TTTTAAATTA 


TGACTATCTT 


660 




GTTTAATATT 


ATTAAACTCC 


GTCGTATCAT 


TTAATACTAA TTTAAACCAC ATAAAAATAT 


720 


15 


CTGAGGATAT 


ATTTAATGAA 


TTATAGTAAA 


TTTTTGTTTC 


AAATTTAGGT 


GGTAGAAACA 


780 




CAAAGTTAAC 


TAGAGATGAA 


CTTATGACAC 


CAATCATTAC 


AAGTACAAAC 


CTGTAAAAGG 


840 




CGGTAATATA 


GAAAGAACCG 


GTATGTTGTC 


CCATAATGAT 


TAATGCTGTT 


ACACTCGCCA 


900 


20 


AAGTAGCAAC 


ATGTGCTAAA 


TTAAATTTAA 


ATAAAATAGC 


AATAAGTACT 


ATGACGGTAA. 


960 




CACCCATAAT 


GATAAAATTA 


TCACTAAAAA 


TTGTTACCAT 


TGTAACAGAT 


AGTATGGCAC 


1020 




CTATAATGTT 


ACCCAATGCT 


TGATCAGAAA 


CTGTTTTAAA 


TGAACGATAA 


ATACTAGGTT 


1080 


25 


GCATTGCACA 


ACAGCACTGA 


CACCAGCTAA 


GGCTTTCAGA 


CCAACATCAT 


CCGGTAGTAA 


1140 




AGAAGCXSATA 


GACATAGCTA 


AAATAATGGC 


TATACCAGTT 


TTAAAAATCC 


GAGCTCCTAG 


1200 


30 


TCTCAAAAAT 


AATGACGCCC 


CTTTTAAGTT 


TATTGAATAT 


CTAATATTCG 


TATTCATTAC 


1260 


TGTTATACAC 


TTACTAGTTA 


CAAAATTCAA 


GCTTATTTAT 


AGTTGTTAAA 


ATAAATCATA 


1320 




CATAATACTG 


ATAGCGATGT 


AAAACTTTAG 


TCAGAGATTA 


AAATAGTATA 


AATTTGTAAA 


1380 


35 


ATAAAAACTC 


ACATAGTGAC 


ATATCAAGTT 


AAACGTTAAT 


AGTTAACGAT 


ATAAAATGAA 


1440 




TCTACTATGT 


GAGCATTTGC 


TTTATTTTAA 


TTCAATTAAA 


AATATACTTC 


CTTAAAAGTT 
TCTTCTTCTG 


1500 
1560 




ATTTCATTTG 


ACTAAAAGCA 


TAGTCTGCAG 


CTTTTAAAGT 


TTGTTTAATA 


40 


TATGTTCAGT 


TGTTAAGAAC 


CAAGCTTCAA 


ACTTAGAAGG 


TGCTAAATTG 


ATACCTTGAT 


1620 




TTAACATTAA 


TTTGAAAAAT 


TTACCGAACG 


CTTCGCCGTC 


AGAATGTTCA 


ACTTGATCAT 


1680 




AATGTGTGAC 


TTTTTCATCT 


GTAAAGTACA 


ATGTTAAAGA 


TCCATAAATA 


CGATTAATTG 


1740 


45 


TAGCTGTGAT 


ATTATGTTTT 


TCGATTAATT 


TAAGTAAACC 


TTCTTCTAGT 


TGTTGGCCTA 


1800 




AGCTGTCTAA 


TTTTTCATAA 


ACACCGTCTT 


GTTCTAGTAC 


TTCGAGTAAT 


GCAATACCTG 


1860 


50 


CTTTCATAGA 


TAACGGGTTA 


CCAGCCATTG 


TACCAGCTTG 


ATATGCAGGT 


CCTAGAGGTG 


1920 


CTACTTGTTC 


CATAATATCT 


TGACGTCCAC 


CATAGCCTCC 


AATTGGTAAA 


CCACCGCCAA 


1980 




CAATTTTACC 


AAATGCAGTT 


AAATCAGGGA 


. TAACACCTAA 


. TAAATCTTGA 


. GCGGCACCGT 


2040 
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AAATTTCATT AACCTCTTCT 


AAAAATCCAG 


GTTGAGGCAT 


TACCATTCCA 


AAGTTACCAA 


2X60 


5 


CAATTGGTTC 


TACTAATACT 


GCGGCAATTT 


CATCACCCCA 


AAATTCAATT 


GCTTCTTTAT 


2220 




AGGCGTTAAT 


ATCATTGAAA 


GGTACAGTAA 


TGACTTCACG 


TGCGACGCTT 


TCTGGAACAC 


2280 




CAGCTGAGTC 


TGGAGAACCG 


AGCTGAGATG 


GGCCGCTACC 


TGCTGCAACC 


AATACTAAAT 


2340 


10 


CAGAATGOCC 


ATGATAAGAT 


CCAGCAAATT 


TTATAATTTT ATTTCTTTTA 


GTATATGCAC 


2400 




GTGCAACACG 


AATTGTTGTC 


ATGACTGCTT 


CTGTTCCAGA 


ATTTACAAAG 


CGAATTTTCT 


2460 




CAAGAGATGG 


AATTGCATCA 


CGTAATTTTT 


TGCTGAATTC 


AATTTCTAAT 


TCAGTCGGTG 


2520 


IS 


TACCAAATAA 


AACACCTTTA 


GCAGCTTGTT 


CTTGAATTGC 


TTTAGTAATA 


TGAGGATGTG 


2580 




CATGCCCCGT 


AATAATTGGA 


CCGTATGCTT 


GAAGGTAATC 


AATAAATTTA 


TTGCCATCGA 


2640 




CATCATATAA 


ATATGCACCG 


TGTCCTTCTT 


TCATAACAAC 


AGGTGCACCG 


CCTCCTACAG 


2700 


20 


CTTTATAAGA 


ACGAGAAGGG 


GAATTGACAC 


CGCCTAGAAT 


ATATTCGTTT 


GAAAGTTGTT 


2760 




GTAAACGTTC 


ACTTTCACTA 


AAATTCATTT 


ATATCAACCT 


CTTTTAATTT 


AATATTTTCA 


2820 


2S 


TCTAATATCG 


TATCATAAAA 


TTATTATAAT 


GAAGAAAAAG 


GTGATTATAT 


GTTGCAAAAA 


2880 


GGAGAACAAT 


TTCCAATATT 


TAAATTAGAA 


AATCAAGACG 


GAACTGTCSVT 


TACAAATGAT 


2940 




ACATTAAAAG 


GTAAAAAGGC 


GATTATATAT 


TTTTATCCTA 


GAGATAATAC 


ACCTACTTGT 


3000 


30 


ACCACAGAAG 


CTTGTGACTT 


TAGAGACAAT 


TTAGAAATGT 


TCAATGATTT 


AGATGTTGCA 


3060 




GTATATGGTA 


TAAGCGGTGA 


TTCAAAGAAA 


AAACACCAAA 


ATTTTATTGA 


GAAACACGGA 


3120 




TTGAATTTCG 


ATTTATTAGT 


AGATGAAGAT 


TTTAAATTAG 


CTAAAGAAAt 


GGCGTATATC 


3180 


3S 


AGTTAAAAAA ATCATTTGGC AAAGAAAGTA 


TGGGCATTGT 


AAGAACGACT 


TTTATAATAG 


3240 




ATGAACAAGG 


TAAAGTATTA 


GATGTTATCG 


AGAAGGTTAA 


GGTTAAAACA 


CAAATAGAAG 


3300 




AACTTAAAAA 


CATTTTGGGG 


TGACATATAT 


GAAAGTTGTT 


GGGTTAAATC 


GTATGCGTGA 


3360 


40 


AGTTGAAACT 


GAATTACAAC 


AACGCTTTTC 


AGATTTAGAT 


TTTAAATTTT 


ATAAAAAAGC 


3420 




ATCAGAAATA CCTGAGAGCG 


ACTTGGCTGA 


TTTAGATATA 


TTAGTTGGTT 


ATGATGGCGG 


3480 


45 


TATCAATGAG 


GCATTTri'AC 


GACGTTGCCC 


GAATTTAAAA 


TGGATTGCAT 


GGTTTGCAAC 


3540 


GGGTGTAAAT 


ACATTGCCGT 


TAGATTATAT 


TGCAGATCAC 


GGGATACTTT 


TAACTAATGG 


3600 




AAAAGGTGTT 


CAAGCTAAAC 


AATTATCTGA 


ATACATTTTA 


GCTTTCATTT 


TAGATGATTA 


3660 


SO 


TAAAAAGATG 


AAACTATCAT 


ATGATAACCA 


ACGACAACAT 


ATATATGATT 


CGAAAATAAC 


3720 




TGGTAAACGC 


CTATCAGGAC 


AAACAGTTTT 


ATTTTTAGGT 


ACAGGTGCAA 


TTGCTACTAG 


3780 


SS 


AACTGCGAAG 


TTAGCAAAGG 


crrrrAATAT 


GAATTTAATT 


GGTCTGAGCA 


AGTCAGGTCA 


3840 
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TGCTGACATT ATTATAAATG CTTTACCAGA AACGCAAGAA ACGATTCaTT TaCTAAAGAA 3960 

AAAACATTTT GAATTAATGA AAGATGAAGC ACTTrTTATA AATATAGGAC GAGGTAGCAT 4020 

AGTTAAAGAA GCGCTCTTAA TAGAAGTATT AAAAAGTAAA GTTATTCGAC ATGCATATTT 4080 

AGATGTGTTT GAAAATGAAC CTTTGAAACC TAATCATGAA TTATATGAAT TGGATAATGT 414 0 

AACTATAACA GCGCATATAA CTGGTAATGA TTATGAAGCA AAGTATGACT TATTAGATAT 4200 

TTTTAAAAAC AATCTAGTTA ATTTTCTCAA TAAGAATGGT CTAATTGAGA ATGAAGTTGa 4 260 

TGCTAAAAAA GGCTATTAAA TGArATCATC ATGTAAATAT TGACACGCGC GCAATACTAC 4320 

AGTTATATTT aTAGTAAgTt AATaATgATT ATATAAGAaA GATGGTgATA TAGATGAGTG 4380 

TTGAAATAGA ATCAATTGAA CATGAACTAG AAGAATCAAT TGCATCATTG CGACAAGcAG 444 0 

GCGTAAGAAT TACACCTCAA AGACAAGCAA TATTACGTTa TTTaATTTCT TCACATACTs 4 500 

20 ATCCAACASC TGaTGaAATT TATCAAGCAC TTTCACCTGa TTTTCCAAAT ATAAGTGTTG 4560 

CGACAATATA TAATAACTTA AGAGTGTTTA AAGATATTGG AATTGTAAAA GAATTAACAT 4 620 

ATGGAGACTC ATCAAGTCGA TTCGACTTTa ATACACATAA TCATTATCAT ATTATATGTG 4680 

AACAATGTGG TAAGATTGTT GATTTTCAAT ATCCACAGTT AAATGAAATT GAAAGATTAG 4740 

CTCAGCATAT GACTGACTTT GACGTAACAC ATCATCGAAT GGAAATTTAT GGAGTTTGTA 4 800 

AAGAATGCCA AGATAAATAA TTTAACTTTG GTAGTATGAC AAATTAAAAA AGCGTTACTw 4 860 

ACTTCATATA AGTAAGCGTA ATATTTAAGA nGTTAAACGA CATGaAAGTt GTTTAACTTT 4920 

TTT 4923 
35 (2) INFORMATION FOR SEQ ID NO: 552: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGY: linear 



2S 



30 



45 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 

TCCGGCTTTA AAAACTTTTC CCAATTCCAG CTTGGGCCTT TGGCATTAAT ATTAACCTCC 60 

TGGTTCGGAT TAATTGGGAA CCTAACCnTT TAGGCAATAA TTGGTTTAGG CAACTTCCAA 120 

ATTGGTGGTT CAACCAACGT CTTTGGATAC CyTGcTCATT TAATTCTAAA ATGgTyrGAA 180 

CGCATTTTGG TACCCAAAAC GgTGACGTTC GTTTGCACGG TCTAATAAAT TGTCTAAGTT 24 0 

GTCGATTGGT TTCATTAATT GAaCACCATT TTGCTCTTCA GCAAGACdT TGTCTACTTT 300 
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GCGATCCATT GTTTGTTCGA AAAGAATAGC ACCCAAAATT TTATCTGGTC AGAATGAAGG 
TGAAGTTACC ACACGTGTAC GCATATCGTG AACAAGTTGG AACATTTCGT CTTCATTGCT 
ATATTGATCT TCGTTTACAC CATATTCTTT AAGTGCTTTT GGTGTACTAC CACCACTTTG 
GTCTAATGCG GCAATAAAGC CTTTTCCATT TTTCATTTTT TCTAATTGCT CTTTATTCAT 
ACTTTCCACT CCTTAACTTT TCAATACACC TCCAGTATGA TAAAAATGAG AACATTTCTC 
AAGTCATAAA CCTTGAAAAG TGTATAAAAT GTGAAAAATA ATTGTCAGTT TAATTAAAAA 
TATTATTTTA TTCTAGGTAT GACTAACGCC ATTAATGACA TAAAGAAAAT ATGTGTAATA 
ATCCAACCGA TTAATTCTGT CACACTAAAT TGAAAAATTG GACGTTOCGC AATAAATACT 
AAAAAGGGAT ACAATGCTAT AAATAAGAAA AATAAAGGGA TATAACATAG ATAGTAAAGC 
CTTTTAGAAG TATGAAA 

(2) INFORMATION FOR SEQ ID NO: 553: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic, acid 

(C) STRANDEDNESS : double 

(D) TOPOXx:x5y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553: 
ACTCCCATAT CCACTGGCAA GATTTATAAT TTGTAATTCT AAAGTATAAT AGCCTATATT 
TTTCAAAATC TCTTTTTTTA TGATTAACAT CCCTCTAAGT GCACCTAAAG GTTTTTATGT 
AATGTATTAA ATATTTCCAT TTATACATAT ACATCTTGTA TAAAAAGAAA GAACTCCATA 
TATACTCAAA AGGTATACTG AAGCTCTATA TATTATATAA AATTATTATA CTATTTTGAA 
AAAGATAAGT AAATCACTAA ACATGATTTC TTTCACTTTT AGAAAACTTT TAATACTATA 
AAAGCACCCA CTCAGTCACT AGTTTGGGCA GTTATTGTAT GCCTATTGaA CTCAATGCgT 
ATATTACAAT ACCTTTTtCG CATATTCATA TAAGacTTTG CATCTTTAAG CTTAATTGCT 
ATCTCTTTCT CT 

(2) INFORMATION FOR SEQ ID NO: 554: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TTTTGCTTTA 


TGTTAAATGG 


ATTATTaTTA 


GAATAAAAAA 


TCGGTGATGA 


GCTAAAAAAG 


60 




TGTGTAGGAT 


GTTTTCCkAA 


cccAA'rrnT 


ACATCCGAAG 


ATATCGAACA 


ATATCTTCCT 


120 


5 


ACTTCTACAT 


TATTAAAATC 


ACTACCAAAT 


CCAATATAAC 


TGTATTCACC 


AATGTGAGAA 


180 




TTCCTGATTT 


TACACCATCT 


ATCTATATAG 


TTATTGCCAT 


CAAATTTTGA 


GTTTGTAATA 


240 


lU 


TACGCCAAGC 


GATGAATCTT 


AACATTCGAT 


TCTTTAGAGG 


ACTGGTTTTT 


CAGCAAACCA 


300 


ATTATCTTTT 


CAATCGCTAT 


CCTCATCGTC 


ATTTCCTCCA 


AGTATTTCGA 


TTGATAATAT 


360 




CTTTATAGCT 


TTGAATAATT 


TTAACTACCT 


TTGTCGAAAC 


GTTAGTGTCT 


TTATAATCAA 


420 


15 


TAGCATCAAT 


CATCGGTTCG 


TTATTGTTTT 


GCATCTCTCT 


TGCTAGTTCA 


ACGGATTGGA 


480 




TTAGATTGTT 


ATAGGTAATA 


CCACCTACAA 


TAACCGTACC 


TTTATCTAGT 


ACTTCCGGTC 


540 




TTTCTGTGGA 


AGTTCGAATA 


AGGACACCAG 


GGAACTTCAA 


AATAGACGAC 


TCTTCTGACA 


600 


20 


ATGTTCCACT 


ATCTGATAGC 


ACAACAAATG 


CATCITTTTG 


CAATGCATTA 


TAATCAAAGA 


660 




AACCAAATGG 


CTTTAACTGT 


TTAACTAATG 


GATCAAATTC 


AAATTTACTT 


TCTTCAATTT 


720 




TCTTCCAACT 


TCTTGGATGC 


GTTGAATAAA 


TCACAGGCAT 


TTTATACTTT 


TTGGCAATAT 


780 


25 


CATTTATCGC 


ATTCATTAAT 


GATTTAAAAT 


TCTTTTCATT 


ATCGATATTC 


TCTTCTCTAT 


840 




GCGCAGaTAC 


TAAAATGTAT 


TGTTGCGGTT 


CTAATCCTAG 


TTTATTTAAA 


ACGTCACTGT 


900 




GATTAATTTT 


ATCTCGATGC 


GCTTCTATCA 


CTTCTGTCAT CGGTGATCCT GTyACAAAGA 


960 


TATTCGCTTT 


ATTGAAGCCT 


TCATCTAATA 


AATAACGTCT 


GCTATGTTCC 


GTATAAGGTA 


1020 




GATTCACATC 


ACTGACATGG 


TCAACAATTT 


TACGATTGAT 


TTCTTCAGGT 


ACATTCTGAT 


1080 


35 


CAAAGCATCT 


ATTACCCGCT 


TCCATGTGGA 


ACACAGGAAT 


CTTTAATCGT 


TTAGCAGATA 


1140^ 




CTGCTGCTAA 


ACAACTATTT GTATCACCAA 


GAATTAAAAG 


TGCATCTGGT 


TGTTCGCGTA 


1200 




ATAAAACATC 


ATATGTCTTC 


GCAATAATAT 


TCCCCAtCGT 


TTCTCCAnGt 


TACTTCCAAC 


1260 


40 


TGCCTCTAAG 


TAGTGGTCCG 


GTTGTCTTAA 


TTCCAAATCA 


TCAAAGAAAA 


, TTTGATTCAA 


1320 




TGTATAATCA 


TAATTTTGAC 


CAGTGTGTAC 


TAATATCTGA 


. TTAnAAATAT 


' TGAT 


1374 



(2) INFORMATION FOR SEQ ID NO: 555: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

so 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555: 

55 
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20 



2S 



30 



3S 



40 



TTACCTCTAG 


GCATGTCCCT 


TTCACGGTTT 


GCTTATGATA ACGTTATCGA 


CATTGTTAAA 


120 


TTGTCCTTTT 


TGAAAAAATA 


ATTCTGGTGA 


GATAGACGCT 


ACTTGGaTTT 


CaTCaGTTTG 


180 


TAAtAACGCA 


GtATAAtTAC 


CaTTACTAAA 


tTGaGTTAAT 


CGTTCaTATA 


AAGTACTAAT 


240 


AGGATAATAA 


ATGTTATCCG 


TTAAgCGCGC 


CGTATAGTTC 


ACTTGATACG 


TTTCGCCTTC 


300 


AACAATTGCT 


TGTTGGACAC 


GTTTAATATT 


AGTCATCATA 


ACCTCAGAAG 


ATTCAACAAA 


360 


TGAAAAATGA 


TACTTTGATA 


CATAAGAAGT 


TTGATGTTCA 


TATGTTGAAT 


TTATGCTTTC 


420 


CGCrriTTCA 


AAACTATAAG 


CTGCTGCATA 


AATATCATCT 


TTAGCTAATG 


AATGTGTACA 


480 


CATAGCATGA 


TTAAAATACT 


TTGCCGCTTC 


GTAACTTAAA 


TATAACGAGA 


CATATCTACC 


54 0 


TTGTCGTkGt 


GCGCTTGTGC 


AAAGTGTATC 


ACTTCTCCCA 


CATCAGCCAA 


CTTAGTAGCA 


600 


ACATACTTCT 


TTATAAATCC 


CTTTAATTGA 


ATATGGTATT 


GCTTATATTC 


ATTTTCAGTT 


- 660 


AAATAGTAGC 


GATAATTATA 


TTCTATTCTC 


ACAGTAATCA 


CCTACCTTCG 


ATAAAAATAA 


720 


TTCAACTTGT 


CGATAACCGT 


ATTCACTCAA 


AATAGATTCA 


GGATGATATT 


GCACACCAAA 


780 


AACCGGAAAT 


CTAATATGCT 


CAAATGCCAT 


AATAATCGCT 


TCATCGTTTT 


TTG CTGT AAT 


840 


CTTTAAGCAA 


TTTGGAAAAG 


TCGCTCCGTC 


AGCAATTAAT 


GAATGATAAC 


GCATTACATT 


900 


GAAATTTTGA 


GGCAGTCCTT 


GAAAAATACC 


TTCATTGGTA 


TGGCGTAACT 


GTGTAGTATG 


960 


TCCGTGTACA 


GGATGATAGC 


CGTGAATGAT 


ATTTCCACCA 


AAATAAGACA 


CGATACATTG 


1020 


AAATCCTAAA 


CATACACCTA 


GTATAGGTAC 


ACGCTGATAA 


AATTGTTCTA 


ACACTTCATT 


1080 


CAAGATAGGA 


TAATCATCCG 


GaTTACCCGG 


CCCAGGCGAA 


ATAACAATTG 


CTTTTGGCTT 


1140 


CATATTAATG 


ACGTCTTCTA 


TCAGCAGATT 


ATCAATACCA 


ACAACTTGAA 


CTGTTAGTTT 


1200 


CGTTTGAGTC 


TTAATATAGT 


CTATTAAATT 


ATATGTAAAT 


GaATCATTAT 


TATCTATGAC 


1260 






TTCTAAATGT 


GTTTTATTTT 


TATAATATGT 


ATTGGATGTA 


1320 


GCTAAAACTT 


TAAAAGCATT 


GTCATTATCC 


TGACCTTGAT 


TTTAACTAAT 


ATATGGTATA 


1380 


TTCTATTCAT 


CGTACATAAA 


TGAATATCAG 


AGGTTCCTAG 


CTGAAACCCT 


CTATAAAAAA 


1440 


CTAGGCCATT 


GAAATTTCAA 


ACATTCGTTG 


GG 






1472 



45 

(2) INFORMATION FOR SEQ ID NO: 556: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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AGAACAGCAA GGATTACTTA CTGAGGAATT AAAGAAGGAT ATTTTAAAAC AGAACAAATT 60 

ACAACGTGTT GAAGACCTAT ATAGGCCTTT TAAACAAAAG AAAAAGACAA GGGCAACTGA 120 

GGCGAAACGT AAAGGGTTAG AGCCATTAGC GATATGGATG AAGGCACGTA AACATGAAGT 180 

CTCAATTGAA GAAAAAGCAC AACAATTTAT AAATGAAGAA GTGCAATCGG TTGAAGATGC 240 

TATCAAAGGT GCACAAGATA TTATTGCGGA ACAAATTTCA GATAATCCTA AATATAGAAC 3 00 

AAAAATTTTA AAAGATATGT ATCATCAAGG TGTGTTAACT ACATCTAAAA AGAAAAATGC 360 

TGAAGATGAA AAAGGTATTT TTGAAATGTA CTATGCATAT AGTGAGCCAA TTAAACGCAT 4 20 

TGCTAATCAT AGAGTTTTAG CTGTTAATCG TGGTGAAAAA GAGAAAGTAT TATCTGTAAA 480 

GTTTGAATTC GATACGACAT CAGTAGAGGA TTTCATTGCA CX3TCAAGAAA TCAATCATAA 540 

TAATGTAAAT CGCAGTTATA TTTTAGAGGC GATTAAAGAT AGCTTGAAAC GCTTAATTGT 600 

20 CCCTTCGATA GAGCGTGAAA TCCATGCTGA TTTAACTGAA AAAGCTGAAA ATCATGCAAT 660 

AGATGTTTTT AGTGAAAACT TAAGAAATCT ATTACTGCAA CCTCCAATGA AAGGTAAACA 720 

AATATTAGGC GTAGATCCAG CATTTAGAAC AGGTTGTAAA TTAGCAGTCA TTAACCtATT 780 

CGGTACTTTT ATAGCAAAAG GTGTGATTTA TCCGCATCCA CCAGTTTCTA AAAAAGAGGC 840 

AGCAGAGAAG GATTTTGTAC AAATGGTTAA AGCGTATGAT GTGCAATTAA TTGCAATTGG 900 

CAATGGTACT GCAAGTCGTG AAACAGAACA ATTTGTTGCA GATTTAATTA AAAAGCATCA 960 

GTTGCCAGTA CAATTCATCA TTGTCAATGA AGCGGGCGCT TCAGTATACT CAGCATCAGA 1020 

AATTGCTAGA GATGAATTTC CTGATTTTCA AGTG 1054 
35 (2) INFORMATION FOR SEQ ID NO: 557: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 1057 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2S 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557: 
AAATGTCAGA ATACAAGAAA AAAATAATTG AATTAATTGA AAGTAATTTA ACAGGATATG 60 
AAATTTCTAA AAAAACTGGA GTTTCTCAAT ACGTACTTTC ACAATTAAGA CAGGGCAAAC 120 
GCGAAGTAGA TAATCTAACC CTGAATACAA CAGAAAAATT ATATGAATAT GCCAATAAAG 180 

SO 

TTTTGTAATT TAACTAATGT ATAAATTAAT CAAGCTATGT TTATTTGATT TAACTATTAA 24 0 

TAAAAATCAT ATGGTGAATG GATATTATAA TAATTAAAAT ACAAAAATAG TAGATTCCAA 300 
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5 



10 



20 



AAGGGAAAAT 


AAGTGTTAAG 


TTTTAAATGA 


TAAAAAAGAT 


TGGAATGGAT 


CGTCTTGAAA 


420 


TGCTCCCTTC 


AAAGTTTTCA 


TTTTTTCAAT GTCGACTTCX3 AAGGGGGCAT TTTCATTAAA 


480 


TTGTTATAGC 


TTTTTATATT 


TGTATAATGA 


ACATATAAGT 


TTAAGAAGGT 


GCGAGTGAAG 


540 


GPAATPiPiPiPiA 


AGCTCAAATG 


TACCAAATTG 


TTAATCTTAA 


TAAATCTCTA 


CTTTATAAAG 


600 


ATTGAATGGA 


CATTCGAGCG 


TTAATCAGTC 


AGGAGGGACT 


TTCCCTCCTA 


CAATTTAATA 


660 


ATAATACTTG 


CTTCACCACT 


ATACAAGGAG 


TGAGTTGTTA 


TGTTCAAAGT 


GAATTATTCG 


720 


ATTTTAAGTT 


ATTATCCAGA 


ATATAATATC 


GCAGTAAGTT 


GGCAACGTTT 


AAGAGAAGGA 


780 


AAAACAATAA 


AAAACAAGAT 


TTAATACTGC 


TGCGTCATGA 


GGCGCTTGAA 


CATTATTTGA 


840 


TGAATAAGTA 


TAATTTCAAC 


TATGATTATG 


CACATAAAAT 


TGTATCAAAA 


AAATACGATT 


900 


ATTCAATTTT 


TATAAAAaAG 


AAGGTGGATT 


AAATGCTTAC 


ATTAATAAAA 


TTGGAAAGAA 


960 


GATGaACAGG 


tTATAATATA 


TGraTATATa 


CCTGaAGATG 


aTATAAGTAC 


CGGGTAAAGG 


1020 


GTCCCGTACC 


TTTTTAATTA 


AAAAAGTTCC 


AGGGGGT 






1057 



(2) INFORMATION FOR SEQ ID NO: 558: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 
{ D ) TOPOLOGY : 1 i ne ar 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558: 



45 



SO 



CTGTGCTGTA 


TTTACtTTAA 


TTTGACGACA 


TTTGAGAAGT 


ATTATGATGG 


nTGTAGTTAT 


60 


TTTATGAAAG 


TAATGTATTA 


ACAATCGAAT 


TACTAAGTCT 


ATATTGGTAG GAATATCGAT 


120 


GTTTAGTTTA 


AATGGAATGC 


ACTATTTAAG 


TTTTTAATAT 


GGAGATGTTT 


GTGACTTTTC 


180 


GATGATTAAG 


ATTTTTATAG 


GTGTGCATCA 


TTTCAAATAA 


ACTTTGTGTT 


TAAAATTGAG 


240 


CTTAGGAAAT 


CGATAGGTTT 


AGATGAGGAT 


ATTGTTGAAG 


TTATGTGTCT 


TGTATCCTTA 


300 


GTTGTTATAA AAGCGCAAAA 


AATAGCACCG 


CTTTCTCTTT 


ATCTGTGTAG 


AAAGGATGCT 


360 


ATTATTGTAA 


AACAATAGTT 


TTAATTTnAT 


TTTCTGATaT 


ATCATATGTr 


ATTcTACCTG 


420 


TATCAATTTC 


TATCGAATTA 


TAACCATCAA 


AATTATCAAC 


TTTATCATCA 


AAGTCTATCA 


480 


CTTTTCCAAT 


TAATATTTTA 


TTATTAGTAA 


GCGTTAATTT 


GACTAATTTG 


CCTATGTATG 


540 


ACTGTAAATT 


CATATTTAAT 


CACTCCTTTT 


TAATATACGG 


AACTACATGA 


AAACCAGTTT 


600 


TAGAATAATG 


AACCTTACCT 


AATTTCGTTT 


CAATATATTT 


ACCATTCACK TAAGATTTAC 


660 
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AAACTGGAAT ATCAATAAAT TCTTTTTTAG TCATCTTTTC ACTTTCATTA GAATCTATTA 780 

TAGTATAACT TGGTAATAAA GATGAATTAT TTTTCTTATT ATTATATTCA TTTGTGCATA 840 

^ AAATATGACG CTTTTGCTTA ACATTATTCA ATTTCATCGT CATTTTACCA TTACTTATCA 900 

TTTCAAGCAT TTCTTTTTTA GCTTTTTGTA GAAAGGATGC TATTTTGTAA TTATTTTAAT 960 

AATCTAATAC TTTTTATCTC AGTTTCATCA AACGAATAAA CAGCAAAATC AGTCTGTATA 1020 

10 

TCTACAACAA GATTTCCTGT TTCGCTCTCG AATTCATTTT CATAGTCAGT TACAAACCCT 1080 

TCGTATTCTT CATTATTCAA AAGCGTGATA ATAACATCTT TACGATATGC ATCTnCAATT 114 0 

^5 CTCAATACTT TTCACCTACT TATCAATATA AGGTACTATA TGaGcACCTG GTCTTCGAAT 1200 

AATGGCACTT CCCTTTCTGG TTTCAATATA CACATTTTCG ATATGTATTT TTCCAATAAT 1260 

TTGATTAAAA TTAATAATCT CTTTCAAATC AAATCGCTCA TCACTTAATA TTAGATTGCC 1320 

TGTTGACATT TTTTCTCTTA ACAATTCATT CAATAAATCT ATAGAAAGTA TTGTATAGCT 1380 

AGGCAATTTC TTATTATTTA AAATGGCTCT TTTTTTATTT TCATTATATA GGTGATGACC 144 0 

TAACATATAT CTATTTTATT TTCCGCAATT TATTTCTATT TTTATTTTAC CATTTTTAAT 1500 

25 

CATTTCTTTC ATCTGGTTTT TAGCTTTTtC CTGtAATTAT GCTTCTTTTA CTTCTACTTG 1560 

ATATTTACCt TCACGCTCTT TAAAGAACTT GTCCCGCCAA TTGCCAACAT GTGGCACTGT 1620 

GGTACTTCTA CACCAAGGAT GCATAGGTGG CGCATTCACA CCTGGTATCA TATCTTTAAC 1680 

30 

TTTAAATATT TTTCCGTTAA GTGAATGACA TAATTTAGAT GTTTTACTAT CTATTTTGGC 174 0 

AACATATTTA TATTCGCCAT CTTCACCAAG TTCTTTTAAA TATGTTAACT TTTGTGCTTC 1800 

35 TGCATTTTCA GTAAATAGTT AAAAAAGCGT ATAAAAATAG CACCACTTTC TCTTTAkCTG 1860 

TCTAAAAAGG ATGCTATTTA TCTTTTGAAT TTGAATTCTT TTTCGCTTTT TCTATACTTT 1920 

CAAATTCTTC AACTAATTCT TTAAAAGATT CACTCAATTC TTTTGCAGTT ACATTTCCAT 1980 

CTAATTGTGA ATCTAACATA ATTAAAATCA TCTCACTTTA TATTTAATCa TATTTATACT 2040 

ATAAAGTTTT TTCAATAATT TTTCAATATG GCTATCATTA ATGATATTAA TATGCGTAAA 2100 

ATATTTAGCA CAAAATTTAC TCACTATTTT ACCATGAAAC CTATTTGACT TGGTAATAAA 2160 

45 

TTTTACTTGT CCCTTATTAG TAACGATTGT CATTGATTTT ATTGATGGAT GCTTAAAAAA 2220 

TGTAAATAAA TCATATTCTG AAAATCCTGA CTGTCCAGGA TGGTTATGTA ACATAACAAT 2280 

TGAATTCGGT TTACTGTTAA ATAATAATTC GGTTGCTTGT TACCCTGGCA CAAAAGATAC 234 0 

SO 

ACTATCTTGA TTGACATATA CTTTTGTAAA TTTACCATCT TTTAACAAAT AAGCTACTTC 24 00 

ATTGCTATCA TTGTTTTCCA TGAGATATAC CTCCGTTTAT AGtCTGTCGA CTGATATTCC 24 60 
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GAAGGTTAAA 


ATTTTTAAAG 


TAAACTTTCT 


TTATCCCCAT 


GCTACGAGTT 


CAGATTCAGG 


2580 




AAATAGCCCT 


TTACTAGTAT 


TTATGTATAT 


TCTGTCTATG 


GCATGAATAA 


AATAATTATC 


2640 


5 


TCTTGTATTT 


tTTTCTAAAC 


TAGATTrrrc 


AGCATTGATA 


ACTTCAAGAC 


TATCTATATC 


2700 




CATTTGAATA 


ATACCAGGCT 


TAATATTTTC 


ATCATTATTA 


GGAAAATATT 


TATATGTAAC 


2760 


10 


ACTTTTATCA 


TTAATTTCTT 


TTATTTTTAA 


TATTAGCAAT 


CATTTCCACC 


TCTAATTAAT 


2820 


TAAAATACTA 


TAATTATATT 


TTATTTCTGT 


AAGTTTATGT 


GCCTCTATAT 


AGTGTAAATT 


2880 




ATACTTATTC 


ATTAGATAGT 


GTTCAAGAGC 


TTCATGTTTC 


TACATTATTA 


TATCCATTTT 


2940 


IS 


TTTAATATTT 


TTCCCTTCTC 


TTAAACGTTG 


CCAACTTTGA 


GCCATATAAA 


AGTCAGGATC 


3000 




AAAriXiTlTA 


AATCCACTTT 


CTAATAAATA 


CTTATTTTCA 


AATATATGTT 


CATAAACTCT 


3060 




TTGAATTAAA 


TTTTTATTTA 


TATTAgTATT 


TTTAGCAATT 


TTAGAAATCT 


CTATCTGTTT 


3120 


20 


ATCTCGATTT 


CTAACTGAAT 


TATAATAAAT 


TTGAGCATGT 


CTGTTCCTTT 


TGATACCGTA 


3180 




TTCATCACTT 


TTATTATTAA 


GTGCACCTGA 


TTCAATAAAA 


CAACCTTCTA 


CTTGATATTT 


3240 




ACCTTCACGC 


TCTTTAAAGA 


ACTTGTCTCG 


CCAATTGCCG 


ACATGTGGCA 


CTGTGGTACT 


3300 


25 


TCTAOICCAA 


GGATGCATAG 


GTGGCGCATT 


CACACCTGGT 


ATCATATCTT 


TAACTTTAAA 


3360 




TATTTTTCCG 


TTGAGTGAAT 


GACATAATTT 


AGATGTTTTA 


CTATCTATTT 


TGGCAACATA 


3420 


30 


TTTATATTCG 


CCATCTTCCA 


CCAAGTTCyT 


TTAAATATGT 


TAACTTTTGT 


GACTCtTCyT 


3480 


TTTCAACGAA 


TAATGAAAAA 


AGCATATAAA 


AATAGCATCG 


CTTTCTCTTT 


ATCTGTGTAG 


354 0 




AAAGGATGCT 


TTAATACCaT 


GCTATTTTAT 


AATTTT CGGG 


AAATTcTTG C 




^ O V V/ 


3S 


AGTCTCTTAC 


TACAGAAAAA 


GACTTATTAC 


GATATAACAT 


AAAATATTCT 


TCATTTTCTA 


3660 




TTTCAGATAA 


TGAGAAATCT 


ATAACCTCTG 


CATCTTTTTT 


ATTAAAAGTT 


ACTGAACCTT 


3720 




TACCGTTACT 


TATATCATCT 


TCAGGTATAT 


ATTC 






3754 



40 (2) INFORMATION FOR SEQ ID NO: 559: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



so 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559: 
ATTTAACTAA AACTATAaAT AATCAAATGA TATTGGAAGA TATTaGCATA GATATCGAAA 60 
AAGGTAAATT GACTTCTTTA ATTGGACCTA ATGGTGCGGG TAAGAGTACT TTACTTTCAG 120 
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CTC3ATTATAA AAATAATGAC TTGTCGAAAA AAATATCTAT ATTAAAACAA ACAAACCATA 24 0 

CTGAAATGAA TATTACGGTA GAGCAGTTGG TAAACTTTGG ACGATTCCCT TATTCTAAAG 300 

GTCGTTTGAC GAAAGAGGAT CATGATATTG TCAATGATGC GCTAGATTTG TTGCAACTAC 3 60 

AAGATATCAG AAATCGTAAT ATTAAGTCAT TATCTGGTGG ACAACGTCAG CGTGCATACA 4 20 

TTGCAATGAC AATAGCACAA GATACTGAAT ATATTTTGCT AGATGAACCA TTAAATAATT 4 80 

TAGATATGAA GCATGCTGTT CAAATTATGC AAACGTTAAA AATGTTAGCG CATAAAATGA 54 0 

ATAAAGCXSAT TGTCATTGTG TTACATGATA TTAACTTTGC GTCCTGTTAT TCAGATCAGA 600 

TTGTAGCATT GAAAAACGGA CAACTAGTTA AGTCAGATTT GAAAGATAAT GTCATTCAAA 660 

GTAGTGTTTT AAGTGATTTA TATGACATGA ATATTCAAAT TGAACATATA AGAAATCAAA 720 

GGATTTGTTT ATATTTTAAG GATTGATAAT TTGGAGaCAC TTTAAAGGGG TGATGCGCCA 7 80 

^0 ATTAAAGAAG GGTTAAACGT AAAGCATTTA TTTAT 815 

<2) INFORMATION FOR SEQ ID NO: 560: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 919 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : doxoble 
(D) TOPOLOGY: linear 

30 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 560: 
GAAACGAATA ATAAATTTAC TGAGTTATTA GTTGAAAAAG CTAATAAACA TGATGATGTT 60 

35 CTCGATAmGT TGATTAATAT TTTAAAATAA GCGATACACA CTACTAAAAT TGTATTATTA 120 

TTATGTTHAA TTGACnCCTC CTAAATTTGC AAAGATAGCA ATTTAGGAGG CGTGTTTATT 180 

TTTATTGACG TCTAACTCTA AAAGATATAA ATTAGACATT TACAAATGAT GTAAATAACG 240 

CAATTTCTAT CATCGCTGAT AACAATTCAT GGTTTAATAT GCAATGAGCA TATACTTTTT 300 
AAATAGTATT ATTCACTAGT TTTAACAATC AATTAATTGG TATATGATAC TTTTATTGGT 360 
TATTTTTATC CCATAGTGTG ATAATTACTA TTTTTCATTC ATAATAAAGG TTTAAAGCAT 420 
GTTAATAGTG TGTAAGATTA ACATGTACTG AAAAACATGT TTAACATAAT GATATAAGGA 480 
GTGACGTACA TGATCCGTCT AGGTAAAATG TCAGATTTAG ATCAAATCTT AAATCTAGTA 540 
GAAGAAGCAA AAGAATTAAT GAAAGAACAC GACAACGAGC AATGGGACGA TCAGTACCCA 600 
CTTTTAGAAC ATTTTGAAGA AGATATTGCT AAAGATTATT TGTACGTATT AGAGGAAAAT 660 
GACAAAATTT ATGGCTTTAT TGTTGTCGAC CAAGACCAAG CAGAATGGTA yGnyGACATT 720 

SS 
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25 



30 



TATAAAGGAG CTGCTACAGA ATTATTCAAT TATGTTATTG ATGTAGTTAA AGGACGTGGT ' 840 

GCAGAAGTTA TTTTAACGGA CACCTTTGCG TTAAACAAAC CTGCACAAGG TTTATTTGCC 900 

^ AAATTTGGAT TTCATAAGG 

(2) INFORMATION FOR SEQ ID NO: 561: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561: 
ATCATATAAA CCGGCTGCTT CTAAAAACGA AAATACTGTT ACTGGACCTA AAAATTTAAA 60 
20 CCCGTATTGT TTTAAATCTT TAGATAGTTG TGTTGCAGTA TCATCAACTG TGATACGATC 120 

AGAAGCATGT TCATACTGCA AATCTTTAGG CTTACCATTT ACATATGACC ATAAAAATTT 180 
ACTAAAACTA CCATATGCTT GTTCAATTTT TAAATACCCT TGAGCTTGAT TAACAATTGC 240 
TTCTAATTTT TTACGATGAT GAACGATATT TGGAAAAGTC ATTAAGCSGT CGATATCTTG 300 
AGCGGTCATT TGTGCTACCT TTTCTGGTTC GAAATCATAA AATGCTTCTT CATAGGCTTC 360 
TTTCTTTTTT AAAATAGTTA ACCAAGATAG CCCAGCATGT TGTGaTTCTA ATGCTAAAAG 420 
TTTAAACAAT GCCTTGCnAT CATAGAGCGG TTGTCCCCAT ACATGGATCm TGATAGTCTA 4 80 

AGTAGACTGG GATCTTTAGT ACCAAATGCG CATTCATT 518 
(2) INFORMATION FOR SEQ ID NO: 562: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 



SO 



CTTATTTAAA 


AGTAAATCAA 


TCAATGTATT 


ATAATCCGAA 


TAGTCCGCAT 


AAAGCTGGTT 


60 


TGCaGGCAAA 


tCAATTACTA 


CAACAAGCAA 


AAACCCAAAT 


TAATGCAATG 


rXTaATTCAA 


120 


AAACAAATTA 


TGATGTTGTA 


TTCACTAGTG 


GTGCACTGAA 


TCCAATAATC 


TTGCTTTAAA 


180 


AOGTATTGCC 


TATCGTAAAT 


TTGATACAGC 


GAAGGAAATA 


ATTACATCCG 


TGTTAGAGCA 


240 


TCCGTCCGTA 


TTAGAGGTTG 


TAAGATATTT 


GGAAGCACAC 


GAAGGATTTA AAGTTAAATA 


300 
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